Tuesday, November 30, 2021

R_Learning

Finishing up the AI lecture on Learning, below reinforcement Learning.


The agent - represented by the yellow dot - learns to move to the green space
at the expense of a penalty for giong to a red square, and otherwise a reward.


It is a Markov Decision Process, similar to a Markov Chain.

 

No comments: