Finishing up the AI lecture on learning. Below is reinforcement learning.
The agent, represented by the yellow dot, learns to move to the green square.
It receives a penalty for going to a red square and a reward otherwise.
The setup is a Markov Decision Process, which is similar to a Markov chain but adds actions chosen by the agent and a reward for each transition.
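To make the idea concrete, here is a minimal sketch of how an agent could learn such a grid. It is not the lecture's code. It assumes tabular Q-learning, a 3x3 grid, the positions of the start, green and red squares, reward values of +1, -1 and 0, and that stepping on the red square ends the episode. All of these, and the names `step`, `train` and `greedy_path`, are made up for illustration.

```python
import random

# Assumed layout (not from the lecture): 3x3 grid, one green goal, one red square.
ROWS, COLS = 3, 3
START, GREEN, RED = (0, 0), (2, 2), (1, 1)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply one move; walls keep the agent in place. Returns (next_state, reward, done)."""
    r = min(max(state[0] + action[0], 0), ROWS - 1)
    c = min(max(state[1] + action[1], 0), COLS - 1)
    nxt = (r, c)
    if nxt == GREEN:
        return nxt, 1.0, True   # assumed reward for reaching green
    if nxt == RED:
        return nxt, -1.0, True  # assumed penalty; red also ends the episode here
    return nxt, 0.0, False


def best_action(q, state, rng):
    """Greedy action for a state, breaking ties at random."""
    best = max(q[(state, i)] for i in range(len(ACTIONS)))
    return rng.choice([i for i in range(len(ACTIONS)) if q[(state, i)] == best])


def train(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {((r, c), a): 0.0
         for r in range(ROWS) for c in range(COLS) for a in range(len(ACTIONS))}
    for _ in range(episodes):
        state, done, steps = START, False, 0
        while not done and steps < 100:
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))   # explore
            else:
                a = best_action(q, state, rng)    # exploit
            nxt, reward, done = step(state, ACTIONS[a])
            # The next state depends only on the current state and action (Markov property).
            best_next = 0.0 if done else max(q[(nxt, i)] for i in range(len(ACTIONS)))
            q[(state, a)] += alpha * (reward + gamma * best_next - q[(state, a)])
            state = nxt
            steps += 1
    return q


def greedy_path(q, max_steps=20):
    """Follow the learned values from START without exploring."""
    state, path = START, [START]
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q[(state, i)])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

Running the script prints the sequence of squares the trained agent visits. The discount factor `gamma` is what makes shorter routes to green more valuable than longer ones in this sketch, since non-terminal moves earn nothing.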