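A new colleague has joined our group and I need to help him get started with RL, so I chose to begin with Silver's course.
For myself, I am adding one more requirement: a careful read of "Reinforcement Learning: An Introduction".
I did not read it very carefully before, so this time I hope to be more thorough and to write a short summary of the corresponding key points as I go.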
8.1 Models and Planning
By a model of the environment we mean anything that an agent can use to predict how the environment will respond to its actions.
The word planning is used in several different ways in different fields. We use the term to refer to any computational process that takes a model as input and produces or improves a policy for interacting with the modeled environment.
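To make "takes a model as input and produces or improves a policy" concrete, here is a minimal sketch in the spirit of random-sample one-step tabular Q-planning. The 4-state chain environment, the function names `model` and `q_planning`, and the hyperparameters are all assumptions made up for illustration; they do not come from the book's text.

```python
import random
from collections import defaultdict

# Hypothetical toy environment (an assumption for illustration):
# a chain of states 0..3; reaching state 3 gives reward 1 and ends the episode.
def model(state, action):
    """Sample model: predicts (reward, next_state, done) for a state-action pair."""
    next_state = min(state + 1, 3) if action == "right" else max(state - 1, 0)
    done = next_state == 3
    return (1.0 if done else 0.0), next_state, done

def q_planning(model, states, actions, n_updates=5000, alpha=0.1, gamma=0.9, seed=0):
    """Improve action values using only simulated experience from the model."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(n_updates):
        # 1. Pick a state-action pair at random.
        s, a = rng.choice(states), rng.choice(actions)
        # 2. Ask the model (not the real environment) what would happen.
        r, s2, done = model(s, a)
        # 3. Apply a one-step Q-learning update to the simulated transition.
        target = r if done else r + gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (target - q[(s, a)])
    return q

STATES = [0, 1, 2]            # non-terminal states
ACTIONS = ["left", "right"]
q = q_planning(model, STATES, ACTIONS)
# The policy produced by planning: act greedily with respect to the planned values.
policy = {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in STATES}
```

Note that the agent never touches a real environment here: every transition used in the update is generated by `model`, which is what makes this planning rather than learning.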
The difference is that whereas planning uses simulated experience generated by a model, learning methods use real experience generated by the environment. Of course this difference lead