随笔档案「2018年10月30日」：[Reinforcement Learning] Model-Free Pred... - Poll的笔记

2018年10月30日

[Reinforcement Learning] Model-Free Prediction

摘要：上篇文章介绍了 Model based 的通用方法——动态规划，本文内容介绍 Model Free 情况下 Prediction 问题，即 "Estimate the value function of an unknown MDP"。 Model based：MDP已知，即转移矩阵和奖赏函数均已知阅读全文

posted @ 2018-10-30 09:37 Poll的笔记阅读(2098) 评论(1) 推荐(2)

Poll的笔记

[三叶草精神] what hurts more,the pain of hard work or the pain of regret?

公告