摘要: Optimal Value Function is how much reward the best policy can get from a state s, which is the best senario given state s. It can be defined as: Value 阅读全文
posted @ 2019-07-10 09:53 Junfei_Wang 阅读(551) 评论(0) 推荐(0) 编辑