Post archive for February 11, 2022 - initial_h

February 11, 2022

Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning

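Summary: **Published:** 2018 **Key points:** This paper proposes the model-based value expansion (MVE) algorithm. It keeps model uncertainty under control by expanding the model only to a finite depth, and uses the rewards over those finite steps to estimate the value, which improves the accuracy of the value estimate. Combined with model f…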

posted @ 2022-02-11 13:53 initial_h
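
The finite-depth value estimate described in the summary above can be sketched roughly as follows. This is a simplified illustration, not the paper's full training procedure: the names `mve_target`, `policy`, `model`, and `value_fn` are hypothetical, and the model is assumed to be a deterministic function returning the next state and reward.

```python
from typing import Any, Callable, Tuple


def mve_target(
    state: Any,
    policy: Callable[[Any], Any],
    model: Callable[[Any, Any], Tuple[Any, float]],
    value_fn: Callable[[Any], float],
    horizon: int,
    gamma: float = 0.99,
) -> float:
    """Estimate the value of `state` with an H-step rollout in a learned model.

    Assumption: `model(s, a)` returns (next_state, reward) deterministically.
    The estimate is the discounted sum of `horizon` imagined rewards plus the
    discounted bootstrap value of the final imagined state.
    """
    total = 0.0
    discount = 1.0
    s = state
    for _ in range(horizon):
        a = policy(s)            # action chosen by the current policy
        s, r = model(s, a)       # one imagined step in the learned model
        total += discount * r    # accumulate the discounted reward
        discount *= gamma
    # Bootstrap with the value function at the end of the short rollout.
    return total + discount * value_fn(s)
```

With `horizon=0` the function reduces to the plain value function, so the horizon directly controls how much the estimate relies on the model.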

initial_h

https://github.com/initial-h
