2022 年 2月 17 日随笔档案 - initial_h

2022年2月17日

Collect & Infer - a fresh look at data-efficient Reinforcement Learning

摘要： **发表时间：**2021 **文章要点：**一篇比较短的概念性的文章，主要想说Data-efficient RL走过了三个阶段，一个是pure on-line RL，就是数据来了用一次就扔；第二个是RL with a replay buffer，数据来了会存到一个容量有限的buffer里，数据可以阅读全文

posted @ 2022-02-17 12:38 initial_h 阅读(64) 评论(0) 推荐(0) 编辑

initial_h

https://github.com/initial-h

公告