Post archive for November 22, 2021: Search on the Replay Buffer: Bridging Pl... - initial_h

November 22, 2021

Search on the Replay Buffer: Bridging Planning and Reinforcement Learning

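Abstract: **Published:** 2019 (NeurIPS 2019) **Key points:** This paper combines planning with reinforcement learning to solve complex tasks. The main idea is to use reinforcement learning (goal-conditioned RL) to build a graph whose nodes include the start position, the goal position, and intermediate points. This amounts to taking a distant goal…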
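The graph idea in the abstract can be illustrated with a small sketch. This is only a rough illustration under stated assumptions, not the paper's implementation: `euclidean` is a stand-in for a distance estimate that would come from a goal-conditioned RL agent, and the names `build_graph`, `shortest_path`, `plan_waypoints`, and the `max_dist` threshold are all hypothetical. Nodes are the start, the goal, and a set of intermediate states; edges connect states that are estimated to be close, and a shortest-path search returns the sequence of intermediate points between start and goal.

```python
import heapq
import math


def euclidean(a, b):
    # Assumption: stand-in for a learned goal-conditioned distance estimate.
    return math.dist(a, b)


def build_graph(nodes, distance, max_dist):
    """Connect each ordered pair of nodes whose estimated distance is at most max_dist."""
    graph = {i: [] for i in range(len(nodes))}
    for i, u in enumerate(nodes):
        for j, v in enumerate(nodes):
            if i != j:
                d = distance(u, v)
                if d <= max_dist:
                    graph[i].append((j, d))
    return graph


def shortest_path(graph, source, target):
    """Dijkstra's algorithm; returns node indices from source to target, or None."""
    best = {source: 0.0}
    parent = {}
    frontier = [(0.0, source)]
    while frontier:
        cost, u = heapq.heappop(frontier)
        if u == target:
            # Walk the parent links back to the source.
            path = [u]
            while path[-1] != source:
                path.append(parent[path[-1]])
            return path[::-1]
        if cost > best.get(u, math.inf):
            continue  # stale queue entry
        for v, w in graph[u]:
            new_cost = cost + w
            if new_cost < best.get(v, math.inf):
                best[v] = new_cost
                parent[v] = u
                heapq.heappush(frontier, (new_cost, v))
    return None


def plan_waypoints(start, goal, intermediate_states, distance=euclidean, max_dist=1.5):
    """Return the states along the shortest path from start to goal, or None."""
    nodes = [start] + list(intermediate_states) + [goal]
    graph = build_graph(nodes, distance, max_dist)
    path = shortest_path(graph, 0, len(nodes) - 1)
    return None if path is None else [nodes[i] for i in path]


waypoints = plan_waypoints((0, 0), (3, 0), [(1, 0), (2, 0), (1, 2)])
```

In this toy setup the goal is too far from the start to be connected directly (distance 3 against a threshold of 1.5), so the search has to route through intermediate states. An agent could then head for each waypoint in turn rather than for the distant goal.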

posted @ 2021-11-22 12:42 initial_h

initial_h

https://github.com/initial-h
