initial_h

2023年7月31日

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230731085957589-2046683860.png) **发表时间：**2019 (IJCAI 2019) **文章要点：**这篇文章提出experience rep 阅读全文

posted @ 2023-07-31 09:05 initial_h 阅读(22) 评论(0) 推荐(0) 编辑

2023年7月29日

Improved deep reinforcement learning for robotics through distribution-based experience retention

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230729080850680-1663030080.png) **发表时间：**2016（IROS 2016） **文章要点：**这篇文章提出了experience repl 阅读全文

posted @ 2023-07-29 08:25 initial_h 阅读(10) 评论(0) 推荐(0) 编辑

2023年7月27日

The importance of experience replay database composition in deep reinforcement learning

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230727110633815-1407402877.png) **发表时间：**2015（Deep Reinforcement Learning Workshop, NIPS 阅读全文

posted @ 2023-07-27 11:12 initial_h 阅读(18) 评论(0) 推荐(0) 编辑

2023年7月25日

Selective Experience Replay for Lifelong Learning

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230725234343269-1373726308.png) **发表时间：**2018（AAAI 2018） **文章要点：**这篇文章想解决强化学习在学多个任务时候的遗忘阅读全文

posted @ 2023-07-25 23:47 initial_h 阅读(61) 评论(0) 推荐(0) 编辑

2023年7月17日

Reverb: A Framework For Experience Replay

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230717102339025-699657308.png) **发表时间：**2021 **文章要点：**这篇文章主要是设计了一个用来做experience replay的框阅读全文

posted @ 2023-07-17 10:24 initial_h 阅读(19) 评论(0) 推荐(0) 编辑

2023年7月13日

TOPOLOGICAL EXPERIENCE REPLAY

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230713232535617-402383287.png) **发表时间：**2022（ICLR 2022） **文章要点：**这篇文章指出根据TD error来采样是低效的阅读全文

posted @ 2023-07-13 23:30 initial_h 阅读(27) 评论(0) 推荐(0) 编辑

2023年7月10日

Regret Minimization Experience Replay in Off-Policy Reinforcement Learning

摘要： **发表时间：**2021 (NeurIPS 2021) **文章要点：**理论表明，更高的hindsight TD error，更加on policy,以及更准的target Q value的样本应该有更高的采样权重（The theory suggests that data with highe 阅读全文

posted @ 2023-07-10 12:53 initial_h 阅读(94) 评论(0) 推荐(0) 编辑

2023年7月7日

Effective Diversity in Population-Based Reinforcement Learning

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230707084258489-1960518081.png) **发表时间：**2020 (NeurIPS 2020) **文章要点：**这篇文章提出了Diversity v 阅读全文

posted @ 2023-07-07 08:46 initial_h 阅读(28) 评论(0) 推荐(0) 编辑

2023年7月3日

MODEL-AUGMENTED PRIORITIZED EXPERIENCE REPLAY

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202307/1428973-20230703112126926-921811970.png) **发表时间：**2022（ICLR 2022） **文章要点：**这篇文章想说Q网络通常会存在under- or 阅读全文

posted @ 2023-07-03 11:25 initial_h 阅读(40) 评论(0) 推荐(0) 编辑

2023年7月2日

Remember and Forget for Experience Replay

摘要： **发表时间：**2019（ICML 2019） **文章要点：**这篇文章想说如果replay的经验和当前的policy差别很大的话，对更新是有害的。然后提出了Remember and Forget Experience Replay (ReF-ER)算法，（1）跳过那些和当前policy差别很大阅读全文

posted @ 2023-07-02 12:15 initial_h 阅读(28) 评论(0) 推荐(0) 编辑

2023年6月25日

LEARNING TO SAMPLE WITH LOCAL AND GLOBAL CONTEXTS FROM EXPERIENCE REPLAY BUFFERS

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202306/1428973-20230625114456465-1558069206.png) **发表时间：**2021（ICLR 2021） **文章要点：**这篇文章想说，之前的experience r 阅读全文

posted @ 2023-06-25 11:57 initial_h 阅读(22) 评论(0) 推荐(0) 编辑

2023年6月23日

Prioritized Sequence Experience Replay

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202306/1428973-20230623122845476-1483728572.png) **发表时间：**2020 **文章要点：**这篇文章提出了Prioritized Sequence Exper 阅读全文

posted @ 2023-06-23 12:34 initial_h 阅读(49) 评论(0) 推荐(0) 编辑

2023年6月9日

Revisiting Fundamentals of Experience Replay

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202306/1428973-20230609121441155-1445259850.png) **发表时间：**2020（ICML2020） **文章要点：**这篇文章研究了experience repla 阅读全文

posted @ 2023-06-09 12:22 initial_h 阅读(29) 评论(0) 推荐(0) 编辑

2023年6月4日

Revisiting Prioritized Experience Replay: A Value Perspective

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202306/1428973-20230604130820622-309698896.png) **发表时间：**2021 **文章要点：**这篇文章想说Prioritized experience repla 阅读全文

posted @ 2023-06-04 13:12 initial_h 阅读(21) 评论(0) 推荐(0) 编辑

2023年6月2日

Muesli: Combining Improvements in Policy Optimization

摘要： ![](https://img2023.cnblogs.com/blog/1428973/202306/1428973-20230602222440022-2137032229.png) **发表时间：**2021（ICML 2021） **文章要点：**这篇文章提出一个更新policy的方式，结合阅读全文

posted @ 2023-06-02 22:36 initial_h 阅读(21) 评论(0) 推荐(0) 编辑

https://github.com/initial-h

公告