2017 年 8月 13 日随笔档案 - Shiyu_Huang

2017年8月13日

Incentivizing exploration in reinforcement learning with deep predictive models

摘要： Stadie, Bradly C., Sergey Levine, and Pieter Abbeel. "Incentivizing exploration in reinforcement learning with deep predictive models." arXiv preprint 阅读全文

posted @ 2017-08-13 19:00 Shiyu_Huang 阅读(408) 评论(0) 推荐(0) 编辑

RL Problems

摘要： 1.Delayed, sparse reward(feedback), Long-term planning Hierarchical Deep Reinforcement Learning, Sub-goal, SAMDP, optoins, Thompson sampling, Boltzman 阅读全文

posted @ 2017-08-13 15:47 Shiyu_Huang 阅读(250) 评论(0) 推荐(0) 编辑

Graying the black box: Understanding DQNs

摘要： Zahavy, Tom, Nir Ben-Zrihem, and Shie Mannor. "Graying the black box: Understanding DQNs." International Conference on Machine Learning. 2016. 这篇论文想要做阅读全文

posted @ 2017-08-13 14:56 Shiyu_Huang 阅读(369) 评论(0) 推荐(0) 编辑

黄世宇@智谱AI，OpenRL Lab负责人，强化学习，LLM，通用人工智能[OpenRL][知乎][GitHub][Linkedin]如果你对人工智能前沿感兴趣，欢迎联系并加入我们！

黄世宇@智谱AI，OpenRL Lab负责人，强化学习，LLM，通用人工智能
[OpenRL][知乎][GitHub][Linkedin]
如果你对人工智能前沿感兴趣，欢迎联系并加入我们！