Summary: Stadie, Bradly C., Sergey Levine, and Pieter Abbeel. "Incentivizing exploration in reinforcement learning with deep predictive models." arXiv preprint... Read more
posted @ 2017-08-13 19:00 Shiyu_Huang Views (408) Comments (0) Recommendations (0)
Summary: 1. Delayed, sparse reward (feedback), Long-term planning Hierarchical Deep Reinforcement Learning, Sub-goal, SAMDP, options, Thompson sampling, Boltzman... Read more
posted @ 2017-08-13 15:47 Shiyu_Huang Views (250) Comments (0) Recommendations (0)
Summary: Zahavy, Tom, Nir Ben-Zrihem, and Shie Mannor. "Graying the black box: Understanding DQNs." International Conference on Machine Learning. 2016. This paper sets out to... Read more
posted @ 2017-08-13 14:56 Shiyu_Huang Views (369) Comments (0) Recommendations (0)