Post archive for July 14, 2021: Using Monte Carlo Tree Search as a Demon... - initial_h

July 14, 2021

Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL

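Summary: **Published:** 2018 (AAAI-19 Workshop on Reinforcement Learning in Games) **Key points:** Combines A3C with MCTS and adds an auxiliary-task loss for predicting terminal states, achieving fairly good results on Pommerman. The main approach is, within A3C's work...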

posted @ 2021-07-14 11:43 initial_h

initial_h

https://github.com/initial-h
