摘要:
About model based and model free Model free methods cannot be the future of reinforcement learnig, even though these algorithms perform better than mo 阅读全文
摘要:
很早之前看到这篇文章的时候,觉得这篇文章的思想很朴素,没有让人眼前一亮的东西就没有太在意。之后读到很多Multi-Agent或者并行训练的文章,都会提到这个算法,比如第一视角多人游戏(Quake III Arena Capture the Flag)的超人表现,NeurIPS2018首届多智能体竞赛 阅读全文
摘要:
AlphaZero Gomoku MPI Link Github : "AlphaZero Gomoku MPI" Overview This repo is based on "junxiaosong/AlphaZero_Gomoku" , sincerely grateful for it. I 阅读全文