11月深度学习班第9课强化学习与DQN
强化学习与DQN
强化学习成就
Learned the world’s best player of Backgammon (Tesauro 1995)
Learned acrobatic helicopter autopilots (Ng, Abbeel, Coates et al
2006+)
Widely used in the placement and selection of advertisements on
the web (e.g. A-B tests)
Used to make strategic decisions in Jeopardy! (IBM’s Watson
2011)
Achieved human-level performance on Atari games from pixel
-level visual input, in conjunction with deep learning (Google
Deepmind 2015)
In all these cases, performance was better than could be obtained by
any other method, and was obtained without human instruction
Life is short, but I have a cat.
【推荐】编程新体验,更懂你的AI,立即体验豆包MarsCode编程助手
【推荐】凌霞软件回馈社区,博客园 & 1Panel & Halo 联合会员上线
【推荐】抖音旗下AI助手豆包,你的智能百科全书,全免费不限次数
【推荐】博客园社区专享云产品让利特惠,阿里云新客6.5折上折
【推荐】轻量又高性能的 SSH 工具 IShell:AI 加持,快人一步