Though the pen produces a thousand words, the mind holds not a single plan
Summary: Lecture 1. Video: https://www.youtube.com/watch?v=2pWv7GOvuf0 Slide: Introduction to Reinforcement Learning. Key points: 1. An RL agent may include one…
posted @ 2017-01-18 09:32 CasperWin Views (521) Comments (0) Recommendations (0)
Summary: Pip installation. Pip is a package management system used to install and manage software packages written in Python. We provide pip packages for Tensor…
posted @ 2017-01-18 07:13 CasperWin Views (2712) Comments (0) Recommendations (0)
Summary: Deep Q Network. 4.1 DQN algorithm update; 4.2 DQN neural network; 4.3 DQN decision making; 4.4 OpenAI gym environment library. Deep Q Network, abbreviated DQN, combines the strengths of Q-learning with neural networks. Notes: Pseudo…
posted @ 2017-01-18 05:08 CasperWin Views (513) Comments (0) Recommendations (0)
Summary: Homepage. Warm up: first chapters from Reinforcement Learning: An Introduction, Sutton & Barto, Second Edition (pdf), and also ebook here. Dave Silver’s cours…
posted @ 2017-01-18 03:14 CasperWin Views (1351) Comments (0) Recommendations (0)