Post archive for January 18, 2017 - CasperWin

January 18, 2017

Summary: Lecture 1 Video: https://www.youtube.com/watch?v=2pWv7GOvuf0 Slide: Introduction to Reinforcement Learning Key points: 1. An RL agent may include one ...

posted @ 2017-01-18 09:32 CasperWin Views(521) Comments(0) Recommended(0)

[Installation] Using TensorFlow with Python 2.7 / 3.5

Summary: Pip installation Pip is a package management system used to install and manage software packages written in Python. We provide pip packages for Tensor ...

posted @ 2017-01-18 07:13 CasperWin Views(2712) Comments(0) Recommended(0)

Study Notes | Morvan - Reinforcement Learning, Part 4: Deep Q Network

Summary: Deep Q Network 4.1 DQN algorithm update 4.2 DQN neural network 4.3 DQN decision making 4.4 OpenAI gym environment library Deep Q Network, abbreviated DQN, combines the strengths of Q learning with neural networks. Notes Psudo ...

posted @ 2017-01-18 05:08 CasperWin Views(513) Comments(0) Recommended(0)

Study Notes | CMU 10703: Deep Reinforcement Learning and Control, Spring 2017

Summary: Homepage Warm up First Chapters from Reinforcement Learning: an Introduction, Sutton & Barto, Second Edition (pdf) & also ebook here Dave Silver’s cours ...

posted @ 2017-01-18 03:14 CasperWin Views(1351) Comments(0) Recommended(0)

Casper's Blog

Though a thousand words flow from my pen, I have no real plan in mind
