ecoflex

2018年5月23日

CS294-112深度增强学习课程（加州大学伯克利分校 2017）NO.4 Learning policies by imitating optimal controllers

摘要： There are some problems: mismatch of model and reality; gradient explosion so, the dynamics can be quite messy, and backpropogating can be quite probl 阅读全文

posted @ 2018-05-23 19:14 ecoflex 阅读(334) 评论(0) 推荐(0) 编辑

2018年5月22日

CS294-112深度增强学习课程（加州大学伯克利分校 2017）NO.3 Learning dynamical system models from data

摘要： ... 阅读全文

posted @ 2018-05-22 19:58 ecoflex 阅读(205) 评论(0) 推荐(0) 编辑

2018年5月21日

CS294-112深度增强学习课程（加州大学伯克利分校 2017）NO.2 optimal control and planning

摘要： solved normally by sequential quadratic programming algorithms an example of linear system 阅读全文

posted @ 2018-05-21 20:33 ecoflex 阅读(209) 评论(0) 推荐(0) 编辑

2018年5月19日

【李宏毅深度学习合辑】Advanced Topics in Deep Learning - Imitation Learning

摘要： You have to force experts to treat some uncommon and extreme situations. a mechanical way to learn However, we don't know rt if you use sequence GAN, 阅读全文

posted @ 2018-05-19 20:21 ecoflex 阅读(460) 评论(0) 推荐(0) 编辑

2018年5月15日

摘要： not only JS divergence could be applied to GAN, other divergences are all applicable! f start is convex several ACG icons become very similar, if trai 阅读全文

posted @ 2018-05-15 18:43 ecoflex 阅读(644) 评论(0) 推荐(0) 编辑

摘要： Too much limitation of Gaussian model. The images are too blurry. So any general model? But if PG(x;θ) is a neural network, it's impossible to calcula 阅读全文

posted @ 2018-05-15 14:50 ecoflex 阅读(446) 评论(0) 推荐(0) 编辑

2018年5月14日

摘要： HW2: input a sentence, output an ACG icon 3 target: trains from front view, side views. So that the output would be the average of the three pictures. 阅读全文

posted @ 2018-05-14 23:12 ecoflex 阅读(1526) 评论(0) 推荐(0) 编辑

2018年5月13日

李宏毅 2018最新GAN课程 class 1 Introduction

摘要：比较有用的是conditioned generator，能够控制输入的vector来控制对应的文字音像 https://zhuanlan.zhihu.com/p/24767059 单纯生成人脸意义不大，因为随便拍一个路人就行了。但是能从左右照片生成正面照片，就很神奇了必须学会辨别转折 Varia 阅读全文

posted @ 2018-05-13 13:12 ecoflex 阅读(3558) 评论(0) 推荐(0) 编辑

2018年5月6日

Records of Pytorch in Practice

摘要： https://www.bilibili.com/video/av15997678/ My own deep reinforcement learning code: https://github.com/ysgclight/Reinforcement-Learning-with-Pytorch D 阅读全文

posted @ 2018-05-06 14:54 ecoflex 阅读(171) 评论(0) 推荐(0) 编辑

2018年5月5日

fast.ai Lesson 2: Deep Learning 2018

摘要： data augumentation 阅读全文

posted @ 2018-05-05 19:46 ecoflex 阅读(178) 评论(0) 推荐(0) 编辑

fast.ai Lesson 1: Deep Learning 2018

摘要： 10 free hours run on AWS click this one click on new machine pick a region choose linux ubuntu 16 250GB is preferred ctrl shift v to paste your passwo 阅读全文

posted @ 2018-05-05 18:49 ecoflex 阅读(555) 评论(0) 推荐(0) 编辑

2018年5月4日

Learn You a PyTorch! (aka Introduction Into PyTorch)

摘要： https://www.bilibili.com/video/av22940029 left hand side: NN being constructed right hand side: NN being called turn the NN code into GPU compatible m 阅读全文

posted @ 2018-05-04 18:02 ecoflex 阅读(149) 评论(0) 推荐(0) 编辑

Deep RL Bootcamp Frontiers Lecture I: Recent Advances,

摘要： high bias if the robot has learnt something (no changes appear with iterations) however, in the real world tasks, the task could change a little bit, 阅读全文

posted @ 2018-05-04 17:14 ecoflex 阅读(252) 评论(0) 推荐(0) 编辑

Deep RL Bootcamp TAs Research Overview

摘要： model free: high variance. model based: high bias within 1h of human demonstration of each task, VR!!! 阅读全文

posted @ 2018-05-04 15:34 ecoflex 阅读(243) 评论(0) 推荐(0) 编辑

Deep RL Bootcamp Lecture 10B Inverse Reinforcement Learning

摘要： intrinsic ambiguity: move toward purple triangle? move away from red triangle? move along grey arrow? or the combine of them? the right part of the ri 阅读全文

posted @ 2018-05-04 13:58 ecoflex 阅读(349) 评论(0) 推荐(0) 编辑

2018年5月3日

Deep RL Bootcamp Lecture 10A Utlities

摘要：阅读全文

posted @ 2018-05-03 18:55 ecoflex 阅读(177) 评论(0) 推荐(0) 编辑

2018年5月2日

Deep RL Bootcamp Lecture 9 Model-based Reinforcement

摘要： So, the process is similar to one-to-many RNN? learn much more efficiently than model-free method iteratively get better less than 300 trials ~ 25min 阅读全文

posted @ 2018-05-02 23:02 ecoflex 阅读(216) 评论(0) 推荐(0) 编辑

Deep RL Bootcamp Lecture 8 Derivative Free Methods

摘要： you wouldn't try to explore any problem structure in DFO low dimension policy 30 degrees of freedom 120 paramaters to tune keep the positive results i 阅读全文

posted @ 2018-05-02 13:08 ecoflex 阅读(182) 评论(0) 推荐(0) 编辑

2018年5月1日

Deep RL Bootcamp Lecture 7: SVG, DDPG, and Stochastic Computation Graphs

摘要： ^ is the square root of epsilon a simplified version of hard version a more smooth way to find correct solution the first term is the REINFORCE term, 阅读全文

posted @ 2018-05-01 22:38 ecoflex 阅读(276) 评论(0) 推荐(0) 编辑

Deep RL Bootcamp Lecture 6: Nuts and Bolts of Deep RL Experimentation

摘要： fast feedback to robot with better shape reward func, and learning could be much faster open ai baseline rllab multiple tasks and multiple seeds to te 阅读全文

posted @ 2018-05-01 21:34 ecoflex 阅读(332) 评论(0) 推荐(0) 编辑

公告