随笔分类 - Deep Reinforcement Learning
摘要:Dynamic Zoom-in Network for Fast Object Detection in Large Images 2020-03-17 21:45:24 Paper: CVPR-2018 1. Background and Motivation: 如图 1 所示, 本文提出一种 c
阅读全文
摘要:Learning When and Where to Zoom with Deep Reinforcement Learning 2020-03-03 14:47:08 Paper: https://arxiv.org/pdf/2003.00425.pdf Related work: "Effici
阅读全文
摘要:Policy Gradient Algorithms 2019-10-02 17:37:47 This blog is from: https://lilianweng.github.io/lil-log/2018/04/08/policy-gradient-algorithms.html Abst
阅读全文
摘要:Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition ICCV 2019 (oral) 2019-08-01 15:08:19 Paper:https://a
阅读全文
摘要:Multi-shot Pedestrian Re-identification via Sequential Decision Making 2019-07-31 20:33:37 Paper: http://openaccess.thecvf.com/content_cvpr_2018/paper
阅读全文
摘要:Deep Reinforcement Learning with Iterative Shift for Visual Tracking 2019-07-30 14:55:31 Paper: http://openaccess.thecvf.com/content_ECCV_2018/papers/
阅读全文
摘要:A3C 算法资料收集 2019-07-26 21:37:55 Paper: https://arxiv.org/pdf/1602.01783.pdf Code: 1. 超级马里奥:https://github.com/vietnguyen91/Super-mario-bros-A3C-pytorch
阅读全文
摘要:Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor 2019-07-15 22:23:02 Paper: https://arxiv.org/pdf/180
阅读全文
摘要:Real-time ‘Actor-Critic’ Tracking 2019-07-15 10:49:16 Paper: http://openaccess.thecvf.com/content_ECCV_2018/papers/Boyu_Chen_Real-time_Actor-Critic_Tr
阅读全文
摘要:RL — Policy Gradient Explained 2019-05-02 21:12:57 This blog is copied from: https://medium.com/@jonathan_hui/rl-policy-gradients-explained-9b13b688b1
阅读全文
摘要:深度学习课程笔记(十八)Deep Reinforcement Learning - Part 1 (17/11/27) Lectured by Yun-Nung Chen @ NTU CSIE 2018-08-11 13:42:23 This video can be found from: htt
阅读全文
摘要:Tutorials on Inverse Reinforcement Learning 2018-07-22 21:44:39 1. Papers: Inverse Reinforcement Learning: http://citeseerx.ist.psu.edu/viewdoc/downlo
阅读全文
该文被密码保护。
该文被密码保护。
摘要:Proximal Policy Optimization Algorithms Updated on 2019-09-14 16:15:59 Paper: https://arxiv.org/pdf/1707.06347.pdf TensorFlow Code from OpenAI: https:
阅读全文
摘要:分布式强化学习基础概念(Distributional RL) from: https://mtomassoli.github.io/2017/12/08/distributional_rl/ 1. Q-learning 在 Q-learning 中,我们想要优化如下的 loss: Distribut
阅读全文
摘要:深度强化学习的18个关键问题 from: https://zhuanlan.zhihu.com/p/32153603 85 人赞了该文章 深度强化学习的问题在哪里?未来怎么走?哪些方面可以突破? 这两天我阅读了两篇篇猛文A Brief Survey of Deep Reinforcement Lea
阅读全文
摘要:深度学习课程笔记(七):模仿学习(imitation learning) 2017.12.10 本文所涉及到的 模仿学习,则是从给定的展示中进行学习。机器在这个过程中,也和环境进行交互,但是,并没有显示的得到 reward。在某些任务上,也很难定义 reward。如:自动驾驶,撞死一人,reward
阅读全文
摘要:文章来源:Attentional Action-Driven Deep Network for Visual Object Tracking 博士论文(2017年8月份完稿) http://s-space.snu.ac.kr/bitstream/10371/136793/1/000000145905
阅读全文
摘要:大牛讲堂 | 算法工程师入门第二期-穆黎森讲增强学习 2017-07-13 HorizonRobotics 🤖小广告:去CVPR 2017的小伙伴,点我有惊喜! 编者按:地平线大牛讲堂算法工程师入门第二讲如期而至,本期地平线资深算法工程师、增强学习专家穆黎森将为大家带来增强学习简介,全文约1200
阅读全文