Deep Reinforcement Learning — Papers (2)
2016-03-20 23:22
423 查看
Deep Reinforcement Learning — Papers
Many recent advancements in AI research stem from breakthroughs in deep reinforcement learning. This is a complex and varied field, but JunhyukOh at the University of Michigan has compiled a great list of papers. The list, which originally appeared on GitHub,
are sorted by time with most recent appearing first.
Bookmarks
All PapersQ-learning
Policy Gradient
Discrete Control
Continuous Control
Text Domain
Visual Domain
Robotics
Games
Monte-Carlo Tree Search
Inverse Reinforcement Learning
Improving Exploration
Transfer Learning
All Papers
Deep Reinforcement Learning with an Unbounded Action Space, J. He et al.,arXiv,2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., arXiv,
2015.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv,
2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv,
2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv,
2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv,
2015.
Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al., arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,arXiv,
2015.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP,
2015.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS,
2015.
Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al., arXiv,
2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv,
2015.
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al., arXiv,
2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv,
2015.
Maximum Entropy Deep Inverse Reinforcement Learning, M. Wulfmeier et al.,arXiv,
2015.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz, et al., RSS,
2015.
Universal Value Function Approximators, T. Schaul et al., ICML,
2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML
Workshop, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML,
2015.
Human-level control through deep reinforcement learning, V. Mnih et al.,Nature,
2015.
Deep Learning for Real-Time Atari Game
Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS
Workshop, 2013.
Q-learning
Deep Reinforcement Learning with an Unbounded Action Space, J. He et al.,arXiv,2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., arXiv,
2015.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv,
2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv,
2015.
Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al., arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,arXiv,
2015.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP,
2015.
Action-Conditiona
143c7
l Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS,
2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv,
2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv,
2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML
Workshop, 2015.
Human-level control through deep reinforcement learning, V. Mnih et al.,Nature,
2015.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS
Workshop, 2013.
Policy Gradient
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv,2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,arXiv,
2015.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML,
2015.
Discrete Control
Deep Reinforcement Learning with an Unbounded Action Space, J. He et al.,arXiv,2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., arXiv,
2015.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv,
2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv,
2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv,
2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv,
2015.
Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al., arXiv, 2015.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP,
2015.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS,
2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv,
2015.
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al., arXiv,
2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv,
2015.
Universal Value Function Approximators, T. Schaul et al., ICML,
2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML
Workshop, 2015.
Human-level control through deep reinforcement learning, V. Mnih et al.,Nature,
2015.
Deep Learning for Real-Time Atari Game
Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS
Workshop, 2013.
Continuous Control
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv,2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,arXiv,
2015.
Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al., arXiv,
2015.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz, et al., RSS,
2015.
Trust Region Policy Optimization, J. Schulman et al., ICML,
2015.
Text Domain
Deep Reinforcement Learning with an Unbounded Action Space, J. He et al.,arXiv,2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP,
2015.
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al., arXiv,
2015.
Visual Domain
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., arXiv,2015.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv,
2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv,
2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv,
2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,arXiv,
2015.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS,
2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv,
2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv,
2015.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
Universal Value Function Approximators, T. Schaul et al., ICML,
2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML
Workshop, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML,
2015.
Human-level control through deep reinforcement learning, V. Mnih et al.,Nature,
2015.
Deep Learning for Real-Time Atari Game
Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS
Workshop, 2013.
Robotics
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv,2015.
Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al., arXiv,
2015.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz, et al., RSS,
2015.
Trust Region Policy Optimization, J. Schulman et al., ICML,
2015.
Games
Deep Reinforcement Learning with an Unbounded Action Space, J. He et al.,arXiv,2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., arXiv,
2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv,
2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv,
2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,arXiv,
2015.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP,
2015.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS,
2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv,
2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv,
2015.
Universal Value Function Approximators, T. Schaul et al., ICML,
2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML
Workshop, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML,
2015.
Human-level control through deep reinforcement learning, V. Mnih et al.,Nature,
2015.
Deep Learning for Real-Time Atari Game
Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS
Workshop, 2013.
Monte-Carlo Tree Search
Deep Learning for Real-Time Atari GamePlay Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Inverse Reinforcement Learning
Maximum Entropy Deep Inverse Reinforcement Learning, M. Wulfmeier et al.,arXiv,2015.
Transfer Learning
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv,2015.
Improving Exploration
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS,2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv,
2015.
Josh.ai is an artificial intelligence agent for your home. If you’re interested in learning more, visit us at https://josh.ai.
相关文章推荐
- 用Python编写一个国际象棋AI程序
- AI和IA之随想
- 位图和矢量图区别,PS和AI的区别
- 用java写星际争霸的ai
- 用于对象识别的最好的多级结构是什么?(What is the Best Multi-Stage Architecture for Object Recognition)
- deep learning 在各对象数据集上的识别率比较
- BWAPI 系列 第一 配置jni-bwapi
- Unity3d之怪物AI接口规范
- 深度学习的一些教程
- AI设计---Robot(v1.0),简单对话的实现
- AI设计---robot_v2.0(感情机制的简单模拟)
- ubuntu 14.04上配置无GPU的Caffe(A卡机适用)
- 小白的柳州麻将黑科技.Part1 = 你不知道的门清
- WIFI小车记录四:用Adobe Illustrator绘制应用磁贴
- Deep learning: autoencoders and sparsity
- 学不进去、玩不痛快、睡不踏实、吃的还特多
- 讨厌这些不符合年龄而必须得去面对的经历
- 『HTML5实现人工智能』小游戏《井字棋》发布,据说IQ上200才能赢【算法&代码讲解+资源打包下载】
- deep learning 个人理解及其实现工具
- 快速去除图标斜线水印