This is CMSC389F, the University of Maryland's theoretical introduction to the art of reinforcement learning. An introductory course taught by Kevin Chen and Zack Khan, CMSC389F covers, in broad strokes, topics including Markov decision processes, Monte Carlo methods, policy gradient methods, exploration, and application to real environments.
In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning mechanism. However, their application in practice has been limited to simplistic environments, due to the difficulty of training such models in larger, potentially partially-observed and 3D environments. In this work we introduce a novel action-conditioned generative model of such challenging environments. The model features a non-parametric spatial memory system in which we store learned, disentangled representations of the environment. Low-dimensional spatial updates are computed using a state-space model that makes use of knowledge on the prior dynamics of the moving agent, and high-dimensional visual observations are modelled with a variational auto-encoder. The result is a scalable architecture capable of performing coherent predictions over hundreds of time steps across a range of partially observed 2D and 3D environments.
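The variational auto-encoder mentioned in the abstract can be illustrated with a minimal NumPy sketch of its core mechanics: an encoder that outputs the parameters of a diagonal Gaussian over the latent code, the reparameterization trick, and the KL-divergence term of the training objective. The layer shapes and function names here are illustrative assumptions, not the paper's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

def encode(x, W_mu, W_logvar):
    # Toy linear "encoder": maps an observation batch to the mean and
    # log-variance of a diagonal Gaussian over the latent code z.
    return x @ W_mu, x @ W_logvar

def reparameterize(mu, logvar, rng):
    # Reparameterization trick: z = mu + sigma * eps keeps sampling
    # differentiable with respect to mu and logvar.
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps

def kl_divergence(mu, logvar):
    # KL(q(z|x) || N(0, I)) for a diagonal Gaussian, summed over latent
    # dimensions and averaged over the batch; always non-negative.
    return np.mean(-0.5 * np.sum(1 + logvar - mu**2 - np.exp(logvar), axis=1))

# Toy dimensions: batch of 4 observations, 8 features, 3 latent dims.
x = rng.standard_normal((4, 8))
W_mu = rng.standard_normal((8, 3)) * 0.1
W_logvar = rng.standard_normal((8, 3)) * 0.1

mu, logvar = encode(x, W_mu, W_logvar)
z = reparameterize(mu, logvar, rng)
print(z.shape)  # (4, 3)
```

In a full VAE this KL term is added to a reconstruction loss from the decoder; the paper combines such a VAE with a state-space model for the low-dimensional spatial updates.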
Throughout my PhD on Deep Learning-based robotics, I have read a lot of papers on Machine Learning, Reinforcement Learning and AI in general. But papers can be a bit...
Introduction to Reinforcement Learning, including a definition, analysis of the motivations and limitations of AI, and an overview of the technology along with its applications.
Asynchronous methods for deep reinforcement learning, Mnih et al., ICML 2016. You know something interesting is going on when you see a scalability plot that looks like this: that’s a superlinear speedup as we increase the number of threads, giving a 24x performance improvement with 16 threads as compared to a single thread. The result…
The codebase contains a replica of the AlphaZero methodology, built in Python and Keras. Gain a deeper understanding of how AlphaZero works and adapt the code to plug in new games.
A. Zeng, S. Song, S. Welker, J. Lee, A. Rodriguez, and T. Funkhouser. (2018). arXiv:1803.09956. Comment: Under review at the International Conference on Intelligent Robots and Systems (IROS) 2018. Project webpage: http://vpg.cs.princeton.edu.
S. Albrecht and P. Stone. (2017). arXiv:1709.08071. Comment: 42 pages, submitted for review to Artificial Intelligence Journal. Keywords: multiagent systems, agent modelling, opponent modelling, survey, open problems.