One of the key enablers of the ChatGPT magic can be traced back to 2017, under the then-obscure name of reinforcement learning from human feedback (RLHF).
Large language models (LLMs) have become one of the most interesting environments for applying modern reinforcement learning (RL) techniques. While LLMs are great at deriving knowledge from vast amounts of text, RL helps translate that knowledge into actions. That is the secret behind RLHF.
Hi Geeks, welcome to Part 3 of our Reinforcement Learning series. In the last two blogs, we covered some basic concepts in RL and studied the multi-armed bandit problem and its solution methods…
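As a refresher on the bandit material from those earlier posts, here is a minimal sketch of the ε-greedy solution method. The Bernoulli arm probabilities, step count, and ε value are illustrative choices of mine, not taken from the series:

```python
import random

def epsilon_greedy_bandit(true_means, steps=5000, epsilon=0.1, seed=0):
    """Sample-average epsilon-greedy on a Bernoulli multi-armed bandit."""
    rng = random.Random(seed)
    k = len(true_means)
    counts = [0] * k      # pulls per arm
    values = [0.0] * k    # running mean reward per arm
    for _ in range(steps):
        if rng.random() < epsilon:                       # explore
            arm = rng.randrange(k)
        else:                                            # exploit current best estimate
            arm = max(range(k), key=lambda a: values[a])
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
    return values, counts

values, counts = epsilon_greedy_bandit([0.2, 0.5, 0.8])
```

After enough steps, the estimate for the best arm converges toward its true mean of 0.8, and that arm receives the bulk of the pulls.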
When the agent interacts with the environment, the sequence of experience tuples it observes can be highly correlated. The naive Q-learning algorithm that learns from each of these experience tuples in…
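The standard fix for this correlation is an experience replay buffer: store transitions and train on uniformly sampled minibatches, so consecutive tuples are not consumed in order. A minimal sketch (class and method names are my own, not from the post):

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-size buffer; uniform sampling breaks the temporal
    correlation between consecutive (s, a, r, s', done) tuples."""

    def __init__(self, capacity, seed=0):
        self.buffer = deque(maxlen=capacity)  # old transitions fall off the left
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random minibatch without replacement: transitions from
        # distant time steps get mixed together, unlike online ordering.
        return self.rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)

buf = ReplayBuffer(capacity=1000)
for t in range(100):
    buf.push(t, t % 4, 1.0, t + 1, False)
batch = buf.sample(32)
```

The `maxlen` bound also keeps memory constant, discarding the oldest transitions as new ones arrive.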
In Q-learning, we represent the Q-value as a table. However, many real-world problems have enormous state and/or action spaces, and a tabular representation is infeasible. For instance…
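For intuition, the tabular update looks like the following; the dictionary grows one entry per visited (state, action) pair, which is exactly what stops scaling when the state space is huge, and which motivates replacing the table with a neural-network function approximator. A minimal sketch with illustrative values:

```python
from collections import defaultdict

def q_learning_update(Q, s, a, r, s_next, actions, alpha=0.1, gamma=0.99):
    """One tabular Q-learning step: Q(s,a) += alpha * (target - Q(s,a))."""
    target = r + gamma * max(Q[(s_next, a2)] for a2 in actions)
    Q[(s, a)] += alpha * (target - Q[(s, a)])
    return Q[(s, a)]

Q = defaultdict(float)  # one table entry per (state, action) pair
q = q_learning_update(Q, s=0, a=1, r=1.0, s_next=1, actions=[0, 1])
# With all Q-values initially 0, the target is 1.0 and the update
# moves Q(0, 1) from 0 to alpha * 1.0 = 0.1.
```

A DQN keeps the same update target but replaces the `Q[(s, a)]` lookup with a network forward pass, so nearby states can share what they learn.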
This is a PyTorch implementation/tutorial of Deep Q Networks (DQN) from the paper Playing Atari with Deep Reinforcement Learning. It includes the dueling network architecture, a prioritized replay buffer, and Double DQN training.
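The Double DQN idea in that codebase can be summarized in one line: the online network selects the next action, and the target network evaluates it, which reduces the overestimation bias of vanilla DQN. A small illustrative sketch (not code from the repository; plain lists stand in for network outputs):

```python
def double_dqn_target(reward, gamma, q_online_next, q_target_next, done):
    """Double-DQN bootstrap target: action selection by the online net,
    action evaluation by the target net."""
    if done:
        return reward  # terminal transition: no bootstrap term
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best_action]

y = double_dqn_target(1.0, 0.99,
                      q_online_next=[0.2, 0.9, 0.1],
                      q_target_next=[0.5, 0.4, 0.3],
                      done=False)
# Online net picks action 1; target net evaluates it: 1.0 + 0.99 * 0.4 = 1.396
```

Vanilla DQN would instead take `max(q_target_next)` directly, coupling selection and evaluation in the same (noisy) estimates.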
In this article, we will try to understand where on-policy, off-policy, and offline learning algorithms fundamentally differ. Though there is a fair amount of intimidating jargon in…
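The on-policy/off-policy split is easiest to see by applying SARSA and Q-learning to the same transition; a minimal sketch with illustrative values (not from the article):

```python
from collections import defaultdict

def sarsa_update(Q, s, a, r, s2, a2, alpha=0.5, gamma=0.9):
    # On-policy: bootstrap from the action a2 the behaviour policy actually took.
    Q[(s, a)] += alpha * (r + gamma * Q[(s2, a2)] - Q[(s, a)])

def qlearn_update(Q, s, a, r, s2, actions, alpha=0.5, gamma=0.9):
    # Off-policy: bootstrap from the greedy action, regardless of what was taken.
    Q[(s, a)] += alpha * (r + gamma * max(Q[(s2, b)] for b in actions) - Q[(s, a)])

Q_sarsa = defaultdict(float, {(1, 0): 0.0, (1, 1): 2.0})
Q_qlearn = defaultdict(float, {(1, 0): 0.0, (1, 1): 2.0})

# Same transition (s=0, a=0, r=1.0, s'=1); the behaviour policy explored with a2=0.
sarsa_update(Q_sarsa, 0, 0, 1.0, 1, a2=0)
qlearn_update(Q_qlearn, 0, 0, 1.0, 1, actions=[0, 1])
# SARSA bootstraps from Q(1,0)=0.0, giving Q(0,0)=0.5;
# Q-learning bootstraps from max(Q(1,.))=2.0, giving Q(0,0)=1.4.
```

Offline (batch) RL then takes the off-policy idea to its limit: the update rule is the same, but the transitions come from a fixed logged dataset with no further interaction.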
A research team from McGill University, Université de Montréal, DeepMind and Mila presents an end-to-end, model-based deep reinforcement learning (RL) agent that dynamically attends to relevant parts of its environment to facilitate out-of-distribution (OOD) and systematic generalization.
A paper by DeepMind scientists triggered much debate about the path to artificial intelligence. Here, we'll try to draw the line between theory and practice.
- Sep. 28 – Oct. 2, 2020
- Lihong Li (Google Brain; chair), Marc G. Bellemare (Google Brain)
- The success of deep neural networks in modeling complicated functions has recently been leveraged by the reinforcement learning community, resulting in algorithms that are able to learn in environments previously thought to be much too large. Successful applications span domains from robotics to health care. However, this success is not well understood from a theoretical perspective. What modeling choices are necessary for good performance, and how does the flexibility of deep neural nets help learning? This workshop will connect practitioners to theoreticians with the goal of understanding the most impactful modeling decisions and the properties of deep neural networks that make them so successful. Specifically, we will study the approximation ability of deep neural nets in the context of reinforcement learning.
- Aug. 31 – Sep. 4, 2020
- Csaba Szepesvari (University of Alberta, Google DeepMind; chair), Emma Brunskill (Stanford University), Sébastien Bubeck (MSR), Alan Malek (DeepMind), Sean Meyn (University of Florida), Ambuj Tewari (University of Michigan), Mengdi Wang (Princeton)
This program aims to bring together researchers across the disciplines that have played a role in developing the theory of reinforcement learning. It will review past developments and identify promising directions of research, with an emphasis on addressing existing open problems, ranging from the design of efficient, scalable algorithms for exploration to the control of learning and planning. It also aims to deepen the understanding of model-free versus model-based learning and control, and the design of efficient methods that exploit structure and adapt to easier environments.
Learn AI from Stanford professors Christopher Manning, Andrew Ng, and Emma Brunskill. Free online course videos in Deep Learning, Reinforcement Learning, and Natural Language Processing.
The purpose of AI Magazine is to disseminate timely and informative articles that represent the current state of the art in AI and to keep its readers posted on AAAI-related matters. The articles are selected for appeal to readers engaged in research and
This is CMSC389F, the University of Maryland's theoretical introduction to the art of reinforcement learning. An introductory course taught by Kevin Chen and Zack Khan, CMSC389F covers, in broad strokes, topics including Markov decision processes, Monte Carlo methods, policy gradient methods, exploration, and application to real environments.
In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning mechanism. However, their application in practice has been limited to simplistic environments, due to the difficulty of training such models in larger, potentially partially-observed and 3D environments. In this work we introduce a novel action-conditioned generative model of such challenging environments. The model features a non-parametric spatial memory system in which we store learned, disentangled representations of the environment. Low-dimensional spatial updates are computed using a state-space model that makes use of knowledge on the prior dynamics of the moving agent, and high-dimensional visual observations are modelled with a variational auto-encoder. The result is a scalable architecture capable of performing coherent predictions over hundreds of time steps across a range of partially observed 2D and 3D environments.
Through my PhD on Deep Learning based robotics, I read a lot of papers on Machine Learning, Reinforcement Learning and AI in general. But papers can be a bit...
Introduction to Reinforcement Learning, including a definition, analysis of the motivations and limitations of AI, and an overview of the technology along with its applications.
Asynchronous methods for deep reinforcement learning, Mnih et al., ICML 2016. You know something interesting is going on when you see a scalability plot like the one in this paper: a superlinear speedup as the number of threads increases, giving a 24x performance improvement with 16 threads compared to a single thread. The result…
The codebase contains a replica of the AlphaZero methodology, built in Python and Keras. Gain a deeper understanding of how AlphaZero works and adapt the code to plug in new games.
We introduce AlphaGo Zero, the latest evolution of AlphaGo, the first computer program to defeat a world champion at the ancient Chinese game of Go. Zero is even more powerful and is arguably the strongest Go player in history. Previous versions of AlphaGo initially trained on thousands of human amateur and professional games to learn how to play Go. AlphaGo Zero skips this step and learns to play simply by playing games against itself, starting from completely random play. In doing so, it quickly surpassed human level of play and defeated the previously published champion-defeating version of AlphaGo by 100 games to 0.
These are lectures for course 6.S094: Deep Learning for Self-Driving Cars taught in Winter 2017. Course website: http://cars.mit.edu Contact: deepcars@mit.ed...
I have been working on Reinforcement Learning for the past few months, and all I can say is: it is different. A writeup of the common quirks and frustrations of Reinforcement Learning I have…
Humans excel at solving a wide variety of challenging problems, from low-level motor control through to high-level cognitive tasks. Our goal at DeepMind is to create artificial agents that can achieve a similar level of performance and generality. Like a human, our agents learn for themselves to achieve successful strategies that lead to the greatest long-term rewards.
PyBrain is a modular Machine Learning Library for Python. Its goal is to offer flexible, easy-to-use yet still powerful algorithms for Machine Learning Tasks and a variety of predefined environments to test and compare your algorithms.
PyBrain is short for Python-Based Reinforcement Learning, Artificial Intelligence and Neural Network Library.
Y. Chen and M. Bansal. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 675–686, Melbourne, Australia, Association for Computational Linguistics, July 2018.