tag :: reinforcement-learning

bookmarks (104)

Reinforcement Learning: Theory and Algorithms
University of Washington. Research interests: Machine Learning, Artificial Intelligence, Optimization, Statistics
5 years ago by @kirk86
tags: book, reinforcement-learning, website

RL theory seminars
4 years ago by @kirk86
tags: reinforcement-learning, theory, tutorials, website

MS&E338 Reinforcement Learning
https://web.stanford.edu/class/msande338/
4 years ago by @analyst
tags: 2020, course, reinforcement-learning, stanford

Theory of Reinforcement Learning | Simons Institute for the Theory of Computing
This program aims to reunite researchers across disciplines that have played a role in developing the theory of reinforcement learning. It will review past developments and identify promising directions of research, with an emphasis on addressing existing open problems, ranging from the design of efficient, scalable algorithms for exploration to how to control learning and planning. It also aims to deepen the understanding of model-free vs. model-based learning and control, and the design of efficient methods to exploit structure and adapt to easier environments.
4 years ago by @analyst
tags: 2020, collection, course, reinforcement-learning, tutorial, workshop

Theory of Reinforcement Learning Boot Camp | Simons Institute
- Aug. 31 – Sep. 4, 2020 - Csaba Szepesvari (University of Alberta, Google DeepMind; chair), Emma Brunskill (Stanford University), Sébastien Bubeck (MSR), Alan Malek (DeepMind), Sean Meyn (University of Florida), Ambuj Tewari (University of Michigan), Mengdi Wang (Princeton)
4 years ago by @analyst
tags: 2020, reinforcement-learning, tutorial, workshop

Deep Reinforcement Learning | Simons Institute
- Sep. 28 – Oct. 2, 2020 - Lihong Li (Google Brain; chair), Marc G. Bellemare (Google Brain) - The success of deep neural networks in modeling complicated functions has recently been applied by the reinforcement learning community, resulting in algorithms that are able to learn in environments previously thought to be much too large. Successful applications span domains from robotics to health care. However, the success is not well understood from a theoretical perspective. What are the modeling choices necessary for good performance, and how does the flexibility of deep neural nets help learning? This workshop will connect practitioners to theoreticians with the goal of understanding the most impactful modeling decisions and the properties of deep neural networks that make them so successful. Specifically, we will study the ability of deep neural nets to approximate in the context of reinforcement learning.
4 years ago by @analyst
tags: 2020, deep-learning, reinforcement-learning, tutorial, workshop

Welcome to Spinning Up in Deep RL! | OpenAI
https://spinningup.openai.com/en/latest/
4 years ago by @analyst
tags: deep-learning, documentation, opensource, reinforcement-learning, tutorial

Deep Learning & Reinforcement Learning
Chris G. Willcocks, Durham University
3 years ago by @analyst
tags: collection, course, deep-learning, lecture, reinforcement-learning

The Best 26 Python Reinforcement Learning Libraries | PythonRepo
https://pythonrepo.com/catalog/python-reinforcement-learning_newest_1
3 years ago by @analyst
tags: python, recommendation, reinforcement-learning

Evolution, rewards, and artificial intelligence – TechTalks
A paper by DeepMind scientists triggered much debate about the path to artificial intelligence. Here, we'll try to draw the line between theory and practice.
3 years ago by @analyst
tags: 2021, article, blog, google, lecture, reinforcement-learning

Yoshua Bengio Team Designs Consciousness-Inspired Planning Agent for Model-Based RL | by Synced | SyncedReview | Jun, 2021 | Medium
A research team from McGill University, Université de Montréal, DeepMind and Mila presents an end-to-end, model-based deep reinforcement learning (RL) agent that dynamically attends to relevant parts of its environments to facilitate out-of-distribution (OOD) and systematic generalization.
3 years ago by @analyst
tags: 2021, machine-learning, reinforcement-learning

Off-policy vs On-Policy vs Offline Reinforcement Learning Demystified! | by Kowshik chilamkurthy | Medium
In this article, we will try to understand where On-Policy learning, Off-policy learning and offline learning algorithms fundamentally differ. Though there is a fair amount of intimidating jargon in…
3 years ago by @analyst
tags: article, blog, reinforcement-learning

Deep Q Networks (DQN) | labml.ai
This is a PyTorch implementation/tutorial of Deep Q Networks (DQN) from the paper Playing Atari with Deep Reinforcement Learning. This includes dueling network architecture, a prioritized replay buffer and double-Q-network training.
3 years ago by @analyst
tags: article, blog, deep-learning, pytorch, reinforcement-learning, tutorial

Deep Q-Network, with PyTorch. Explaining the fundamentals of… | by Chao De-Yu | Towards Data Science
In Q-Learning, we represent the Q-value as a table. However, in many real-world problems, there are enormous state and/or action spaces and tabular representation is insufficient. For instance…
3 years ago by @analyst
tags: article, blog, dqn, pytorch, reinforcement-learning

Deep Q-Network with Pytorch. DQN | by Unnat Singh | Medium
When the agent interacts with the environment, the sequence of experienced tuples can be highly correlated. The naive Q-Learning algorithm that learns from each of these experience tuples in…
3 years ago by @analyst
tags: article, blog, dqn, pytorch, reinforcement-learning

Building a DQN in PyTorch: Balancing Cart Pole with Deep RL | by Mohit Pilkhan | Building Fynd
Hi Geeks, welcome to Part-3 of our Reinforcement Learning Series. In the last two blogs, we covered some basic concepts in RL and also studied the multi-armed bandit problem and its solution methods…
3 years ago by @analyst
tags: article, blog, dqn, pytorch, reinforcement-learning

labml.ai — Annotated PyTorch Paper Implementations
This is a collection of simple PyTorch implementations of neural networks and related algorithms
3 years ago by @analyst
tags: deep-learning, pytorch, reinforcement-learning, software

robo-gym
https://sites.google.com/view/robo-gym
2 years ago by @analyst
tags: reinforcement-learning, robotics, simulation

Distributional Reinforcement Learning
https://www.distributional-rl.org/
2 years ago by @analyst
tags: book, distributed-systems, reinforcement-learning

The Magic Behind ChatGPT: Reinforcement Learning with Human Feedback
One of the key enablers of the ChatGPT magic can be traced back to 2017 under the obscure name of reinforcement learning with human feedback (RLHF). Large language models (LLMs) have become one of the most interesting environments for applying modern reinforcement learning (RL) techniques. While LLMs are great at deriving knowledge from vast amounts of text, RL can help to translate that knowledge into actions. That has been the secret behind RLHF.
7 months ago by @ghagerer
tags: chatgpt, llms, openai, reinforcement-learning

publications (142)

Is Q-Learning Provably Efficient?
C. Jin, Z. Allen-Zhu, S. Bubeck, and M. Jordan. (2018)
5 years ago by @kirk86
tags: neurips2019, optimization, reinforcement-learning

Reinforcement Learning with Convex Constraints
S. Miryoosefi, K. Brantley, H. Daumé III, M. Dudik, and R. Schapire. (2019)
5 years ago by @kirk86
tags: neurips2019, optimization, reinforcement-learning

Multiple Futures Prediction
C. Tang, and R. Salakhutdinov. (2019)
5 years ago by @kirk86
tags: bayesian, neurips2019, probability, reinforcement-learning

Robust exploration in linear quadratic reinforcement learning
J. Umenberger, M. Ferizbegovic, T. Schön, and H. Hjalmarsson. (2019)
5 years ago by @kirk86
tags: neurips2019, reinforcement-learning, robustness

A Family of Robust Stochastic Operators for Reinforcement Learning
Y. Lu, M. Squillante, and C. Wu. (2019)
5 years ago by @kirk86
tags: neurips2019, reinforcement-learning, robustness

Q-learning
C. Watkins, and P. Dayan. Machine Learning, 8 (3): 279–292 (May 1, 1992)
5 years ago by @analyst
tags: reinforcement-learning

Training Agents using Upside-Down Reinforcement Learning
R. Srivastava, P. Shyam, F. Mutz, W. Jaśkowski, and J. Schmidhuber. (2019). arXiv:1912.02877. Comment: NNAISENSE Technical Report. 17 pages, 6 figures.
5 years ago by @analyst
tags: 2019, arxiv, reinforcement-learning

Combining Q-Learning and Search with Amortized Value Estimates
J. Hamrick, V. Bapst, A. Sanchez-Gonzalez, T. Pfaff, T. Weber, L. Buesing, and P. Battaglia. (2019). arXiv:1912.02807.
5 years ago by @analyst
tags: 2019, arxiv, reinforcement-learning

Introduction to Multi-Armed Bandits
A. Slivkins. (2019). arXiv:1904.07272. Comment: The manuscript is complete, but comments are very welcome! To be published with Foundations and Trends in Machine Learning.
4 years ago by @analyst
tags: 2019, arxiv, book, reinforcement-learning, tutorial

Reinforcement Learning via Fenchel-Rockafellar Duality
O. Nachum, and B. Dai. (2020). arXiv:2001.01866.
4 years ago by @kirk86
tags: duality, mathematics, reinforcement-learning

Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
I. Osband, and B. Van Roy. (2016). arXiv:1607.00215.
4 years ago by @kirk86
tags: approximate, bayesian, reinforcement-learning, sampling, uncertainty

Fast Rates for Online Prediction with Abstention
G. Neu, and N. Zhivotovskiy. (2020). arXiv:2001.10623. Comment: 19 pages.
4 years ago by @kirk86
tags: convergence, online-learning, readings, reinforcement-learning

Causally Correct Partial Models for Reinforcement Learning
D. Rezende, I. Danihelka, G. Papamakarios, N. Ke, R. Jiang, T. Weber, K. Gregor, H. Merzic, F. Viola, J. Wang and 4 other author(s). (2020). arXiv:2002.02836.
4 years ago by @kirk86
tags: causal-analysis, reinforcement-learning

Advanced Robotics – Fundamental Knowledge
P. Abbeel, L. Smith, I. Clavera, and H. Xu. (2019)
4 years ago by @kirk86
tags: readings, reinforcement-learning

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
A. Edwards, H. Sahni, R. Liu, J. Hung, A. Jain, R. Wang, A. Ecoffet, T. Miconi, C. Isbell, and J. Yosinski. (2020). arXiv:2002.09505.
4 years ago by @kirk86
tags: reinforcement-learning

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning
K. Lee, K. Lee, J. Shin, and H. Lee. (2019). arXiv:1910.05396. Comment: Accepted in ICLR 2020 and NeurIPS Workshop on Deep RL 2019 / The first two authors contributed equally.
4 years ago by @kirk86
tags: generalization, randomized, reinforcement-learning

On the Sample Complexity of Reinforcement Learning
S. Kakade. (2003)
4 years ago by @kirk86
tags: bounds, complexity, reinforcement-learning, sampling, thesis

How to Train Your Robot - New Environments for Robotic Training and New Methods for Transferring Policies from the Simulator to the Real Robot
F. Golemo. (2019)
4 years ago by @kirk86
tags: reinforcement-learning, thesis

Rainbow: Combining Improvements in Deep Reinforcement Learning
M. Hessel, J. Modayil, H. van Hasselt, T. Schaul, G. Ostrovski, W. Dabney, D. Horgan, B. Piot, M. Azar, and D. Silver. (2017). arXiv:1710.02298. Comment: Under review as a conference paper at AAAI 2018.
4 years ago by @kirk86
tags: reinforcement-learning

A Comparative Analysis of Expected and Distributional Reinforcement Learning
C. Lyle, P. Castro, and M. Bellemare. (2019). arXiv:1901.11084. Comment: To appear in the Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence.
4 years ago by @kirk86
tags: reinforcement-learning
