tag :: reinforcement-learning

закладки (спрятать)104
показать
всё
только закладки
закладки на страницу
5
10
20
50
100
RSS
BibTeX
XML

1The Magic Behind ChatGPT: Reinforcement Learning with Human Feedback
One of the key enablers of the ChatGPT magic can be traced back to 2017 under the obscure name of reinforcement learning with human feedback(RLHF). Large language models(LLMs) have become one of the most interesting environments for applying modern reinforcement learning(RL) techniques. While LLMs are great at deriving knowledge from vast amounts of text, RL can help to translate that knowledge into actions. That has been the secret behind RLHF.
2 лет назад , @ghagerer
chatgpt
llms
openai
reinforcement-learning
chatgptllmsopenaireinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Distributional Reinforcement Learning
https://www.distributional-rl.org/
4 лет назад , @analyst
book
distributed-systems
reinforcement-learning
bookdistributed-systemsreinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1robo-gym
https://sites.google.com/view/robo-gym
4 лет назад , @analyst
reinforcement-learning
robotics
simulation
reinforcement-learningroboticssimulation
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1labml.ai — Annotated PyTorch Paper Implementations
This is a collection of simple PyTorch implementations of neural networks and related algorithms
4 лет назад , @analyst
deep-learning
pytorch
reinforcement-learning
software
deep-learningpytorchreinforcement-learningsoftware
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Building a DQN in PyTorch: Balancing Cart Pole with Deep RL | by Mohit Pilkhan | Building Fynd
Hi Geeks, welcome to Part-3 of our Reinforcement Learning Series. In the last two blogs, we covered some basic concepts in RL and also studied the multi-armed bandit problem and its solution methods…
4 лет назад , @analyst
article
blog
dqn
pytorch
reinforcement-learning
articleblogdqnpytorchreinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Deep Q-Network with Pytorch. DQN | by Unnat Singh | Medium
When the agent interacts with the environment, the sequence of experienced tuples can be highly correlated. The naive Q-Learning algorithm that learns from each of these experience tuples in…
4 лет назад , @analyst
article
blog
dqn
pytorch
reinforcement-learning
articleblogdqnpytorchreinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Deep Q-Network, with PyTorch. Explaining the fundamentals of… | by Chao De-Yu | Towards Data Science
In Q-Learning, we represent the Q-value as a table. However, in many real-world problems, there are enormous state and/or action spaces and tabular representation is insufficient. For instance…
4 лет назад , @analyst
article
blog
dqn
pytorch
reinforcement-learning
articleblogdqnpytorchreinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Deep Q Networks (DQN) | labml.ai
This is a PyTorch implementation/tutorial of Deep Q Networks (DQN) from paper Playing Atari with Deep Reinforcement Learning. This includes dueling network architecture, a prioritized replay buffer and double-Q-network training.
4 лет назад , @analyst
article
blog
deep-learning
pytorch
reinforcement-learning
tutorial
articleblogdeep-learningpytorchreinforcement-learningtutorial
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Off-policy vs On-Policy vs Offline Reinforcement Learning Demystified! | by Kowshik chilamkurthy | Medium
In this article, we will try to understand where On-Policy learning, Off-policy learning and offline learning algorithms fundamentally differ. Though there is a fair amount of intimidating jargon in…
4 лет назад , @analyst
article
blog
reinforcement-learning
articleblogreinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Yoshua Bengio Team Designs Consciousness-Inspired Planning Agent for Model-Based RL | by Synced | SyncedReview | Jun, 2021 | Medium
A research team from McGill University, Université de Montréal, DeepMind and Mila presents an end-to-end, model-based deep reinforcement learning (RL) agent that dynamically attends to relevant parts of its environments to facilitate out-of-distribution (OOD) and systematic generalization.
4 лет назад , @analyst
2021
machine-learning
reinforcement-learning
2021machine-learningreinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
2Evolution, rewards, and artificial intelligence – TechTalks
A paper by DeepMind scientist triggered much debate about the path to artificial intelligence. Here, we'll try to draw the line between theory and practice.
4 лет назад , @analyst
2021
article
blog
google
lecture
reinforcement-learning
2021articlebloggooglelecturereinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1The Best 26 Python Reinforcement Learning Libraries | PythonRepo
https://pythonrepo.com/catalog/python-reinforcement-learning_newest_1
4 лет назад , @analyst
python
recommendation
reinforcement-learning
pythonrecommendationreinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Deep Learning & Reinforcement Learning
Chris G. Willcocks, Durham University
4 лет назад , @analyst
collection
course
deep-learning
lecture
reinforcement-learning
collectioncoursedeep-learninglecturereinforcement-learning
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Welcome to Spinning Up in Deep RL! | OpenAI
https://spinningup.openai.com/en/latest/
5 лет назад , @analyst
deep-learning
documentation
opensource
reinforcement-learning
tutorial
deep-learningdocumentationopensourcereinforcement-learningtutorial
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Deep Reinforcement Learning | Simons Institute
- Sep. 28 – Oct. 2, 2020 - Lihong Li (Google Brain; chair), Marc G. Bellemare (Google Brain) - The success of deep neural networks in modeling complicated functions has recently been applied by the reinforcement learning community, resulting in algorithms that are able to learn in environments previously thought to be much too large. Successful applications span domains from robotics to health care. However, the success is not well understood from a theoretical perspective. What are the modeling choices necessary for good performance, and how does the flexibility of deep neural nets help learning? This workshop will connect practitioners to theoreticians with the goal of understanding the most impactful modeling decisions and the properties of deep neural networks that make them so successful. Specifically, we will study the ability of deep neural nets to approximate in the context of reinforcement learning.
5 лет назад , @analyst
2020
deep-learning
reinforcement-learning
tutorial
workshop
2020deep-learningreinforcement-learningtutorialworkshop
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Theory of Reinforcement Learning Boot Camp | Simons Institute
- Aug. 31 – Sep. 4, 2020 - Csaba Szepesvari (University of Alberta, Google DeepMind; chair), Emma Brunskill (Stanford University), Sébastien Bubeck (MSR), Alan Malek (DeepMind), Sean Meyn (University of Florida), Ambuj Tewari (University of Michigan), Mengdi Wang (Princeton)
5 лет назад , @analyst
2020
reinforcement-learning
tutorial
workshop
2020reinforcement-learningtutorialworkshop
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Theory of Reinforcement Learning | Simons Institute for the Theory of Computing
This program aims to reunite researchers across disciplines that have played a role in developing the theory of reinforcement learning. It will review past developments and identify promising directions of research, with an emphasis on addressing existing open problems, ranging from the design of efficient, scalable algorithms for exploration to how to control learning and planning. It also aims to deepen the understanding of model-free vs. model-based learning and control, and the design of efficient methods to exploit structure and adapt to easier environments.
5 лет назад , @analyst
2020
collection
course
reinforcement-learning
tutorial
workshop
2020collectioncoursereinforcement-learningtutorialworkshop
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1MS&E338 Reinforcement Learning
https://web.stanford.edu/class/msande338/
5 лет назад , @analyst
2020
course
reinforcement-learning
stanford
2020coursereinforcement-learningstanford
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1RL theory seminars
RL theory seminars
5 лет назад , @kirk86
reinforcement-learning
theory
tutorials
website
reinforcement-learningtheorytutorialswebsite
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи
1Reinforcement Learning: Theory and Algorithms
University of Washington. Research interests: Machine Learning, Artificial Intelligence, Optimization, Statistics
6 лет назад , @kirk86
book
reinforcement-learning
website
bookreinforcement-learningwebsite
(0)
копироватьудалить
- Запись сообщества
- посмотреть историю записи

&lang;&lang;
⟨
1
2
3
&rang;
⟩⟩

публикации (спрятать)147
показать
всё
только публикации
публикации на страницу
5
10
20
50
100
расширенный...
RSS
BibTeX
RDF
дальше...

1Q-DPM: An Efficient Model-Free Dynamic Power Management Technique
M. Li, X. Wu, R. Yao, и L. Yan. Design Automation Conference, (2005)
18 лет назад , @derkling
DATE
DPM
Energy
Estimation
Model-free
Optimization
Q-Learning
Reinforcement-Learning
DATEDPMEnergyEstimationModel-freeOptimizationQ-LearningReinforcement-Learning
(0)
копироватьудалитьдобавить публикацию в буфер
3Modelling Motivation as an Intrinsic Reward Signal for Reinforcement Learning Agents
K. Merrick, и M. Mahler. Machine Learning, (2006)not yet accepted..
19 лет назад , @gromgull
interestingness
motivation
reinforcement-learning
interestingnessmotivationreinforcement-learning
(0)
копироватьудалитьдобавить публикацию в буфер
2Importance Sampling for Reinforcement Learning with Multiple Objectives
C. Shelton. Massachusetts Institute of Technology, (августа 2001)
19 лет назад , @darius
reinforcement-learning
reinforcement-learning
(0)
копироватьудалитьдобавить публикацию в буфер
3Multi-criteria reinforcement learning
Z. Gabor, Z. Kalmar, и C. Szepesvari. International Conference on Machine Learning (ICML-98), Madison, WI, (1998)
19 лет назад , @darius
cogs
reinforcement-learning
cogsreinforcement-learning
(0)
копироватьудалитьдобавить публикацию в буфер
4Dynamic Preferences in Multi-Criteria Reinforcement Learning
S. Natarajan, и P. Tadepalli. Proceedings of the 22nd International Conference on Machine Learning (ICML), Bonn, Germany, (2005)
19 лет назад , @darius
cogs
reinforcement-learning
cogsreinforcement-learning
(0)
копироватьудалитьдобавить публикацию в буфер
2A Survey of Reinforcement Learning in Relational Domains
M. van Otterlo. (июля 2005)
19 лет назад , @darius
to-read
reinforcement-learning
to-readreinforcement-learning
(0)
копироватьудалитьдобавить публикацию в буфер
4Learning in Embedded Systems
L. Kaelbling. The MIT Press, (мая 1993)
20 лет назад , @darius
bandit-problems
books
reinforcement-learning
bandit-problemsbooksreinforcement-learning
(0)
копироватьудалитьдобавить публикацию в буфер

&lang;&lang;
⟨
6
7
8
&rang;
⟩⟩

BibSonomy

закладки (спрятать)104
показать
всё
только закладки
закладки на страницу
5
10
20
50
100
RSS
BibTeX
XML

1The Magic Behind ChatGPT: Reinforcement Learning with Human Feedback

1Distributional Reinforcement Learning

1robo-gym

1labml.ai — Annotated PyTorch Paper Implementations

1Building a DQN in PyTorch: Balancing Cart Pole with Deep RL | by Mohit Pilkhan | Building Fynd

1Deep Q-Network with Pytorch. DQN | by Unnat Singh | Medium

1Deep Q-Network, with PyTorch. Explaining the fundamentals of… | by Chao De-Yu | Towards Data Science

1Deep Q Networks (DQN) | labml.ai

1Off-policy vs On-Policy vs Offline Reinforcement Learning Demystified! | by Kowshik chilamkurthy | Medium

1Yoshua Bengio Team Designs Consciousness-Inspired Planning Agent for Model-Based RL | by Synced | SyncedReview | Jun, 2021 | Medium

2Evolution, rewards, and artificial intelligence – TechTalks

1The Best 26 Python Reinforcement Learning Libraries | PythonRepo

1Deep Learning & Reinforcement Learning

1Welcome to Spinning Up in Deep RL! | OpenAI

1Deep Reinforcement Learning | Simons Institute

1Theory of Reinforcement Learning Boot Camp | Simons Institute

1Theory of Reinforcement Learning | Simons Institute for the Theory of Computing

1MS&E338 Reinforcement Learning

1RL theory seminars

1Reinforcement Learning: Theory and Algorithms

публикации (спрятать)147
показать
всё
только публикации
публикации на страницу
5
10
20
50
100
расширенный...
RSS
BibTeX
RDF
дальше...

1Q-DPM: An Efficient Model-Free Dynamic Power Management Technique

3Modelling Motivation as an Intrinsic Reward Signal for Reinforcement Learning Agents

2Importance Sampling for Reinforcement Learning with Multiple Objectives

3Multi-criteria reinforcement learning

4Dynamic Preferences in Multi-Criteria Reinforcement Learning

2A Survey of Reinforcement Learning in Relational Domains

4Learning in Embedded Systems

просмотр

сходные по теме тэги

сходные по популярности тэги

закладки (спрятать)104 показатьвсётолько закладкизакладки на страницу5102050100 RSSBibTeXXML

публикации (спрятать)147 показатьвсётолько публикациипубликации на страницу5102050100 расширенный... RSSBibTeXRDFдальше...

просмотр

сходные по теме тэги

сходные по популярности тэги

закладки (спрятать)104
показать
всё
только закладки
закладки на страницу
5
10
20
50
100
RSS
BibTeX
XML

публикации (спрятать)147
показать
всё
только публикации
публикации на страницу
5
10
20
50
100
расширенный...
RSS
BibTeX
RDF
дальше...