Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long-term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book we focus on those algorithms of reinforcement learning which build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, a large number of state-of-the-art algorithms, followed by the discussion of their theoretical properties and limitations.
%0 Book
%1 szepesvari2010c
%A Szepesvári, Cs.
%D 2010
%I Morgan and Claypool
%K Monte-Carlo PAC-learning, Q-learning, active actor-critic approximation, bias-variance difference function gradient gradient, learning, least-squares methods, natural online optimization, overfitting, planning, policy reinforcement simulation simulation, stochastic temporal tradeoff, two-timescale
%R 10.2200/S00268ED1V01Y201005AIM009
%T Algorithms for Reinforcement Learning
%U http://www.ualberta.ca/~szepesva/RLBook.html
%X Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book we focus on those algorithms of reinforcement learning which build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations.
@book{szepesvari2010c,
  abstract      = {Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book we focus on those algorithms of reinforcement learning which build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations.},
  added-at      = {2020-03-17T03:03:01.000+0100},
  author        = {Szepesv{\'a}ri, Csaba},
  bdsk-url-1    = {http://www.ualberta.ca/~szepesva/RLBook.html},
  biburl        = {https://www.bibsonomy.org/bibtex/2c10cedf299544b22d58d2534737c1082/csaba},
  date-added    = {2010-08-28 20:41:24 -0600},
  date-modified = {2012-06-03 14:07:36 -0600},
  doi           = {10.2200/S00268ED1V01Y201005AIM009},
  interhash     = {c0d5fd2b0ae378115aaf15952c6aa949},
  intrahash     = {c10cedf299544b22d58d2534737c1082},
  keywords      = {Monte-Carlo PAC-learning, Q-learning, active actor-critic approximation, bias-variance difference function gradient gradient, learning, least-squares methods, natural online optimization, overfitting, planning, policy reinforcement simulation simulation, stochastic temporal tradeoff, two-timescale},
  month         = jul,
  pdf           = {papers/RLAlgsInMDPs-lecture.pdf},
  publisher     = {Morgan and Claypool},
  timestamp     = {2020-03-17T03:03:01.000+0100},
  title         = {Algorithms for Reinforcement Learning},
  url           = {http://www.ualberta.ca/~szepesva/RLBook.html},
  year          = {2010},
}