Policy-gradient-based actor-critic algorithms are among the most popular algorithms in the reinforcement learning framework. Because they can search for optimal policies using low-variance gradient estimates, they have proved useful in several real-life applications, such as robotics, power control, and finance. Although general surveys on reinforcement learning techniques already exist, none is dedicated specifically to actor-critic algorithms. This paper therefore describes the state of the art of actor-critic algorithms, focusing on methods that can work in an online setting and that use function approximation to handle continuous state and action spaces. After discussing the concepts of reinforcement learning and the origins of actor-critic algorithms, the paper describes the workings of the natural gradient, which has made its way into many actor-critic algorithms over the past few years. It then reviews several standard and natural actor-critic algorithms and concludes with an overview of application areas and a discussion of open issues.
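For context, the two gradient notions the survey contrasts are commonly written as follows. This is the standard textbook formulation (the notation is assumed here, not quoted from the paper): J(theta) is the expected return of the parameterized policy pi_theta, and F(theta) is the Fisher information matrix of the policy distribution.

% Standard policy gradient versus natural policy gradient, as commonly
% defined in the actor-critic literature this survey covers (notation
% assumed, not taken verbatim from the paper).
\begin{align*}
  \nabla_\theta J(\theta)
    &= \mathbb{E}_{s,a \sim \pi_\theta}\!\left[ \nabla_\theta \log \pi_\theta(a \mid s)\, Q^{\pi_\theta}(s,a) \right] \\
  F(\theta)
    &= \mathbb{E}_{s,a \sim \pi_\theta}\!\left[ \nabla_\theta \log \pi_\theta(a \mid s)\, \nabla_\theta \log \pi_\theta(a \mid s)^{\top} \right] \\
  \tilde{\nabla}_\theta J(\theta)
    &= F(\theta)^{-1}\, \nabla_\theta J(\theta)
\end{align*}

The natural gradient is steepest ascent under the Fisher metric rather than the Euclidean metric on the parameters, which makes the update invariant to the particular parameterization of the policy.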
%0 Journal Article
%1 grondman2012ActorCriticSurvey
%A Grondman, I.
%A Busoniu, L.
%A Lopes, G. A. D.
%A Babuska, R.
%D 2012
%J IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)
%K actor_critic final reinforcement survey thema:reinforcement_learning_recommender
%N 6
%P 1291-1307
%R 10.1109/TSMCC.2012.2218595
%T A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
%V 42
@article{grondman2012ActorCriticSurvey,
author = {Grondman, I. and Busoniu, L. and Lopes, G. A. D. and Babuska, R.},
doi = {10.1109/TSMCC.2012.2218595},
issn = {1094-6977},
journal = {IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)},
keywords = {actor_critic final reinforcement survey thema:reinforcement_learning_recommender},
month = nov,
number = 6,
pages = {1291--1307},
title = {A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients},
volume = 42,
year = 2012
}