In this report we review memory-based meta-learning as a tool for building
sample-efficient strategies that learn from past experience to adapt to any
task within a target class. Our goal is to equip the reader with the conceptual
foundations of this tool for building new, scalable agents that operate on
broad domains. To do so, we present basic algorithmic templates for building
near-optimal predictors and reinforcement learners which behave as if they had
a probabilistic model that allowed them to efficiently exploit task structure.
Furthermore, we recast memory-based meta-learning within a Bayesian framework,
showing that the meta-learned strategies are near-optimal because they amortize
Bayes-filtered data, where the adaptation is implemented in the memory dynamics
as a state-machine of sufficient statistics. Essentially, memory-based
meta-learning translates the hard problem of probabilistic sequential inference
into a regression problem.
Description
[1905.03030] Meta-learning of Sequential Strategies
%0 Journal Article
%1 ortega2019metalearning
%A Ortega, Pedro A.
%A Wang, Jane X.
%A Rowland, Mark
%A Genewein, Tim
%A Kurth-Nelson, Zeb
%A Pascanu, Razvan
%A Heess, Nicolas
%A Veness, Joel
%A Pritzel, Alex
%A Sprechmann, Pablo
%A Jayakumar, Siddhant M.
%A McGrath, Tom
%A Miller, Kevin
%A Azar, Mohammad
%A Osband, Ian
%A Rabinowitz, Neil
%A György, András
%A Chiappa, Silvia
%A Osindero, Simon
%A Teh, Yee Whye
%A van Hasselt, Hado
%A de Freitas, Nando
%A Botvinick, Matthew
%A Legg, Shane
%D 2019
%K bayesian meta-learning optimization probability stats
%T Meta-learning of Sequential Strategies
%U http://arxiv.org/abs/1905.03030
%X In this report we review memory-based meta-learning as a tool for building
sample-efficient strategies that learn from past experience to adapt to any
task within a target class. Our goal is to equip the reader with the conceptual
foundations of this tool for building new, scalable agents that operate on
broad domains. To do so, we present basic algorithmic templates for building
near-optimal predictors and reinforcement learners which behave as if they had
a probabilistic model that allowed them to efficiently exploit task structure.
Furthermore, we recast memory-based meta-learning within a Bayesian framework,
showing that the meta-learned strategies are near-optimal because they amortize
Bayes-filtered data, where the adaptation is implemented in the memory dynamics
as a state-machine of sufficient statistics. Essentially, memory-based
meta-learning translates the hard problem of probabilistic sequential inference
into a regression problem.
@article{ortega2019metalearning,
  abstract      = {In this report we review memory-based meta-learning as a tool for building
sample-efficient strategies that learn from past experience to adapt to any
task within a target class. Our goal is to equip the reader with the conceptual
foundations of this tool for building new, scalable agents that operate on
broad domains. To do so, we present basic algorithmic templates for building
near-optimal predictors and reinforcement learners which behave as if they had
a probabilistic model that allowed them to efficiently exploit task structure.
Furthermore, we recast memory-based meta-learning within a Bayesian framework,
showing that the meta-learned strategies are near-optimal because they amortize
Bayes-filtered data, where the adaptation is implemented in the memory dynamics
as a state-machine of sufficient statistics. Essentially, memory-based
meta-learning translates the hard problem of probabilistic sequential inference
into a regression problem.},
  added-at      = {2019-05-09T13:27:02.000+0200},
  archiveprefix = {arXiv},
  author        = {Ortega, Pedro A. and Wang, Jane X. and Rowland, Mark and Genewein, Tim and Kurth-Nelson, Zeb and Pascanu, Razvan and Heess, Nicolas and Veness, Joel and Pritzel, Alex and Sprechmann, Pablo and Jayakumar, Siddhant M. and McGrath, Tom and Miller, Kevin and Azar, Mohammad and Osband, Ian and Rabinowitz, Neil and György, András and Chiappa, Silvia and Osindero, Simon and Teh, Yee Whye and van Hasselt, Hado and de Freitas, Nando and Botvinick, Matthew and Legg, Shane},
  biburl        = {https://www.bibsonomy.org/bibtex/23d183aa1eaac5446a63037f917d4d1cc/kirk86},
  description   = {[1905.03030] Meta-learning of Sequential Strategies},
  eprint        = {1905.03030},
  interhash     = {d722300f00de73c460f54797b8531d3f},
  intrahash     = {3d183aa1eaac5446a63037f917d4d1cc},
  journal       = {arXiv preprint arXiv:1905.03030},
  keywords      = {bayesian meta-learning optimization probability stats},
  note          = {DeepMind Technical Report (15 pages, 6 figures)},
  timestamp     = {2019-05-09T13:27:02.000+0200},
  title         = {Meta-learning of Sequential Strategies},
  url           = {http://arxiv.org/abs/1905.03030},
  year          = {2019}
}