Monte Carlo Gradient Estimation in Machine Learning

S. Mohamed, M. Rosca, M. Figurnov, and A. Mnih. (2019). arXiv:1906.10652. Comment: 59 pages, under review.

Abstract

This paper is a broad and accessible survey of the methods we have at our disposal for Monte Carlo gradient estimation in machine learning and across the statistical sciences: the problem of computing the gradient of an expectation of a function with respect to parameters defining the distribution that is integrated; the problem of sensitivity analysis. In machine learning research, this gradient problem lies at the core of many learning problems, in supervised, unsupervised and reinforcement learning. We will generally seek to rewrite such gradients in a form that allows for Monte Carlo estimation, allowing them to be easily and efficiently used and analysed. We explore three strategies--the pathwise, score function, and measure-valued gradient estimators--exploring their historical developments, derivation, and underlying assumptions. We describe their use in other fields, show how they are related and can be combined, and expand on their possible generalisations. Wherever Monte Carlo gradient estimators have been derived and deployed in the past, important advances have followed. A deeper and more widely-held understanding of this problem will lead to further advances, and it is these advances that we wish to support.

Description

[1906.10652] Monte Carlo Gradient Estimation in Machine Learning

Links and resources

BibTeX key: mohamed2019monte
entry type: article
year: 2019
url: http://arxiv.org/abs/1906.10652
note: cite arxiv:1906.10652. Comment: 59 pages, under review

Cite this publication

@article{mohamed2019monte,
  abstract = {This paper is a broad and accessible survey of the methods we have at our disposal for Monte Carlo gradient estimation in machine learning and across the statistical sciences: the problem of computing the gradient of an expectation of a function with respect to parameters defining the distribution that is integrated; the problem of sensitivity analysis. In machine learning research, this gradient problem lies at the core of many learning problems, in supervised, unsupervised and reinforcement learning. We will generally seek to rewrite such gradients in a form that allows for Monte Carlo estimation, allowing them to be easily and efficiently used and analysed. We explore three strategies--the pathwise, score function, and measure-valued gradient estimators--exploring their historical developments, derivation, and underlying assumptions. We describe their use in other fields, show how they are related and can be combined, and expand on their possible generalisations. Wherever Monte Carlo gradient estimators have been derived and deployed in the past, important advances have followed. A deeper and more widely-held understanding of this problem will lead to further advances, and it is these advances that we wish to support.},
  added-at = {2019-07-17T14:31:50.000+0200},
  author = {Mohamed, Shakir and Rosca, Mihaela and Figurnov, Michael and Mnih, Andriy},
  biburl = {https://www.bibsonomy.org/bibtex/2b457fc94595648ed0cd0057ee71f0651/kirk86},
  description = {[1906.10652] Monte Carlo Gradient Estimation in Machine Learning},
  interhash = {af30c826b8b860884bc1b0d2538d2a8b},
  intrahash = {b457fc94595648ed0cd0057ee71f0651},
  keywords = {bayesian gradients mcmc optimization probability readings stats survey},
  note = {cite arxiv:1906.10652Comment: 59 pages, under review},
  timestamp = {2020-07-16T12:15:17.000+0200},
  title = {Monte Carlo Gradient Estimation in Machine Learning},
  url = {http://arxiv.org/abs/1906.10652},
  year = 2019
}
