Abstract
We show that a neural network with arbitrary depth and non-linearities, with
dropout applied before every weight layer, is mathematically equivalent to an
approximation to a well-known Bayesian model. This interpretation might offer
an explanation for some of dropout's key properties, such as its robustness to
over-fitting. Our interpretation allows us to reason about uncertainty in deep
learning, and allows the Bayesian machinery to be introduced into existing
deep learning frameworks in a principled way.
This document is the appendix to the main paper, "Dropout as a Bayesian
Approximation: Representing Model Uncertainty in Deep Learning" by Gal and
Ghahramani (2015).
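To make the abstract's claim about reasoning about uncertainty concrete, the following is a minimal sketch of Monte Carlo dropout: dropout is kept active at test time and predictions are averaged over several stochastic forward passes, with the sample variance serving as an uncertainty estimate. The PyTorch model, layer sizes, and dropout probability here are illustrative assumptions, not taken from the paper.

```python
# A minimal MC dropout sketch: keep dropout stochastic at test time and
# average over repeated forward passes. Architecture and hyperparameters
# below are assumptions for illustration only.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Dropout(p=0.5),        # dropout applied before every weight layer
    nn.Linear(10, 128),
    nn.ReLU(),
    nn.Dropout(p=0.5),
    nn.Linear(128, 1),
)

def mc_dropout_predict(model, x, n_samples=100):
    """Predictive mean and variance over stochastic forward passes."""
    model.train()  # train mode keeps dropout active during prediction
    with torch.no_grad():
        samples = torch.stack([model(x) for _ in range(n_samples)])
    return samples.mean(dim=0), samples.var(dim=0)

x = torch.randn(4, 10)               # a toy batch of 4 inputs
mean, var = mc_dropout_predict(model, x)
print(mean.shape, var.shape)         # torch.Size([4, 1]) for both
```

The variance across passes reflects model uncertainty: inputs far from the training data tend to produce more disagreement among the dropout-sampled sub-networks.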