Inproceedings,

Invertible Residual Networks

J. Behrmann, W. Grathwohl, R. Chen, D. Duvenaud, and J. Jacobsen.
Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, page 573--582. Long Beach, California, USA, PMLR, (09--15 Jun 2019)

Full text

Abstract

We show that standard ResNet architectures can be made invertible, allowing the same model to be used for classification, density estimation, and generation. Typically, enforcing invertibility requires partitioning dimensions or restricting network architectures. In contrast, our approach only requires adding a simple normalization step during training, already available in standard frameworks. Invertible ResNets define a generative model which can be trained by maximum likelihood on unlabeled data. To compute likelihoods, we introduce a tractable approximation to the Jacobian log-determinant of a residual block. Our empirical evaluation shows that invertible ResNets perform competitively with both state-of-the-art image classifiers and flow-based generative models, something that has not been previously achieved with a single architecture.

BibTeX key: pmlr-v97-behrmann19a
entry type: inproceedings
address: Long Beach, California, USA
booktitle: Proceedings of the 36th International Conference on Machine Learning
year: 2019
month: 09--15 Jun
pages: 573--582
publisher: PMLR
series: Proceedings of Machine Learning Research
volume: 97
pdf: http://proceedings.mlr.press/v97/behrmann19a/behrmann19a.pdf
Document: http://proceedings.mlr.press/v97/behrmann19a.html

BibSonomy

Invertible Residual Networks

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on