Long short-term memory (LSTM; Hochreiter & Schmidhuber, 1997) can solve numerous tasks not solvable by previous learning algorithms for recurrent neural networks (RNNs). We identify a weakness of LSTM networks processing continual input streams that are not a priori segmented into subsequences with explicitly marked ends at which the network's internal state could be reset. Without resets, the state may grow indefinitely and eventually cause the network to break down. Our remedy is a novel, adaptive "forget gate" that enables an LSTM cell to learn to reset itself at appropriate times, thus releasing internal resources. We review illustrative benchmark problems on which standard LSTM outperforms other RNN algorithms. All algorithms (including LSTM) fail to solve continual versions of these problems. LSTM with forget gates, however, easily solves them, and in an elegant way.
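For orientation, the reset mechanism the abstract describes can be written compactly. The sketch below uses the now-conventional LSTM symbols ($f_t$, $i_t$, $c_t$), not necessarily the paper's own notation. The forget gate

\[
f_t = \sigma\!\left(W_f x_t + U_f h_{t-1} + b_f\right)
\]

multiplies the previous cell state in the update

\[
c_t = f_t \odot c_{t-1} + i_t \odot \tilde{c}_t ,
\]

whereas the standard 1997 LSTM fixes the cell's self-connection at 1, i.e. $c_t = c_{t-1} + i_t \odot \tilde{c}_t$, so on an unsegmented stream the internal state can only accumulate. With $f_t \in (0,1)$ learned from data, the cell can decay or fully reset its own state when old information becomes obsolete.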
@article{gers2000learning,
abstract = {{Long short-term memory (LSTM; Hochreiter \& Schmidhuber, 1997) can solve numerous tasks not solvable by previous learning algorithms for recurrent neural networks (RNNs). We identify a weakness of LSTM networks processing continual input streams that are not a priori segmented into subsequences with explicitly marked ends at which the network's internal state could be reset. Without resets, the state may grow indefinitely and eventually cause the network to break down. Our remedy is a novel, adaptive ``forget gate'' that enables an LSTM cell to learn to reset itself at appropriate times, thus releasing internal resources. We review illustrative benchmark problems on which standard LSTM outperforms other RNN algorithms. All algorithms (including LSTM) fail to solve continual versions of these problems. LSTM with forget gates, however, easily solves them, and in an elegant way.}},
author = {Gers, Felix A. and Schmidhuber, J\"{u}rgen and Cummins, Fred},
doi = {10.1162/089976600300015015},
journal = {Neural Computation},
month = oct,
number = 10,
pages = {2451--2471},
publisher = {MIT Press},
title = {{Learning to Forget: Continual Prediction with LSTM}},
url = {http://dx.doi.org/10.1162/089976600300015015},
volume = 12,
year = 2000
}