Long short-term memory

被引:9179
|
作者
Hochreiter, S [1 ]
Schmidhuber, J [1 ]
机构
[1] IDSIA, CH-6900 LUGANO, SWITZERLAND
关键词
D O I
10.1162/neco.1997.9.8.1735
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error now through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
引用
收藏
页码:1735 / 1780
页数:46
相关论文
共 50 条
  • [41] SHORT-TERM AND LONG-TERM-MEMORY IN SINGLE CELLS
    MORIMOTO, BH
    KOSHLAND, DE
    [J]. FASEB JOURNAL, 1991, 5 (07): : 2061 - 2067
  • [42] Effect of corticosteroids on short-term and long-term memory
    Brunner, R
    Schaefer, D
    Hess, K
    Parzer, P
    Resch, F
    Schwab, S
    [J]. NEUROLOGY, 2005, 64 (02) : 335 - 337
  • [43] NOTE ON INTERFERENCE IN SHORT-TERM AND LONG-TERM-MEMORY
    SICZ, G
    FOREST, J
    [J]. PSYCHOLOGICAL REPORTS, 1975, 36 (01) : 338 - 338
  • [44] Short-Term Memory to Long-Term Memory Transition in a Nanoscale Memristor
    Chang, Ting
    Jo, Sung-Hyun
    Lu, Wei
    [J]. ACS NANO, 2011, 5 (09) : 7669 - 7676
  • [45] The time needed to consolidate short-term memory to long-term memory
    Takeyama, E
    Takenoshita, M
    Nishimura, S
    Yoshiya, I
    [J]. ANESTHESIOLOGY, 1998, 89 (3A) : U317 - U317
  • [46] Two circuits to convert short-term memory into long-term memory
    Wong, CW
    [J]. MEDICAL HYPOTHESES, 1997, 49 (05) : 375 - 378
  • [47] Long Short Term Memory Networks for Short-Term Electric Load Forecasting
    Narayan, Apurva
    Hipel, Keith W.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 2573 - 2578
  • [48] Reference evapotranspiration estimation using long short-term memory network and wavelet-coupled long short-term memory network
    Long, Xiaoxu
    Wang, Jiandong
    Gong, Shihong
    Li, Guangyong
    Ju, Hui
    [J]. IRRIGATION AND DRAINAGE, 2022, 71 (04) : 855 - 881
  • [49] Time Series-based Spoof Speech Detection Using Long Short-term Memory and Bidirectional Long Short-term Memory
    Mirza, Arsalan R.
    Al-Talabani, Abdulbasit K.
    [J]. ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2024, 12 (02): : 119 - 129
  • [50] Verbal short-term memory reflects the organization of long-term memory: Further evidence from short-term memory for emotional words
    Majerus, Steve
    D'Argembeau, Arnaud
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2011, 64 (02) : 181 - 197