A review on the long short-term memory model

被引:649
|
作者
Van Houdt, Greg [1 ]
Mosquera, Carlos [2 ]
Napoles, Gonzalo [1 ,3 ]
机构
[1] Hasselt Univ, Fac Business Econ, Agoralaan Gebouw D, B-3590 Diepenbeek, Belgium
[2] Vrije Univ Brussel, Artificial Intelligence Lab, Pl Laan 9, B-1050 Brussels, Belgium
[3] Tilburg Univ, Dept Cognit Sci & Artificial Intelligence, Warandelaan 2, NL-5037 AB Tilburg, Netherlands
关键词
Recurrent neural networks; Vanishing; exploding gradient; Long short-term memory; Deep learning; CONVOLUTIONAL NEURAL-NETWORK; FRAMEWISE PHONEME CLASSIFICATION; DEEP LEARNING APPROACH; BIDIRECTIONAL LSTM; ENTITY DISAMBIGUATION; RECURRENT NETWORKS; SENTIMENT ANALYSIS; ANOMALY DETECTION; NATURAL-LANGUAGE; TIME-SERIES;
D O I
10.1007/s10462-020-09838-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long short-term memory (LSTM) has transformed both machine learning and neurocomputing fields. According to several online sources, this model has improved Google's speech recognition, greatly improved machine translations on Google Translate, and the answers of Amazon's Alexa. This neural system is also employed by Facebook, reaching over 4 billion LSTM-based translations per day as of 2017. Interestingly, recurrent neural networks had shown a rather discrete performance until LSTM showed up. One reason for the success of this recurrent network lies in its ability to handle the exploding/vanishing gradient problem, which stands as a difficult issue to be circumvented when training recurrent or very deep neural networks. In this paper, we present a comprehensive review that covers LSTM's formulation and training, relevant applications reported in the literature and code resources implementing this model for a toy example.
引用
收藏
页码:5929 / 5955
页数:27
相关论文
共 50 条
  • [1] A review on the long short-term memory model
    Greg Van Houdt
    Carlos Mosquera
    Gonzalo Nápoles
    [J]. Artificial Intelligence Review, 2020, 53 : 5929 - 5955
  • [2] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
    Peng Chen
    Rong Wang
    Yibin Yao
    Hao Chen
    Zhihao Wang
    Zhiyuan An
    [J]. Journal of Geodesy, 2023, 97
  • [3] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
    Chen, Peng
    Wang, Rong
    Yao, Yibin
    Chen, Hao
    Wang, Zhihao
    An, Zhiyuan
    [J]. JOURNAL OF GEODESY, 2023, 97 (05)
  • [4] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [5] A phenomenological memristor model for short-term/long-term memory
    Chen, Ling
    Li, Chuandong
    Huang, Tingwen
    Ahmad, Hafiz Gulfam
    Chen, Yiran
    [J]. PHYSICS LETTERS A, 2014, 378 (40) : 2924 - 2930
  • [6] A brain model with the circuit to convert short-term memory into long-term memory
    Wong, CW
    [J]. MEDICAL HYPOTHESES, 1997, 48 (03) : 221 - 226
  • [7] Long Short-Term Memory Model for Drone Detection and Classification
    Kunze, S.
    Saha, B.
    [J]. 2024 4TH URSI ATLANTIC RADIO SCIENCE MEETING, AT-RASC 2024, 2024,
  • [8] Enhanced Long Short-Term Memory Model for Runoff Prediction
    Feng, Rui
    Fan, Guangwei
    Lin, Jianyi
    Yao, Baozhen
    Guo, Qinghai
    [J]. JOURNAL OF HYDROLOGIC ENGINEERING, 2021, 26 (02)
  • [9] Short-term Load Forecasting with Distributed Long Short-Term Memory
    Dong, Yi
    Chen, Yang
    Zhao, Xingyu
    Huang, Xiaowei
    [J]. 2023 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE, ISGT, 2023,
  • [10] LIPREADING WITH LONG SHORT-TERM MEMORY
    Wand, Michael
    Koutnik, Jan
    Schmidhuber, Jurgen
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6115 - 6119