NEWLSTM: An Optimized Long Short-Term Memory Language Model for Sequence Prediction

被引:11
|
作者
Wang, Qing [1 ]
Peng, Rong-Qun [1 ]
Wang, Jia-Qiang [2 ]
Li, Zhi [3 ]
Qu, Han-Bing [2 ]
机构
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255049, Peoples R China
[2] Beijing Acad Sci & Technol, Key Lab Artificial Intelligence & Data Anal, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100049, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Logic gates; Recurrent neural networks; Task analysis; Predictive models; Natural language processing; Context modeling; Data models; Gate fusion; exploding gradient; long short-term memory; recurrent neural network; NETWORKS;
D O I
10.1109/ACCESS.2020.2985418
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The long short-term memory (LSTM) model trained on the universal language modeling task overcomes the bottleneck of vanishing gradients in the traditional recurrent neural network (RNN) and shows excellent performance in processing multiple tasks generated by natural language processing. Although LSTM effectively alleviates the vanishing gradient problem in the RNN, the information will be greatly lost in the long distance transmission, and there are still some limitations in its practical use. In this paper, we propose a new model called NEWLSTM, which improves the LSTM model, and alleviates the defects of too many parameters in LSTM and the vanishing gradient. The NEWLSTM model directly correlates the cell state information with current information. The traditional LSTM & x2019;s input gate and forget gate are integrated, some components are deleted, the problems of too many LSTM parameters and complicated calculations are solved, and the iteration time is effectively reduced. In this paper, a neural network model is used to identify the relationship between input information sequences to predict the language sequence. The experimental results show that the improved new model is simpler than traditional LSTM models and LSTM variants on multiple test sets. NEWLSTM has better overall stability and can better solve the sparse words problem.
引用
收藏
页码:65395 / 65401
页数:7
相关论文
共 50 条
  • [1] ELSTM: An improved long short-term memory network language model for sequence learning
    Li, Zhi
    Wang, Qing
    Wang, Jia-Qiang
    Qu, Han-Bing
    Dong, Jichang
    Dong, Zhi
    [J]. EXPERT SYSTEMS, 2024, 41 (06)
  • [2] MFOA-Bi-LSTM: An optimized bidirectional long short-term memory model for short-term traffic flow prediction
    Naheliya, Bharti
    Redhu, Poonam
    Kumar, Kranti
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 634
  • [3] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
    Peng Chen
    Rong Wang
    Yibin Yao
    Hao Chen
    Zhihao Wang
    Zhiyuan An
    [J]. Journal of Geodesy, 2023, 97
  • [4] A Causal Long Short-Term Memory Sequence to Sequence Model for TEC Prediction Using GNSS Observations
    Kaselimi, Maria
    Voulodimos, Athanasios
    Doulamis, Nikolaos
    Doulamis, Anastasios
    Delikaraoglou, Demitris
    [J]. REMOTE SENSING, 2020, 12 (09)
  • [5] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
    Chen, Peng
    Wang, Rong
    Yao, Yibin
    Chen, Hao
    Wang, Zhihao
    An, Zhiyuan
    [J]. JOURNAL OF GEODESY, 2023, 97 (05)
  • [6] IGBT Lifetime Prediction Model Based on Optimized Long Short-Term Memory Neural Network
    Ren H.
    Yu Y.
    Du X.
    Liu J.
    Zhou J.
    [J]. Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2024, 39 (04): : 1074 - 1086
  • [7] Refining Short-Term Power Load Forecasting: An Optimized Model with Long Short-Term Memory Network
    Hu S.
    Cai W.
    Liu J.
    Shi H.
    Yu J.
    [J]. Journal of Computing and Information Technology, 2023, 31 (03) : 151 - 166
  • [8] Enhanced Long Short-Term Memory Model for Runoff Prediction
    Feng, Rui
    Fan, Guangwei
    Lin, Jianyi
    Yao, Baozhen
    Guo, Qinghai
    [J]. JOURNAL OF HYDROLOGIC ENGINEERING, 2021, 26 (02)
  • [9] Sparrow Search Algorithm-Optimized Long Short-Term Memory Model for Stock Trend Prediction
    Liu, Feiyang
    Qin, Panke
    You, Junru
    Fu, Yanyan
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] Milling tool wear prediction: optimized long short-term memory model based on attention mechanism
    Liu, Yiming
    Yang, Shucai
    Sun, Tao
    Zhang, Yuhua
    [J]. FERROELECTRICS, 2023, 607 (01) : 56 - 72