NEWLSTM: An Optimized Long Short-Term Memory Language Model for Sequence Prediction

被引:11
|
作者
Wang, Qing [1 ]
Peng, Rong-Qun [1 ]
Wang, Jia-Qiang [2 ]
Li, Zhi [3 ]
Qu, Han-Bing [2 ]
机构
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255049, Peoples R China
[2] Beijing Acad Sci & Technol, Key Lab Artificial Intelligence & Data Anal, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100049, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Logic gates; Recurrent neural networks; Task analysis; Predictive models; Natural language processing; Context modeling; Data models; Gate fusion; exploding gradient; long short-term memory; recurrent neural network; NETWORKS;
D O I
10.1109/ACCESS.2020.2985418
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The long short-term memory (LSTM) model trained on the universal language modeling task overcomes the bottleneck of vanishing gradients in the traditional recurrent neural network (RNN) and shows excellent performance in processing multiple tasks generated by natural language processing. Although LSTM effectively alleviates the vanishing gradient problem in the RNN, the information will be greatly lost in the long distance transmission, and there are still some limitations in its practical use. In this paper, we propose a new model called NEWLSTM, which improves the LSTM model, and alleviates the defects of too many parameters in LSTM and the vanishing gradient. The NEWLSTM model directly correlates the cell state information with current information. The traditional LSTM & x2019;s input gate and forget gate are integrated, some components are deleted, the problems of too many LSTM parameters and complicated calculations are solved, and the iteration time is effectively reduced. In this paper, a neural network model is used to identify the relationship between input information sequences to predict the language sequence. The experimental results show that the improved new model is simpler than traditional LSTM models and LSTM variants on multiple test sets. NEWLSTM has better overall stability and can better solve the sparse words problem.
引用
下载
收藏
页码:65395 / 65401
页数:7
相关论文
共 50 条
  • [21] Short-term wind speed prediction model based on long short-term memory network with feature extraction
    Zhongda Tian
    Xiyan Yu
    Guokui Feng
    Earth Science Informatics, 2025, 18 (4)
  • [22] Deep Bi-directional Long Short-Term Memory Model for Short-Term Traffic Flow Prediction
    Wang, Jingyuan
    Hu, Fei
    Li, Li
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 306 - 316
  • [23] Long Short-Term Memory (LSTM) model for Indian sign language recognition
    Nihalani R.
    Chouhan S.S.
    Mittal D.
    Vadula J.
    Thakur S.
    Chakraborty S.
    Patel R.K.
    Singh U.P.
    Ghosh R.
    Singh P.
    Saxena A.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 11185 - 11203
  • [24] Short-Term Traffic Prediction Using Long Short-Term Memory Neural Networks
    Abbas, Zainab
    Al-Shishtawy, Ahmad
    Girdzijauskas, Sarunas
    Vlassov, Vladimir
    2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 57 - 65
  • [25] Short-Term Prediction of Wind Power Based on Deep Long Short-Term Memory
    Qu Xiaoyun
    Kang Xiaoning
    Zhang Chao
    Jiang Shuai
    Ma Xiuda
    2016 IEEE PES ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), 2016, : 1148 - 1152
  • [26] Short-Term Relay Quality Prediction Algorithm Based on Long and Short-Term Memory
    XUE Wendong
    CHAI Yuan
    LI Qigan
    HONG Yongqiang
    ZHENG Gaofeng
    Instrumentation, 2018, 5 (04) : 46 - 54
  • [27] Research on short-term disease risk prediction based on long short-term memory
    Feng, Yanjun
    Wang, Hongxia
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2021, 128 : 176 - 176
  • [28] Short-term wind power prediction based on combined long short-term memory
    Zhao, Yuyang
    Li, Lincong
    Guo, Yingjun
    Shi, Boming
    Sun, Hexu
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2024, 18 (05) : 931 - 940
  • [29] Optimized long short-term memory-based stock price prediction with sentiment score
    Yalanati Ayyappa
    A. P. Siva Kumar
    Social Network Analysis and Mining, 13
  • [30] A review on the long short-term memory model
    Van Houdt, Greg
    Mosquera, Carlos
    Napoles, Gonzalo
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (08) : 5929 - 5955