Novel training algorithms for long short-term memory neural network

被引:7
|
作者
Li, Xiaodong [1 ]
Yu, Changjun [2 ]
Su, Fulin [1 ]
Quan, Taifan [2 ]
Yang, Xuguang [1 ]
机构
[1] Harbin Inst Technol, Dept Informat & Commun Engn, Harbin, Heilongjiang, Peoples R China
[2] Harbin Inst Technol, Dept Informat & Commun Engn, Weihai, Peoples R China
基金
中国国家自然科学基金;
关键词
FILTERS; PREDICTION;
D O I
10.1049/iet-spr.2018.5240
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
More recently, due to the enormous potential of long short-term memory (LSTM) neural network in various fields, some efficient training algorithms have been developed, including the extended Kalman filter (EKF)-based training algorithm and particle filter (PF)-based training algorithm. However, it should be noted that if the system is highly non-linear, the linearisation employed in the EKF may cause instability. Moreover, the PF usually suffers from the particle degeneracy. Therefore, the PF-based training algorithm may only find a poor local optimum. To solve these problems, an unscented Kalman filter (UKF)-based training algorithm is proposed. The UKF employs a deterministic sampling method; hence, there is no linearisation in it and it does not have the degeneracy problem. Moreover, the computational complexity of the UKF is the same order as that of the EKF. To further reduce the computational complexity, the authors propose a minimum norm UKF (MN-UKF) to obtain a good trade-off between performance and complexity. To the best of the authors' knowledge, this is the first reported solution to this problem. Simulations using both benchmark synthetic signal and real-world signal illustrate the potential of the algorithms developed.
引用
收藏
页码:304 / 308
页数:5
相关论文
共 50 条
  • [1] Deep solar radiation forecasting with convolutional neural network and long short-term memory network algorithms
    Ghimire, Sujan
    Deo, Ravinesh C.
    Raj, Nawin
    Mi, Jianchun
    [J]. APPLIED ENERGY, 2019, 253
  • [2] Long Short-term Memory Neural Network for Network Traffic Prediction
    Zhuo, Qinzheng
    Li, Qianmu
    Yan, Han
    Qi, Yong
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [3] An FPGA Implementation of a Long Short-Term Memory Neural Network
    Ferreira, Joao Canas
    Fonseca, Jose
    [J]. 2016 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG16), 2016,
  • [4] Long short-term memory neural network for glucose prediction
    Carrillo-Moreno, Jaime
    Perez-Gandia, Carmen
    Sendra-Arranz, Rafael
    Garcia-Saez, Gema
    Hernando, M. Elena
    Gutierrez, Alvaro
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (09): : 4191 - 4203
  • [5] Long short-term memory neural network for glucose prediction
    Jaime Carrillo-Moreno
    Carmen Pérez-Gandía
    Rafael Sendra-Arranz
    Gema García-Sáez
    M. Elena Hernando
    Álvaro Gutiérrez
    [J]. Neural Computing and Applications, 2021, 33 : 4191 - 4203
  • [6] Short-term neural network memory
    Morris, Robert J.T.
    Wong, Wing Shing
    [J]. SIAM Journal on Computing, 1988, 17 (06): : 1103 - 1118
  • [7] A SHORT-TERM NEURAL NETWORK MEMORY
    MORRIS, RJT
    WONG, WS
    [J]. SIAM JOURNAL ON COMPUTING, 1988, 17 (06) : 1103 - 1118
  • [8] Predicting Short-term Traffic Flow by Long Short-Term Memory Recurrent Neural Network
    Tian, Yongxue
    Pan, Li
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 153 - 158
  • [9] Short-term runoff forecasting in an alpine catchment with a long short-term memory neural network
    Frank, Corinna
    Russwurm, Marc
    Fluixa-Sanmartin, Javier
    Tuia, Devis
    [J]. FRONTIERS IN WATER, 2023, 5
  • [10] A Deep Neural Network Model for Short-Term Load Forecast Based on Long Short-Term Memory Network and Convolutional Neural Network
    Tian, Chujie
    Ma, Jian
    Zhang, Chunhong
    Zhan, Panpan
    [J]. ENERGIES, 2018, 11 (12)