Error entropy minimization for LSTM training

Cited by: 0
Authors
Alexandre, Luis A. [1]
Marques de Sa, J. P.
Affiliations
[1] Univ Beira Interior, Covilha, Dept Informat, Covilha, Portugal
[2] Univ Beira Interior, Covilha, IT Networks & Multimedia Grp, Covilha, Portugal
[3] Univ Porto, Fac Engn, P-4100 Oporto, Portugal
[4] Univ Porto, INEB, P-4100 Oporto, Portugal
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a new training algorithm for the Long Short-Term Memory (LSTM) recurrent neural network. This algorithm uses entropy instead of the usual mean squared error as the cost function for the weight update. More precisely, we use the Error Entropy Minimization (EEM) approach, where the entropy of the error is minimized after each symbol is presented to the network. Our experiments show that this approach enables the convergence of the LSTM more frequently than the traditional learning algorithm does. This in turn relaxes the burden of parameter tuning, since learning is achieved for a wider range of parameter values. The use of EEM also reduces, in some cases, the number of epochs needed for convergence.
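The idea of using error entropy rather than mean squared error as a cost can be sketched as follows. The abstract does not say which entropy estimator the paper uses, so this sketch assumes Renyi's quadratic entropy estimated with a Gaussian Parzen window, a common choice in the EEM literature. The function names, the kernel width `sigma`, and the synthetic error samples are all illustrative assumptions, and no LSTM is involved.

```python
import numpy as np


def information_potential(errors, sigma=1.0):
    """Parzen-window estimate of the information potential V(e).

    Assumption: scalar errors and a Gaussian kernel of width `sigma`;
    neither choice is taken from the abstract.
    """
    e = np.asarray(errors, dtype=float).reshape(-1)
    diff = e[:, None] - e[None, :]  # all pairwise error differences
    var = 2.0 * sigma ** 2  # variance of two convolved Gaussian kernels
    kernel = np.exp(-diff ** 2 / (2.0 * var)) / np.sqrt(2.0 * np.pi * var)
    return kernel.mean()


def renyi_quadratic_entropy(errors, sigma=1.0):
    """Estimated Renyi quadratic entropy, H2 = -log V(e)."""
    return -np.log(information_potential(errors, sigma))


def mean_squared_error(errors):
    """The usual MSE cost, shown only for comparison."""
    e = np.asarray(errors, dtype=float).reshape(-1)
    return np.mean(e ** 2)


# Synthetic error samples (hypothetical, not data from the paper).
rng = np.random.default_rng(0)
concentrated = rng.normal(0.0, 0.1, size=200)  # errors tightly clustered
spread = rng.normal(0.0, 1.0, size=200)        # errors widely scattered

h_concentrated = renyi_quadratic_entropy(concentrated, sigma=0.5)
h_spread = renyi_quadratic_entropy(spread, sigma=0.5)
h_shifted = renyi_quadratic_entropy(concentrated + 3.0, sigma=0.5)

print("entropy, concentrated errors:", h_concentrated)
print("entropy, spread errors:      ", h_spread)
print("entropy, shifted errors:     ", h_shifted)
print("MSE, shifted errors:         ", mean_squared_error(concentrated + 3.0))
```

Under these assumptions the estimate is lower for tightly clustered errors than for scattered ones, and it does not change when every error is shifted by the same constant, whereas the MSE does. That is why entropy-based training schemes generally need some additional mechanism to center the errors at zero; whether and how the paper handles this is not stated in the abstract.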
Pages: 244 - 253
Number of pages: 10
Related papers
50 in total
  • [41] MINIMIZATION OF ERROR IN INDIRECT MEASUREMENTS
    KYUREGYAN, SG
    MEASUREMENT TECHNIQUES USSR, 1994, 37 (12) : 1351 - 1355
  • [42] Structural minimization of tracking error
    Rossbach, Peter
    Karlow, Denis
    QUANTITATIVE FINANCE, 2019, 19 (03) : 357 - 366
  • [43] MEDICATION ERROR MINIMIZATION PROJECT
    Anand, M.
    Dong, C.
    Totev, V.
    Stewart, L.
    AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2018, 52 : 84 - 85
  • [44] Exploiting the Massive MIMO Channel Structural Properties for Minimization of Channel Estimation Error and Training Overhead
    Bazzi, Samer
    Stefanatos, Stelios
    Le Magoarou, Luc
    Hajri, Salah Eddine
    Assaad, Mohamad
    Paquelet, Stephane
    Wunder, Gerhard
    Xu, Wen
    IEEE ACCESS, 2019, 7 : 32434 - 32452
  • [45] Error in an approximate wave function and an error minimization scheme
    Mukhopadhyay, S
    Bhattacharyya, K
    INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2004, 96 (05) : 492 - 500
  • [46] Offline Modeling for Product Quality Prediction of Mineral Processing Using Modeling Error PDF Shaping and Entropy Minimization
    Ding, Jinliang
    Chai, Tianyou
    Wang, Hong
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (03) : 408 - 419
  • [47] An Error-Entropy Minimization Algorithm for Tracking Control of Nonlinear Stochastic Systems with Non-Gaussian Variables
    Liu, Yunlong
    Wang, Aiping
    Guo, Lei
    Wang, Hong
    IFAC PAPERSONLINE, 2017, 50 (01) : 10407 - 10412
  • [48] A new histogram-based estimation technique of entropy and mutual information using mean squared error minimization
    Hacine-Gharbi, A.
    Deriche, M.
    Ravier, P.
    Harba, R.
    Mohamadi, T.
    COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (03) : 918 - 933
  • [49] Concurrent Error Detection for LSTM Accelerators
    Nosrati, Nooshin
    Ghasemi, Seyedeh Maryam
    Roodsari, Mahboobe Sadeghipour
    Navabi, Zainalabedin
    2022 IEEE EUROPEAN TEST SYMPOSIUM (ETS 2022), 2022,
  • [50] Mean-Square Convergence Analysis of ADALINE Training With Minimum Error Entropy Criterion
    Chen, Badong
    Zhu, Yu
    Hu, Jinchun
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (07) : 1168 - 1179