Error entropy minimization for LSTM training

被引:0
|
作者
Alexandre, Luis A. [1 ]
Marques de Sa, J. P.
机构
[1] Univ Beira Interior, Covilha, Dept Informat, Covilha, Portugal
[2] Univ Beira Interior, Covilha, IT Networks & Multimedia Grp, Covilha, Portugal
[3] Univ Porto, Fac Engn, P-4100 Oporto, Portugal
[4] Univ Porto, INEB, P-4100 Oporto, Portugal
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a new training algorithm for the Long Short-Term Memory (LSTM) recurrent neural network. This algorithrn uses entropy instead of the usual mean squared error as the cost function for the weight update. More precisely we use the Error Entropy Minimization approach, were the entropy of the error is minimized after each symbol is present to the network. Our experiments show that this approach enables the convergence of the LSTM more frequently than with the traditional learning algorithm. This in turn relaxes the burden of parameter tuning since learning is achieved for a wider range of parameter values. The use of EEM also reduces, in some cases, the number of epochs needed for convergence.
引用
收藏
页码:244 / 253
页数:10
相关论文
共 50 条
  • [1] An error-entropy minimization algorithm for supervised training of nonlinear adaptive systems
    Erdogmus, D
    Principe, JC
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (07) : 1780 - 1786
  • [2] Error entropy and mean square error minimization for lossless image compression
    William, Peter E.
    Hoffman, Michael W.
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 2261 - +
  • [3] Neural network classification using error entropy minimization
    Santos, Jorge M.
    Alexandre, Luis A.
    Marques de Sa, Joaquim
    BIOLOGICAL AND ARTIFICIAL INTELLIGENCE ENVIRONMENTS, 2005, : 291 - 297
  • [4] Error-Entropy Minimization for Dynamical Systems Modeling
    Zupanc, Jernej
    ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT I, 2008, 5163 : 417 - 425
  • [5] A dynamic Parzen window approach based on error-entropy minimization algorithm for supervised training of nonlinear adaptive system
    Wang Zibin
    Ren Xuemei
    Liu Yan
    PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 3, 2007, : 222 - +
  • [6] Adaptive system training based on minimum error entropy
    Wang, Y
    Guo, WG
    Guo, HW
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT SYSTEMS AND SIGNAL PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2003, : 1245 - 1249
  • [7] Efficient Levenberg-Marquardt Minimization of the Cross-Entropy Error Function
    Saric, Amar
    Xiao, Jing
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 1 - 8
  • [8] Complex-Valued Filtering Based on the Minimization of Complex-Error Entropy
    Huang, Songyan
    Li, Chunguang
    Liu, Yiguang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (05) : 695 - 708
  • [9] Control for Stochastic Tracking Error Minimization Based on State Entropy with Neural Network
    Maki, Hayato
    Katsura, Seiichiro
    2018 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2018, : 105 - 110
  • [10] Training Ensembles using Max-Entropy Error Diversity
    Holness, Gary F.
    Utgoff, Paul E.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2009, 1193 : 202 - 209