Error entropy minimization for LSTM training

Cited by: 0
Authors
Alexandre, Luis A. [1]
Marques de Sa, J. P.
Affiliations
[1] Univ Beira Interior, Covilha, Dept Informat, Covilha, Portugal
[2] Univ Beira Interior, Covilha, IT Networks & Multimedia Grp, Covilha, Portugal
[3] Univ Porto, Fac Engn, P-4100 Oporto, Portugal
[4] Univ Porto, INEB, P-4100 Oporto, Portugal
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a new training algorithm for the Long Short-Term Memory (LSTM) recurrent neural network. This algorithm uses entropy instead of the usual mean squared error as the cost function for the weight update. More precisely, we use the Error Entropy Minimization (EEM) approach, where the entropy of the error is minimized after each symbol is presented to the network. Our experiments show that this approach enables the LSTM to converge more frequently than the traditional learning algorithm does. This in turn relaxes the burden of parameter tuning, since learning is achieved for a wider range of parameter values. The use of EEM also reduces, in some cases, the number of epochs needed for convergence.
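The abstract does not say which entropy estimator the authors use. As a rough illustration of how an error-entropy cost differs from mean squared error, the sketch below assumes Rényi's quadratic entropy estimated with a Gaussian Parzen window, a common choice in the EEM literature. The function names and the `sigma` kernel width are hypothetical and are not taken from the paper.

```python
import numpy as np


def information_potential(errors, sigma=1.0):
    """Parzen-window estimate of the information potential of the errors.

    Assumption (not from the paper): Gaussian kernel of width `sigma`;
    the pairwise kernel then has variance 2 * sigma**2.
    """
    e = np.asarray(errors, dtype=float).reshape(-1)
    diff = e[:, None] - e[None, :]          # all pairwise error differences
    var = 2.0 * sigma ** 2                  # variance of the convolved kernel
    kernel = np.exp(-diff ** 2 / (2.0 * var)) / np.sqrt(2.0 * np.pi * var)
    return kernel.mean()                    # (1 / N^2) * sum over all pairs


def renyi_quadratic_entropy(errors, sigma=1.0):
    """Entropy cost: minimizing it concentrates the error distribution."""
    return -np.log(information_potential(errors, sigma))


def mean_squared_error(errors):
    """The usual cost, shown only for comparison."""
    e = np.asarray(errors, dtype=float)
    return np.mean(e ** 2)
```

Under these assumptions, tightly clustered errors give a lower entropy than widely spread ones. Unlike the mean squared error, the entropy is unchanged when a constant is added to every error, so an entropy-based cost is usually paired with some way of centering the error at zero.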
Pages: 244 - 253
Page count: 10
Related papers
50 in total
  • [31] ENTROPY PRODUCTION MINIMIZATION OF A CRHP
    Aarts, Stefan P.
    Gudjonsdottir, Vilborg
    Ferreira, Carlos A. Infante
    Kiss, Anton A.
    5TH IIR INTERNATIONAL CONFERENCE ON THERMOPHYSICAL PROPERTIES AND TRANSFER PROCESSES OF REFRIGERANTS (TPTPR), 2017, : 1130 - 1137
  • [32] SELECTIVE RESPONSES BY ENTROPY MINIMIZATION
    KAMIMURA, R
    MATHEMATICAL AND COMPUTER MODELLING, 1995, 21 (1-2) : 143 - 157
  • [33] ENTROPY PRODUCTION MINIMIZATION OF A CRHP
    Aarts, Stefan P.
    Gudjonsdottir, Vilborg
    Ferreira, Carlos A. Infante
    Kiss, Anton A.
    5TH IIR INTERNATIONAL CONFERENCE ON THERMOPHYSICAL PROPERTIES AND TRANSFER PROCESSES OF REFRIGERANTS (TPTPR), 2017, : 475 - 482
  • [34] Intrinsic images by entropy minimization
    Finlayson, GD
    Drew, MS
    Lu, C
    COMPUTER VISION - ECCV 2004, PT 3, 2004, 3023 : 582 - 595
  • [35] Minimization of Entropy Functionals Revisited
    Csiszar, Imre
    Matus, Frantisek
    2012 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2012, : 150 - 154
  • [36] Entropy Minimization for Solving Sudoku
    Gunther, Jake
    Moon, Todd
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (01) : 508 - 513
  • [37] Convergence properties and data efficiency of the minimum error entropy criterion in adaline training
    Erdogmus, D
    Principe, JC
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2003, 51 (07) : 1966 - 1978
  • [38] A binning formula of bi-histogram for joint entropy estimation using mean square error minimization
    Hacine-Gharbi, Abdenour
    Ravier, Philippe
    PATTERN RECOGNITION LETTERS, 2018, 101 : 21 - 28
  • [39] Error minimization of multipole expansion
    Ohnuki, S
    Chew, WC
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2005, 26 (06) : 2047 - 2065
  • [40] DESIGN OF PAPERS FOR ERROR MINIMIZATION
    BONDI, A
    JOURNAL OF CHEMICAL DOCUMENTATION, 1969, 9 (01) : 7 - &