Error entropy minimization for LSTM training

Cited by: 0

Authors:

Alexandre, Luis A. [1]

Marques de Sa, J. P.

Affiliations:

[1] Univ Beira Interior, Covilha, Dept Informat, Covilha, Portugal

[2] Univ Beira Interior, Covilha, IT Networks & Multimedia Grp, Covilha, Portugal

[3] Univ Porto, Fac Engn, P-4100 Oporto, Portugal

[4] Univ Porto, INEB, P-4100 Oporto, Portugal

Source:

ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 1 | 2006 / Vol. 4131

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we present a new training algorithm for the Long Short-Term Memory (LSTM) recurrent neural network. This algorithm uses entropy instead of the usual mean squared error as the cost function for the weight update. More precisely, we use the Error Entropy Minimization (EEM) approach, where the entropy of the error is minimized after each symbol is presented to the network. Our experiments show that this approach enables the convergence of the LSTM more frequently than with the traditional learning algorithm. This in turn relaxes the burden of parameter tuning, since learning is achieved for a wider range of parameter values. The use of EEM also reduces, in some cases, the number of epochs needed for convergence.
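
The abstract does not say which entropy estimator is used, so the following is only an illustrative sketch of an entropy-based error cost, not the authors' implementation. It assumes Renyi's quadratic entropy estimated with a Gaussian Parzen window, a common choice in the EEM literature. The function names, the scalar-error setting, and the kernel width `sigma` are all assumptions made for illustration.

```python
import numpy as np


def information_potential(errors, sigma=1.0):
    """Parzen-window estimate of the information potential V(e).

    Assumed form: V(e) = (1/N^2) * sum_i sum_j G(e_i - e_j),
    where G is a zero-mean Gaussian with variance 2 * sigma^2.
    """
    e = np.asarray(errors, dtype=float).reshape(-1)
    # All pairwise differences e_i - e_j, shape (N, N).
    diffs = e[:, None] - e[None, :]
    var = 2.0 * sigma ** 2
    kernel = np.exp(-diffs ** 2 / (2.0 * var)) / np.sqrt(2.0 * np.pi * var)
    return kernel.mean()


def quadratic_error_entropy(errors, sigma=1.0):
    """Renyi quadratic entropy estimate H2(e) = -log V(e).

    Minimizing H2 is equivalent to maximizing V; an EEM-style
    update would follow the gradient of this quantity with
    respect to the network weights.
    """
    return -np.log(information_potential(errors, sigma))


if __name__ == "__main__":
    concentrated = np.zeros(10)           # identical errors
    spread = np.linspace(-3.0, 3.0, 10)   # widely scattered errors
    print(quadratic_error_entropy(concentrated))
    print(quadratic_error_entropy(spread))
```

Under these assumptions, errors that cluster tightly give a lower entropy estimate than errors that are widely scattered, which is the property an entropy cost rewards. Note that this estimate depends only on differences between errors, so it is unchanged if every error is shifted by the same constant.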

Pages: 244-253

Page count: 10

