Error entropy minimization for LSTM training

Cited by: 0
Authors
Alexandre, Luis A. [1]
Marques de Sa, J. P.
Affiliations
[1] Univ Beira Interior, Covilha, Dept Informat, Covilha, Portugal
[2] Univ Beira Interior, Covilha, IT Networks & Multimedia Grp, Covilha, Portugal
[3] Univ Porto, Fac Engn, P-4100 Oporto, Portugal
[4] Univ Porto, INEB, P-4100 Oporto, Portugal
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a new training algorithm for the Long Short-Term Memory (LSTM) recurrent neural network. This algorithm uses entropy instead of the usual mean squared error as the cost function for the weight update. More precisely, we use the Error Entropy Minimization (EEM) approach, where the entropy of the error is minimized after each symbol is presented to the network. Our experiments show that this approach enables the convergence of the LSTM more frequently than with the traditional learning algorithm. This in turn relaxes the burden of parameter tuning, since learning is achieved for a wider range of parameter values. The use of EEM also reduces, in some cases, the number of epochs needed for convergence.
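The abstract does not say which entropy estimator the authors use. As a minimal sketch, assuming Renyi's quadratic entropy estimated with a Gaussian Parzen window (a common choice in the EEM literature, but an assumption here), the cost can be computed from a set of output errors as below. The function names and the kernel width `sigma` are hypothetical, and no LSTM or gradient step is included.

```python
import math


def information_potential(errors, sigma=1.0):
    """Parzen-window estimate V(e) = (1/N^2) * sum_i sum_j G(e_i - e_j).

    Assumption: Gaussian kernel of width sigma. Convolving two such
    kernels yields a Gaussian with variance 2 * sigma^2.
    """
    n = len(errors)
    var = 2.0 * sigma ** 2
    norm = 1.0 / math.sqrt(2.0 * math.pi * var)
    total = 0.0
    for ei in errors:
        for ej in errors:
            total += norm * math.exp(-((ei - ej) ** 2) / (2.0 * var))
    return total / (n * n)


def renyi_quadratic_entropy(errors, sigma=1.0):
    """H_2(e) = -log V(e); minimizing H_2 is equivalent to maximizing V."""
    return -math.log(information_potential(errors, sigma))


def mean_squared_error(errors):
    """The usual MSE cost, shown only for comparison."""
    return sum(e * e for e in errors) / len(errors)


# Illustrative error sets (made-up numbers, not data from the paper).
concentrated = [0.01, -0.02, 0.00, 0.015]
spread = [0.9, -0.7, 0.1, -0.4]
shifted = [e + 5.0 for e in concentrated]

print(renyi_quadratic_entropy(concentrated) < renyi_quadratic_entropy(spread))
print(mean_squared_error(shifted) > mean_squared_error(concentrated))
```

Under this estimator, errors that cluster tightly have lower entropy than widely scattered ones. The estimate depends only on pairwise differences between errors, so adding a constant offset to every error leaves it unchanged, whereas the mean squared error grows with the offset.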
Pages: 244-253
Page count: 10