ON THE PROBLEM OF LOCAL MINIMA IN RECURRENT NEURAL NETWORKS

Cited by: 34
Authors
BIANCHINI, M
GORI, M
MAGGINI, M
Affiliation
[1] Dipartimento di Sistemi e Informatica, Universita di Firenze, 50139, Firenze
Source
Keywords
DOI
10.1109/72.279182
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many researchers have recently focused their efforts on devising efficient algorithms, mainly based on optimization schemes, for learning the weights of recurrent neural networks. As in the case of feedforward networks, however, these learning algorithms may get stuck in local minima during gradient descent, thus discovering sub-optimal solutions. This paper analyses the problem of optimal learning in recurrent networks by proposing conditions that guarantee error surfaces free of local minima. An example is given that also shows the constructive role of the proposed theory in designing networks suitable for solving a given task. Moreover, a formal relationship between recurrent and static feedforward networks is established such that the examples of local minima for feedforward networks already known in the literature can be associated with analogous ones in recurrent networks.
Pages: 167 - 172
Number of pages: 6
Related papers
50 items in total
  • [31] Networks of local minima in optical system optimization
    Bociort, F
    van Driel, E
    Serebriakov, A
    OPTICS LETTERS, 2004, 29 (02) : 189 - 191
  • [32] Growing Neural Networks Achieve Flatter Minima
    Caillon, Paul
    Cerisara, Christophe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 222 - 234
  • [33] The local minima-free condition of feedforward neural networks for outer-supervised learning
    Huang, DS
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03) : 477 - 480
  • [34] A Systematic Algorithm to Escape from Local Minima in Training Feed-Forward Neural Networks
    Cheung, Chi-Chung
    Xu, Sean Shensheng
    Ng, Sin-Chun
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016 : 396 - 402
  • [35] Local minima found in the subparameter space can be effective for ensembles of deep convolutional neural networks
    Yang, Yongquan
    Lv, Haijun
    Chen, Ning
    Wu, Yang
    Zheng, Jiayi
    Zheng, Zhongxi
    PATTERN RECOGNITION, 2021, 109
  • [36] An improved backpropagation algorithm to avoid the local minima problem
    Wang, XG
    Tang, Z
    Tamura, H
    Ishii, M
    Sun, WD
    NEUROCOMPUTING, 2004, 56 : 455 - 460
  • [37] On the alleviation of the problem of local minima in back-propagation
    Magoulas, GD
    Vrahatis, MN
    Androulakis, GS
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 1997, 30 (07) : 4545 - 4550
  • [38] A Neural Network for Tornado Diagnosis: Managing Local Minima
    C. Marzban
    Neural Computing & Applications, 2000, 9 : 133 - 141
  • [39] The local minima in the lattice-simplex covering problem
    Cocke, W.
    Forcade, Rod
    Hall, H. Tracy
    Journal of Combinatorial Mathematics and Combinatorial Computing, 2014, 90 : 117 - 122
  • [40] Dynamics and local minima of a simple neural network for optimization
    Tsutsumi, K
    Nakajima, K
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001 : 353 - 358