ON THE PROBLEM OF LOCAL MINIMA IN RECURRENT NEURAL NETWORKS

被引:34
|
作者
BIANCHINI, M
GORI, M
MAGGINI, M
机构
[1] Dipartimento di Sistemi e Informatica, Universita di Firenze, 50139, Firenze
来源
关键词
D O I
10.1109/72.279182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many researchers have recently focused their efforts on devising efficient algorithms, mainly based on optimization schemes, for learning the weights of recurrent neural networks. As in the case of feedforward networks, however, these learning algorithms may get stuck in local minima during gradient descent, thus discovering sub-optimal solutions. This paper analyses the problem of optimal learning in recurrent networks by proposing conditions that guarantee local minima free error surfaces. An example is given that also shows the constructive role of the proposed theory in designing networks suitable for solving a given task. Moreover, a formal relationship between recurrent and static feedforward networks is established such that the examples of local minima for feedforward networks already known in the literature can be associated with analogous ones in recurrent networks.
引用
收藏
页码:167 / 172
页数:6
相关论文
共 50 条
  • [21] Global Minima of Overparameterized Neural Networks
    Cooper, Yaim
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (02): : 676 - 691
  • [22] NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA
    BALDI, P
    HORNIK, K
    NEURAL NETWORKS, 1989, 2 (01) : 53 - 58
  • [23] Solving local minima problem with large number of hidden nodes on two-layered feed-forward artificial neural networks
    Choi, Bumghi
    Lee, Ju-Hong
    Kim, Deok-Hwan
    NEUROCOMPUTING, 2008, 71 (16-18) : 3640 - 3643
  • [24] Local minima free neural network learning
    Jordanov, IN
    Rafik, TA
    2004 2ND INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 34 - 39
  • [25] COMPLETE SOLUTION OF THE LOCAL MINIMA IN THE XOR PROBLEM
    LISBOA, PJG
    PERANTONIS, SJ
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1991, 2 (01) : 119 - 124
  • [26] On the number of local minima for the multidimensional assignment problem
    Don A. Grundel
    Pavlo A. Krokhmal
    Carlos A. S. Oliveira
    Panos M. Pardalos
    Journal of Combinatorial Optimization, 2007, 13 : 1 - 18
  • [27] On the number of local minima for the multidimensional assignment problem
    Grundel, Don A.
    Krokhmal, Pavlo A.
    Oliveira, Carlos A. S.
    Pardalos, Panos M.
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2007, 13 (01) : 1 - 18
  • [28] A variational problem with a continuum of weak local minima
    Loewen, PD
    Watson, SJ
    Wolenski, PR
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 3530 - 3533
  • [29] Recurrent Neural Networks as Local Models for Time Series Prediction
    Cherif, Aymen
    Cardot, Hubert
    Bone, Romuald
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 786 - 793
  • [30] An Algorithm of Two-Phase Learning for Eleman Neural Network to Avoid the Local Minima Problem
    Zhang, Zhiqiang
    Tang, Zheng
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (04): : 1 - 10