ON THE PROBLEM OF LOCAL MINIMA IN RECURRENT NEURAL NETWORKS

Cited by: 34
Authors
BIANCHINI, M
GORI, M
MAGGINI, M
Affiliation
[1] Dipartimento di Sistemi e Informatica, Universita di Firenze, 50139, Firenze
DOI
10.1109/72.279182
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many researchers have recently focused their efforts on devising efficient algorithms, mainly based on optimization schemes, for learning the weights of recurrent neural networks. As in the case of feedforward networks, however, these learning algorithms may get stuck in local minima during gradient descent, thus discovering sub-optimal solutions. This paper analyses the problem of optimal learning in recurrent networks by proposing conditions that guarantee local-minima-free error surfaces. An example is given that also shows the constructive role of the proposed theory in designing networks suitable for solving a given task. Moreover, a formal relationship between recurrent and static feedforward networks is established such that the examples of local minima for feedforward networks already known in the literature can be associated with analogous ones in recurrent networks.
Pages: 167 - 172
Number of pages: 6
Related papers
50 in total
  • [1] LEARNING IN NEURAL NETWORKS WITH LOCAL MINIMA
    HESKES, TM
    SLIJPEN, ETP
    KAPPEN, B
    PHYSICAL REVIEW A, 1992, 46 (08) : 5221 - 5231
  • [2] Local minima and plateaus in multilayer neural networks
    Fukumizu, K
    Amari, S
    NINTH INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS (ICANN99), VOLS 1 AND 2, 1999, (470) : 597 - 602
  • [3] Navigating Local Minima in Quantized Spiking Neural Networks
    Eshraghian, Jason K.
    Lammie, Corey
    Azghadi, Mostafa Rahimi
    Lu, Wei D.
    2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022 : 352 - 355
  • [4] Exponentially Many Local Minima in Quantum Neural Networks
    You, Xuchen
    Wu, Xiaodi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] Avoiding local minima in feedforward neural networks by simultaneous learning
    Atakulreka, Akarachai
    Sutivong, Daricha
    AI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4830 : 100 - +
  • [6] An improved algorithm for eleman neural network to avoid the local minima problem
    Zhang, Zhiqiang
    Tang, Guofeng
    Tang, Zheng
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2007 : 84 - +
  • [7] ON THE PROBLEM OF LOCAL MINIMA IN BACKPROPAGATION
    GORI, M
    TESI, A
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (01) : 76 - 86
  • [8] Local minima in hierarchical structures of complex-valued neural networks
    Nitta, Tohru
    NEURAL NETWORKS, 2013, 43 : 1 - 7
  • [9] Suboptimal Local Minima Exist for Wide Neural Networks with Smooth Activations
    Ding, Tian
    Li, Dawei
    Sun, Ruoyu
    MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (04) : 2784 - 2814
  • [10] Local Dynamics in Trained Recurrent Neural Networks
    Rivkind, Alexander
    Barak, Omri
    PHYSICAL REVIEW LETTERS, 2017, 118 (25)