Towards a Mathematical Understanding of the Difficulty in Learning with Feedforward Neural Networks

被引:26
|
作者
Shen, Hao [1 ]
机构
[1] Fortiss, Guerickestr 25, D-80805 Munich, Germany
关键词
LOCAL MINIMA; HESSIAN-MATRIX; ERROR SURFACE; BACKPROPAGATION; APPROXIMATION; ALGORITHM; XOR;
D O I
10.1109/CVPR.2018.00091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training deep neural networks for solving machine learning problems is one great challenge in the field, mainly due to its associated optimisation problem being highly non-convex. Recent developments have suggested that many training algorithms do not suffer from undesired local minima under certain scenario, and consequently led to great efforts in pursuing mathematical explanations for such observations. This work provides an alternative mathematical understanding of the challenge from a smooth optimisation perspective. By assuming exact learning of finite samples, sufficient conditions are identified via a critical point analysis to ensure any local minimum to be globally minimal as well. Furthermore, a state of the art algorithm, known as the Generalised Gauss-Newton (GGN) algorithm, is rigorously revisited as an approximate Newton's algorithm, which shares the property of being locally quadratically convergent to a global minimum under the condition of exact learning.
引用
收藏
页码:811 / 820
页数:10
相关论文
共 50 条
  • [1] Towards Understanding Learning in Neural Networks with Linear Teachers
    Sarussi, Roei
    Brutzkus, Alon
    Globerson, Amir
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [2] Topology of Learning in Feedforward Neural Networks
    Gabella, Maxime
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (08) : 3588 - 3592
  • [3] Providing understanding of the behavior of feedforward neural networks
    Huang, SH
    Endsley, MR
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1997, 27 (03): : 465 - 474
  • [4] Weighted Learning for Feedforward Neural Networks
    Rong-Fang Xu
    Thao-Tsen Chen
    Shie-Jue Lee
    [J]. JournalofElectronicScienceandTechnology, 2014, 12 (03) : 299 - 304
  • [5] Weighted Learning for Feedforward Neural Networks
    Rong-Fang Xu
    Thao-Tsen Chen
    Shie-Jue Lee
    [J]. Journal of Electronic Science and Technology, 2014, (03) : 299 - 304
  • [6] Fast learning algorithms for feedforward neural networks
    Jiang, MH
    Gielen, G
    Zhang, B
    Luo, ZS
    [J]. APPLIED INTELLIGENCE, 2003, 18 (01) : 37 - 54
  • [7] Contrastive Hebbian Feedforward Learning for Neural Networks
    Kermiche, Noureddine
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (06) : 2118 - 2128
  • [8] A novel learning algorithm for feedforward neural networks
    Chen, Huawei
    Jin, Fan
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 509 - 514
  • [9] A Study of Learning Issues in Feedforward Neural Networks
    Teso-Fz-Betono, Adrian
    Zulueta, Ekaitz
    Cabezas-Olivenza, Mireya
    Teso-Fz-Betono, Daniel
    Fernandez-Gamiz, Unai
    [J]. MATHEMATICS, 2022, 10 (17)
  • [10] Diffusion learning algorithms for feedforward neural networks
    Skorohod B.A.
    [J]. Cybernetics and Systems Analysis, 2013, 49 (03) : 334 - 346