Natural gradient works efficiently in learning

Cited by: 1807
Authors
Amari, S [1]
Affiliation
[1] RIKEN, Frontier Res Program, Wako, Saitama 35101, Japan
DOI
10.1162/089976698300017746
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
When a parameter space has a certain underlying structure, the ordinary gradient of a function does not represent its steepest direction, but the natural gradient does. Information geometry is used for calculating the natural gradients in the parameter space of perceptrons, the space of matrices (for blind source separation), and the space of linear dynamical systems (for blind source deconvolution). The dynamical behavior of natural gradient online learning is analyzed and is proved to be Fisher efficient, implying that it has asymptotically the same performance as the optimal batch estimation of parameters. This suggests that the plateau phenomenon, which appears in the backpropagation learning algorithm of multilayer perceptrons, might disappear or might not be so serious when the natural gradient is used. An adaptive method of updating the learning rate is proposed and analyzed.
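The update the abstract describes can be illustrated with a minimal sketch: the ordinary gradient is premultiplied by the inverse Fisher information matrix before the step is taken. The example below is not taken from the paper. It assumes a univariate Gaussian model parameterized by (mu, sigma), uses batch rather than online updates, and the function names (`fisher_gaussian`, `nll_grad`, `natural_gradient_fit`), the learning rate, and the step count are all hypothetical choices made for illustration.

```python
import numpy as np

def fisher_gaussian(sigma):
    # Fisher information of N(mu, sigma^2) in the coordinates (mu, sigma).
    return np.diag([1.0 / sigma**2, 2.0 / sigma**2])

def nll_grad(theta, x):
    # Ordinary gradient of the average negative log-likelihood.
    mu, sigma = theta
    d = x - mu
    g_mu = -np.mean(d) / sigma**2
    g_sigma = 1.0 / sigma - np.mean(d**2) / sigma**3
    return np.array([g_mu, g_sigma])

def natural_gradient_fit(x, theta0=(0.0, 1.0), lr=0.5, steps=100):
    # Illustrative batch loop: theta <- theta - lr * G(theta)^{-1} grad.
    theta = np.array(theta0, dtype=float)
    for _ in range(steps):
        G = fisher_gaussian(theta[1])
        theta = theta - lr * np.linalg.solve(G, nll_grad(theta, x))
    return theta

# Synthetic data (an assumption for this sketch, not data from the paper).
rng = np.random.default_rng(0)
x = rng.normal(loc=3.0, scale=2.0, size=5000)
mu_hat, sigma_hat = natural_gradient_fit(x)
```

In this parameterization the natural-gradient step for mu reduces to moving a fixed fraction of the way toward the sample mean, independent of sigma, which is one way to see why rescaling by the Fisher matrix can remove the sensitivity to parameter scale that the ordinary gradient has.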
Pages: 251 - 276
Number of pages: 26
Related papers
50 in total
  • [1] Natural gradient works efficiently in learning
    Amari, S
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 11 - 14
  • [2] Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning
    Zhao, Yang
    Zhang, Hao
    Hu, Xiuyuan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [3] Natural gradient learning algorithms for decorrelation
    Choi, S
    Amari, S
    Cichocki, A
    [J]. PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998 : 645 - 648
  • [4] Matrix momentum for practical natural gradient learning
    Scarpetta, S
    Rattray, M
    Saad, D
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1999, 32 (22) : 4047 - 4059
  • [5] Adaptive Natural Policy Gradient in Reinforcement Learning
    Li, Dazi
    Qiao, Zengyuan
    Song, Tianheng
    Jin, Qibing
    [J]. PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018 : 605 - 610
  • [6] Natural Gradient Learning Algorithms for RBF Networks
    Zhao, Junsheng
    Wei, Haikun
    Zhang, Chi
    Li, Weiling
    Guo, Weili
    Zhang, Kanjian
    [J]. NEURAL COMPUTATION, 2015, 27 (02) : 481 - 505
  • [7] Natural gradient learning algorithms for nonlinear systems
    Zhao Junsheng
    Xia Jianwei
    Zhuang Guangming
    Zhang Huasheng
    [J]. 2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015 : 1979 - 1983
  • [8] Natural gradient descent for on-line learning
    Rattray, M
    Saad, D
    Amari, S
    [J]. PHYSICAL REVIEW LETTERS, 1998, 81 (24) : 5461 - 5464
  • [9] Natural-gradient learning for spiking neurons
    Kreutzer, Elena
    Senn, Walter
    Petrovici, Mihai A.
    [J]. ELIFE, 2022, 11
  • [10] The natural gradient learning algorithm for neural networks
    Amari, S
    [J]. THEORETICAL ASPECTS OF NEURAL COMPUTATION: A MULTIDISCIPLINARY PERSPECTIVE, 1998 : 1 - 15