Adaptive natural gradient learning algorithms for various stochastic models

Cited by: 104
Authors
Park, H [1]
Amari, SI
Fukumizu, K
Affiliations
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
[2] RIKEN, Brain Sci Inst, Wako, Saitama 35101, Japan
[3] Inst Stat Math, Tokyo 106, Japan
Keywords
feedforward neural network; gradient descent learning; plateau problem; natural gradient learning; adaptive natural gradient learning
DOI
10.1016/S0893-6080(00)00051-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The natural gradient method has an ideal dynamic behavior which resolves the slow learning speed of the standard gradient descent method caused by plateaus. However, it is required to calculate the Fisher information matrix and its inverse, which makes the implementation of the natural gradient almost impossible. To solve this problem, a preliminary study has been proposed concerning an adaptive method of calculating an estimate of the inverse of the Fisher information matrix, which is called the adaptive natural gradient learning method. In this paper, we show that the adaptive natural gradient method can be extended to be applicable to a wide class of stochastic models: regression with an arbitrary noise model and classification with an arbitrary number of classes. We give explicit forms of the adaptive natural gradient for these models. We confirm the practical advantage of the proposed algorithms through computational experiments on benchmark problems. © 2000 Elsevier Science Ltd. All rights reserved.
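The core idea in the abstract, estimating the inverse Fisher information matrix adaptively instead of computing and inverting it, can be illustrated with a minimal sketch. It covers only the simplest case, single-output regression with Gaussian noise, and uses the first-order recursion usually given for that case, G_inv <- (1 + eps) * G_inv - eps * (G_inv grad_f)(G_inv grad_f)^T. The linear model (so that grad_f = x), the function name `adaptive_natural_gradient_fit`, and the step sizes `lr` and `eps` are assumptions made for illustration; the explicit forms the paper derives for arbitrary noise models and multi-class classification are not reproduced here.

```python
import numpy as np


def adaptive_natural_gradient_fit(X, y, lr=0.02, eps=0.01):
    """One online pass of adaptive natural gradient descent.

    Illustrative assumptions: linear model f(x; theta) = theta . x,
    squared-error loss (Gaussian noise), constant step sizes.
    Returns the fitted parameters and the running estimate of the
    inverse Fisher information matrix.
    """
    n_features = X.shape[1]
    theta = np.zeros(n_features)
    G_inv = np.eye(n_features)  # initial guess for the inverse Fisher matrix

    for x, target in zip(X, y):
        grad_f = x  # gradient of the model output w.r.t. theta (linear model)

        # Adaptive estimate of the inverse Fisher matrix: a first-order
        # approximation to inverting G <- (1 - eps) * G + eps * grad_f grad_f^T.
        v = G_inv @ grad_f
        G_inv = (1.0 + eps) * G_inv - eps * np.outer(v, v)

        # Ordinary gradient of the loss 0.5 * (target - f(x))^2.
        error = target - theta @ x
        grad_loss = -error * grad_f

        # Natural gradient step: precondition with the estimated inverse.
        theta = theta - lr * (G_inv @ grad_loss)

    return theta, G_inv
```

A call such as `theta, G_inv = adaptive_natural_gradient_fit(X, y)` processes each sample once. No matrix inversion appears in the loop; each step costs a matrix-vector product and an outer product, which is what makes the adaptive scheme practical compared with inverting the Fisher matrix directly.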
Pages: 755-764
Page count: 10