Improving generalization performance of natural gradient learning using optimized regularization by NIC

被引:12
|
作者
Park, H [1 ]
Murata, N
Amari, S
机构
[1] RIKEN, Brain Sci Inst, Wako, Saitama, Japan
[2] Waseda Univ, Tokyo, Japan
关键词
D O I
10.1162/089976604322742065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural gradient learning is known to be efficient in escaping plateau, which is a main cause of the slow learning speed of neural networks. The adaptive natural gradient learning method for practical implementation also has been developed, and its advantage in real-world problems has been confirmed. In this letter, we deal with the generalization performances of the natural gradient method. Since natural gradient learning makes parameters fit to training data quickly, the overfitting phenomenon may easily occur, which results in poor generalization performance. To solve the problem, we introduce the regularization term in natural gradient learning and propose an efficient optimizing method for the scale of regularization by using a generalized Akaike information criterion (network information criterion). We discuss the properties of the optimized regularization strength by NIC through theoretical analysis as well as computer simulations. We confirm the computational efficiency and generalization performance of the proposed method in real-world applications through computational experiments on benchmark problems.
引用
收藏
页码:355 / 382
页数:28
相关论文
共 50 条
  • [21] Improving the Performance of Lightweight CNN models using Minimum Enclosing Ball Regularization
    Tzelepti, Maria
    Tefas, Anastastios
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [22] Improving elevator performance using reinforcement learning
    Crites, RH
    Barto, AG
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 1017 - 1023
  • [23] Improving the Performance of the PNLMS Algorithm Using l1 Norm Regularization
    Das, Rajib Lochan
    Chakraborty, Mrityunjoy
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (07) : 1280 - 1290
  • [24] Optimized machine learning approaches for identifying vertical temperature gradient on ballastless track in natural environments
    Shi, Tao
    Lou, Ping
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2023, 367
  • [25] Improving sulfide flotation performance using natural sorbents
    Eliseev, N. I.
    Averbukh, A. V.
    [J]. JOURNAL OF MINING SCIENCE, 2012, 48 (01) : 195 - 197
  • [26] Improving sulfide flotation performance using natural sorbents
    N. I. Eliseev
    A. V. Averbukh
    [J]. Journal of Mining Science, 2012, 48 : 195 - 197
  • [27] Improving the generalization performance of RBF neural networks using a linear regression technique
    Lin, C. L.
    Wang, J. F.
    Chen, C. Y.
    Chen, C. W.
    Yen, C. W.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (10) : 12049 - 12053
  • [28] A Generalization Performance Study Using Deep Learning Networks in Embedded Systems
    Gorospe, Joseba
    Mulero, Ruben
    Arbelaitz, Olatz
    Muguerza, Javier
    Anton, Miguel Angel
    [J]. SENSORS, 2021, 21 (04) : 1 - 29
  • [29] Improvement of Generalization Performance for Timber Health Monitoring using Machine Learning
    Suzuki, Kenta
    Ito, Takumi
    Koike, Kohei
    Kawahara, Takayuki
    Ke, Mengnan
    Mori, Kenjiro
    [J]. APCCAS 2020: PROCEEDINGS OF THE 2020 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2020), 2020, : 197 - 200
  • [30] Improving the performance of lightweight CNNs for binary classification using quadratic mutual information regularization
    Tzelepi, Maria
    Tefas, Anastasios
    [J]. PATTERN RECOGNITION, 2020, 106