Effect of batch learning in multilayer neural networks

被引:0
|
作者
Fukumizu, K [1 ]
机构
[1] RIKEN, Brain Res Inst, Lab Informat Synth, Wako, Saitama 35101, Japan
关键词
multilayer network; batch learning; overtraining; generalization error;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper discusses batch gradient descent learning in multilayer networks with a large number of statistical training data. We emphasize on the difference between regular cases, where the prepared model has the same size as the true function, and overrealizable cases, where the model has surplus hidden units to realize the true function. First, experimental study on multilayer perceptrons and linear neural networks (LNN) shows that batch learning induces strong overtraining on both models in overrealizable cases, which means the degrade of generalization error by surplus units can be alleviated. We theoretically analyze the dynamics in LNN, and show that this overtraining is caused by shrinkage of the parameters corresponding to surplus units.
引用
收藏
页码:67 / 70
页数:4
相关论文
共 50 条
  • [1] Learning with regularizers in multilayer neural networks
    Saad, D
    Rattray, M
    [J]. PHYSICAL REVIEW E, 1998, 57 (02) : 2170 - 2176
  • [2] Optimal learning in multilayer neural networks
    Winther, O
    Lautrup, B
    Zhang, JB
    [J]. PHYSICAL REVIEW E, 1997, 55 (01) : 836 - 844
  • [3] LEARNING ALGORITHMS FOR MULTILAYER NEURAL NETWORKS
    AVEDYAN, ED
    [J]. AUTOMATION AND REMOTE CONTROL, 1995, 56 (04) : 541 - 551
  • [4] Multilayer Batch Learning Growing Neural Gas for Learning Multiscale Topologies
    Toda, Yuichiro
    Matsuno, Takayuki
    Minami, Mamoru
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2021, 25 (06) : 1011 - 1023
  • [5] Quantizability and learning complexity in multilayer neural networks
    Fu, LM
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 1998, 28 (02): : 295 - 300
  • [6] An algorithm of supervised learning for multilayer neural networks
    Tang, Z
    Wang, XG
    Tamura, H
    Ishii, M
    [J]. NEURAL COMPUTATION, 2003, 15 (05) : 1125 - 1142
  • [7] Supervised Learning in Multilayer Spiking Neural Networks
    Sporea, Ioana
    Gruening, Andre
    [J]. NEURAL COMPUTATION, 2013, 25 (02) : 473 - 509
  • [8] Learning with noise and regularizers in multilayer neural networks
    Saad, D
    Solla, SA
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 260 - 266
  • [9] EXACT SOLUTION FOR ONLINE LEARNING IN MULTILAYER NEURAL NETWORKS
    SAAD, D
    SOLLA, SA
    [J]. PHYSICAL REVIEW LETTERS, 1995, 74 (21) : 4337 - 4340
  • [10] Globally optimal learning rates for multilayer neural networks
    Saad, D
    Rattray, M
    [J]. PHILOSOPHICAL MAGAZINE B-PHYSICS OF CONDENSED MATTER STATISTICAL MECHANICS ELECTRONIC OPTICAL AND MAGNETIC PROPERTIES, 1998, 77 (05): : 1523 - 1530