Geometrical singularities in the neuromanifold of multilayer Perceptrons

被引:0
|
作者
Amari, S [1 ]
Park, H [1 ]
Ozeki, T [1 ]
机构
[1] RIKEN, Brain Sci Inst, Wako, Saitama 3510198, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Singularities are ubiquitous in the parameter space of hierarchical models such as multilayer perceptrons. At singularities, the Fisher information matrix degenerates, and the Cramer-Rao paradigm does no more hold, implying that the classical model selection theory such as AIC and MDL cannot be applied. It is important to study the relation between the generalization error and the training error at singularities. The present paper demonstrates a method of analyzing these errors both for the maximum likelihood estimator and the Bayesian predictive distribution in terms of Gaussian random fields, by using simple models.
引用
收藏
页码:343 / 350
页数:8
相关论文
共 50 条
  • [21] ON LANGEVIN UPDATING IN MULTILAYER PERCEPTRONS
    ROGNVALDSSON, T
    NEURAL COMPUTATION, 1994, 6 (05) : 916 - 926
  • [22] Active learning in multilayer perceptrons
    Fukumizu, K
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 295 - 301
  • [23] ON THE INITIALIZATION AND OPTIMIZATION OF MULTILAYER PERCEPTRONS
    WEYMAERE, N
    MARTENS, JP
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (05): : 738 - 751
  • [24] Multilayer perceptrons on Splash 2
    Ratha, NK
    Jain, AK
    CAMP'97 - FOURTH IEEE INTERNATIONAL WORKSHOP ON COMPUTER ARCHITECTURE FOR MACHINE PERCEPTION, PROCEEDINGS, 1997, : 138 - 142
  • [25] DYNAMIC SIZING OF MULTILAYER PERCEPTRONS
    APOLLONI, B
    RONCHINI, G
    BIOLOGICAL CYBERNETICS, 1994, 71 (01) : 49 - 63
  • [26] On the weight sparsity of multilayer perceptrons
    Drakopoulos, Georgios
    Megalooikonomou, Vasileios
    2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2015,
  • [27] Fast training of multilayer perceptrons
    Verma, B
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (06): : 1314 - 1320
  • [28] Parameter by parameter algorithm for multilayer perceptrons
    Li, YL
    Zhang, D
    Wang, KQ
    NEURAL PROCESSING LETTERS, 2006, 23 (02) : 229 - 242
  • [29] Entropy minimization algorithm for multilayer perceptrons
    Erdogmus, D
    Principe, JC
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 3003 - 3008
  • [30] Statistical active learning in multilayer perceptrons
    Fukumizu, K
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (01): : 17 - 26