Geometrical singularities in the neuromanifold of multilayer Perceptrons

被引：0

作者：

Amari, S ^{[1
]}

Park, H ^{[1
]}

Ozeki, T ^{[1
]}

机构：

[1] RIKEN, Brain Sci Inst, Wako, Saitama 3510198, Japan

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2 | 2002年 / 14卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Singularities are ubiquitous in the parameter space of hierarchical models such as multilayer perceptrons. At singularities, the Fisher information matrix degenerates, and the Cramer-Rao paradigm does no more hold, implying that the classical model selection theory such as AIC and MDL cannot be applied. It is important to study the relation between the generalization error and the training error at singularities. The present paper demonstrates a method of analyzing these errors both for the maximum likelihood estimator and the Bayesian predictive distribution in terms of Gaussian random fields, by using simple models.

引用

页码：343 / 350

页数：8

共 50 条

[21] ON LANGEVIN UPDATING IN MULTILAYER PERCEPTRONS
ROGNVALDSSON, T
NEURAL COMPUTATION, 1994, 6 (05) : 916 - 926
[22] Active learning in multilayer perceptrons
Fukumizu, K
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 295 - 301
[23] ON THE INITIALIZATION AND OPTIMIZATION OF MULTILAYER PERCEPTRONS
WEYMAERE, N
MARTENS, JP
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (05): : 738 - 751
[24] Multilayer perceptrons on Splash 2
Ratha, NK
Jain, AK
CAMP'97 - FOURTH IEEE INTERNATIONAL WORKSHOP ON COMPUTER ARCHITECTURE FOR MACHINE PERCEPTION, PROCEEDINGS, 1997, : 138 - 142
[25] DYNAMIC SIZING OF MULTILAYER PERCEPTRONS
APOLLONI, B
RONCHINI, G
BIOLOGICAL CYBERNETICS, 1994, 71 (01) : 49 - 63
[26] On the weight sparsity of multilayer perceptrons
Drakopoulos, Georgios
Megalooikonomou, Vasileios
2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2015,
[27] Fast training of multilayer perceptrons
Verma, B
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (06): : 1314 - 1320
[28] Parameter by parameter algorithm for multilayer perceptrons
Li, YL
Zhang, D
Wang, KQ
NEURAL PROCESSING LETTERS, 2006, 23 (02) : 229 - 242
[29] Entropy minimization algorithm for multilayer perceptrons
Erdogmus, D
Principe, JC
IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 3003 - 3008
[30] Statistical active learning in multilayer perceptrons
Fukumizu, K
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (01): : 17 - 26

← 1 2 3 4 5 →