Estimating the number of components in a mixture of multilayer perceptrons

被引:5
|
作者
Olteanu, M. [1 ]
Rynkiewicz, J. [1 ]
机构
[1] Univ Paris 01, SAMOS MATISSE CES, UMR 8174, F-75013 Paris, France
关键词
penalized likelihood; Bayesian information criterion (BIC); mixture models; multilayer perceptrons;
D O I
10.1016/j.neucom.2007.12.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bayesian information criterion (BIC) criterion is widely used by the neural-network community for model selection tasks, although its convergence properties are not always theoretically established. In this paper we will focus oil estimating the number of components in a mixture of multilayer perceptrons and proving the convergence of the BIC criterion in this frame. The penalized marginal-likelihood for mixture models and hidden Markov models introduced by Keribin [Consistent estimation of the order of mixture models, Sankhya Indian J. Stat. 62 (2000) 49-66] and, respectively, Gassiat [Likelihood ratio inequalities with applications to various mixtures, Ann. Inst. Henri Poincare 38 (2002) 897-906] is extended to mixtures of multilayer perceptrons for which a penalized-likelihood criterion is proposed. We prove its convergence under some hypothesis which involve essentially the bracketing entropy of the generalized score-function class and illustrate it by some numerical examples. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:1321 / 1329
页数:9
相关论文
共 50 条
  • [31] Multilayer perceptrons and data compression
    Manger, Robert
    Puljic, Krunoslav
    COMPUTING AND INFORMATICS, 2007, 26 (01) : 45 - 62
  • [32] An approach to encode Multilayer Perceptrons
    Korczak, J
    Blindauer, E
    ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 302 - 307
  • [33] Testing the number of components in a normal mixture
    Lo, YT
    Mendell, NR
    Rubin, DB
    BIOMETRIKA, 2001, 88 (03) : 767 - 778
  • [34] On the number of components in a Gaussian mixture model
    McLachlan, Geoffrey J.
    Rathnayake, Suren
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 4 (05) : 341 - 355
  • [35] Representation and extrapolation in multilayer perceptrons
    Browne, A
    NEURAL COMPUTATION, 2002, 14 (07) : 1739 - 1754
  • [36] Geometry of learning in multilayer perceptrons
    Amari, S
    Park, H
    Ozeki, T
    COMPSTAT 2004: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2004, : 49 - 60
  • [37] Clustering using multilayer perceptrons
    Charalampidis, Dimitrios
    Muldrey, Barry
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2009, 71 (12) : E2807 - E2813
  • [38] Mixture Models With a Prior on the Number of Components
    Miller, Jeffrey W.
    Harrison, Matthew T.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (521) : 340 - 356
  • [39] ON LANGEVIN UPDATING IN MULTILAYER PERCEPTRONS
    ROGNVALDSSON, T
    NEURAL COMPUTATION, 1994, 6 (05) : 916 - 926
  • [40] Active learning in multilayer perceptrons
    Fukumizu, K
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 295 - 301