Estimating the number of components in a finite mixture model: the special case of homogeneity

被引:15
|
作者
Schlattmann, P [1 ]
机构
[1] Free Univ Berlin, Dept Psychiat & Psychotherapy, D-14050 Berlin, Germany
关键词
finite mixture models; Poisson distribution; disease mapping; likelihood ratio test; nonparametric bootstrap; simulation studies;
D O I
10.1016/S0167-9473(02)00173-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Finite mixture models arise in a natural way in that they are modeling unobserved population heterogeneity. An application in disease mapping shows that mixture models are useful in separating signal from noise. Thus, the number of components k of the mixture model needs to be estimated where k = 1 is the important homogenous case. Because of the irregularity of the parameter space, the log-likelihood-ratio statistic (LRS) does not have a chi(2) limit distribution and therefore it is difficult to use the LRS to test for the number of components. An alternative approach applies the nonparametric bootstrap such that a mixture algorithm is applied B times to bootstrap samples obtained from the original sample with replacement. The number of components k is obtained as the mode of the bootstrap distribution of k. This approach provides on empirical grounds a mode-unbiased and consistent estimator for the number of components in the homogeneous Poisson case. The distribution of the log-likelihood-ratio statistic (LRS) for the testing problem H-0 : k = 1 vs. H-1 : k > 1 is addressed for the Poisson case. For a very large sample size of n = 10 000 this distribution approximates a chi(1)(2) distribution. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:441 / 451
页数:11
相关论文
共 50 条
  • [41] A practical sampling approach for a Bayesian mixture model with unknown number of components
    Wang, Liqun
    Fu, James C.
    STATISTICAL PAPERS, 2007, 48 (04) : 631 - 653
  • [42] Selection of the number of components using a genetic algorithm for mixture model classifiers
    Tenmoto, H
    Kudo, M
    Shimbo, M
    ADVANCES IN PATTERN RECOGNITION, 2000, 1876 : 511 - 520
  • [43] A practical sampling approach for a Bayesian mixture model with unknown number of components
    Liqun Wang
    James C. Fu
    Statistical Papers, 2007, 48 : 631 - 653
  • [44] GENERALIZED LIKELIHOOD-RATIO TEST OF THE NUMBER OF COMPONENTS IN FINITE MIXTURE-MODELS
    CHEN, JH
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1994, 22 (03): : 387 - 399
  • [45] ESTIMATING THE COMPONENT AGES IN A FINITE MIXTURE
    GALBRAITH, RF
    GREEN, PF
    NUCLEAR TRACKS AND RADIATION MEASUREMENTS, 1990, 17 (03): : 197 - 206
  • [46] Testing the number of components in a normal mixture
    Lo, YT
    Mendell, NR
    Rubin, DB
    BIOMETRIKA, 2001, 88 (03) : 767 - 778
  • [47] Mixture Models With a Prior on the Number of Components
    Miller, Jeffrey W.
    Harrison, Matthew T.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (521) : 340 - 356
  • [48] NEW MODEL FOR ESTIMATING NUMBER OF ELECTROPHORETICALLY DETECTABLE ALLELES IN A FINITE POPULATION
    OHTA, T
    KIMURA, M
    GENETICS, 1973, 74 (JUN) : S201 - S201
  • [49] Influence of stirrer type on mixture homogeneity in continuous powder mixing: A model case and a pharmaceutical case
    Marikh, K.
    Berthiaux, H.
    Gatumel, C.
    Mizonov, V.
    Barantseva, E.
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2008, 86 (9A): : 1027 - 1037
  • [50] Finite mixture models with negative components
    Zhang, BB
    Zhang, CS
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINDS, 2005, 3587 : 31 - 41