Estimating the number of components in a finite mixture model: the special case of homogeneity

被引:15
|
作者
Schlattmann, P [1 ]
机构
[1] Free Univ Berlin, Dept Psychiat & Psychotherapy, D-14050 Berlin, Germany
关键词
finite mixture models; Poisson distribution; disease mapping; likelihood ratio test; nonparametric bootstrap; simulation studies;
D O I
10.1016/S0167-9473(02)00173-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Finite mixture models arise in a natural way in that they are modeling unobserved population heterogeneity. An application in disease mapping shows that mixture models are useful in separating signal from noise. Thus, the number of components k of the mixture model needs to be estimated where k = 1 is the important homogenous case. Because of the irregularity of the parameter space, the log-likelihood-ratio statistic (LRS) does not have a chi(2) limit distribution and therefore it is difficult to use the LRS to test for the number of components. An alternative approach applies the nonparametric bootstrap such that a mixture algorithm is applied B times to bootstrap samples obtained from the original sample with replacement. The number of components k is obtained as the mode of the bootstrap distribution of k. This approach provides on empirical grounds a mode-unbiased and consistent estimator for the number of components in the homogeneous Poisson case. The distribution of the log-likelihood-ratio statistic (LRS) for the testing problem H-0 : k = 1 vs. H-1 : k > 1 is addressed for the Poisson case. For a very large sample size of n = 10 000 this distribution approximates a chi(1)(2) distribution. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:441 / 451
页数:11
相关论文
共 50 条