An effective strategy for initializing the EM algorithm in finite mixture models

被引:13
|
作者
Michael, Semhar [1 ]
Melnykov, Volodymyr [2 ]
机构
[1] South Dakota State Univ, Dept Math & Stat, Brookings, SD 57007 USA
[2] Univ Alabama, Dept Informat Syst Stat & Management Sci, Tuscaloosa, AL 35487 USA
关键词
Finite mixture models; EM algorithm; Initialization; Model averaging; BIC; SIMULATING DATA; MULTIVARIATE; PERFORMANCE; COMPONENTS;
D O I
10.1007/s11634-016-0264-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Finite mixture models represent one of the most popular tools for modeling heterogeneous data. The traditional approach for parameter estimation is based on maximizing the likelihood function. Direct optimization is often troublesome due to the complex likelihood structure. The expectation-maximization algorithm proves to be an effective remedy that alleviates this issue. The solution obtained by this procedure is entirely driven by the choice of starting parameter values. This highlights the importance of an effective initialization strategy. Despite efforts undertaken in this area, there is no uniform winner found and practitioners tend to ignore the issue, often finding misleading or erroneous results. In this paper, we propose a simple yet effective tool for initializing the expectation-maximization algorithm in the mixture modeling setting. The idea is based on model averaging and proves to be efficient in detecting correct solutions even in those cases when competitors perform poorly. The utility of the proposed methodology is shown through comprehensive simulation study and applied to a well-known classification dataset with good results.
引用
收藏
页码:563 / 583
页数:21
相关论文
共 50 条
  • [1] An effective strategy for initializing the EM algorithm in finite mixture models
    Semhar Michael
    Volodymyr Melnykov
    [J]. Advances in Data Analysis and Classification, 2016, 10 : 563 - 583
  • [2] Initializing the EM algorithm in Gaussian mixture models with an unknown number of components
    Melnykov, Volodymyr
    Melnykov, Igor
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) : 1381 - 1395
  • [3] Competitive EM algorithm for finite mixture models
    Zhang, BB
    Zhang, CS
    Yi, X
    [J]. PATTERN RECOGNITION, 2004, 37 (01) : 131 - 144
  • [4] Finite mixture models estimation with a credal EM algorithm
    Vannoorenberghe, Patrick
    [J]. TRAITEMENT DU SIGNAL, 2007, 24 (02) : 103 - 113
  • [5] Multi-View EM algorithm for finite mixture models
    Yi, X
    Xu, YP
    Zhang, CS
    [J]. PATTERN RECOGNITION AND DATA MINING, PT 1, PROCEEDINGS, 2005, 3686 : 420 - 425
  • [6] RANDOM SWAP EM ALGORITHM FOR FINITE MIXTURE MODELS IN IMAGE SEGMENTATION
    Zhao, Qinpei
    Hautamaki, Ville
    Kaerkkaeinen, Ismo
    Franti, Pasi
    [J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 2397 - +
  • [7] Initializing the EM Algorithm for Univariate Gaussian, Multi-Component, Heteroscedastic Mixture Models by Dynamic Programming Partitions
    Polanski, Andrzej
    Marczyk, Michal
    Pietrowska, Monika
    Widlak, Piotr
    Polanska, Joanna
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL METHODS, 2018, 15 (03)
  • [8] A note on EM algorithm for mixture models
    Yao, Weixin
    [J]. STATISTICS & PROBABILITY LETTERS, 2013, 83 (02) : 519 - 526
  • [9] Recursive EM algorithm for finite mixture models with application to Internet traffic modeling
    Liu, Z
    Almhana, J
    Choulakian, V
    McGorman, R
    [J]. SECOND ANNUAL CONFERENCE ON COMMUNICATION NETWORKS AND SERVICES RESEARCH, PROCEEDINGS, 2004, : 198 - 207
  • [10] An EM algorithm for a semiparametric finite mixture model
    Zhang, B
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2002, 72 (10) : 791 - 802