Unsupervised learning of regression mixture models with unknown number of components

被引:13
|
作者
Chamroukhi, Faicel [1 ,2 ]
机构
[1] Aix Marseille Univ, CNRS, ENSAM, LSIS UMR 7296, Marseille, France
[2] Univ Toulon & Var, CNRS, LSIS UMR 7296, La Garde, France
关键词
Unsupervised learning; regression mixtures; EM algorithm; robust EM-like algorithm; model selection; curve clustering; DISCRIMINANT-ANALYSIS; MAXIMUM-LIKELIHOOD; EM ALGORITHM; CLASSIFICATION; CURVES;
D O I
10.1080/00949655.2015.1109096
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a new unsupervised learning algorithm to fit regression mixture models with unknown number of components. The developed approach consists in a penalized maximum likelihood estimation carried out by a robust expectation-maximization (EM)-like algorithm. We derive it for polynomial, spline, and B-spline regression mixtures. The proposed learning approach is unsupervised: (i) it simultaneously infers the model parameters and the optimal number of the regression mixture components from the data as the learning proceeds, rather than in a two-fold scheme as in standard model-based clustering using afterward model selection criteria, and (ii) it does not require accurate initialization unlike the standard EM for regression mixtures. The developed approach is applied to curve clustering problems. Numerical experiments on simulated and real data show that the proposed algorithm performs well and provides accurate clustering results, and confirm its benefit for practical applications.
引用
收藏
页码:2308 / 2334
页数:27
相关论文
共 50 条
  • [1] Variable selection in finite mixture of regression models with an unknown number of components
    Lee, Kuo-Jung
    Feldkircher, Martin
    Chen, Yi-Chi
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 158
  • [2] Particle filters for mixture models with an unknown number of components
    Fearnhead, P
    [J]. STATISTICS AND COMPUTING, 2004, 14 (01) : 11 - 21
  • [3] Overfitting Bayesian Mixture Models with an Unknown Number of Components
    van Havre, Zoe
    White, Nicole
    Rousseau, Judith
    Mengersen, Kerrie
    [J]. PLOS ONE, 2015, 10 (07):
  • [4] Particle filters for mixture models with an unknown number of components
    Paul Fearnhead
    [J]. Statistics and Computing, 2004, 14 : 11 - 21
  • [5] Unsupervised learning of mixture regression models for longitudinal data
    Xu, Peirong
    Peng, Heng
    Huang, Tao
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 125 : 44 - 56
  • [6] Testing the Number of Components in Normal Mixture Regression Models
    Kasahara, Hiroyuki
    Shimotsu, Katsumi
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (512) : 1632 - 1645
  • [7] Determining the number of components in mixture regression models: an experimental design
    Brochado, Ana
    Martins, Vitorino
    [J]. ECONOMICS BULLETIN, 2020, 40 (02): : 1465 - 1474
  • [8] Initializing the EM algorithm in Gaussian mixture models with an unknown number of components
    Melnykov, Volodymyr
    Melnykov, Igor
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) : 1381 - 1395
  • [9] Bayesian Analysis of Mixture Structural Equation Models With an Unknown Number of Components
    Liu, Hefei
    Song, Xin Yuan
    [J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2018, 25 (01) : 41 - 55
  • [10] Physical mixture modeling with unknown number of components
    Fischer, R
    Dose, V
    [J]. BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2002, 617 : 143 - 154