Variable selection in finite mixture of regression models

Cited by: 165
Authors
Khalili, Abbas [1 ]
Chen, Jiahua [2]
Affiliations
[1] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
[2] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z2, Canada
Keywords
EM algorithm; LASSO; mixture model; penalty method; SCAD;
DOI
10.1198/016214507000000590
Chinese Library Classification (CLC)
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline Classification Codes
020208; 070103; 0714
Abstract
In applications of finite mixture of regression (FMR) models, many covariates are often used, and their contributions to the response variable vary from one component of the mixture to another. This creates a complex variable selection problem. Existing methods, such as the Akaike information criterion and the Bayes information criterion, become computationally expensive as the number of covariates and mixture components increases. In this article we introduce a penalized likelihood approach for variable selection in FMR models. The new method introduces penalties that depend on the size of the regression coefficients and on the mixture structure. The new method is shown to be consistent for variable selection. A data-adaptive method for selecting tuning parameters and an EM algorithm for efficient numerical computation are developed. Simulations show that the method performs very well and requires much less computing power than existing methods. The new method is illustrated by analyzing two real data sets.
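The approach sketched in the abstract, maximizing a mixture likelihood with coefficient-size penalties via EM, can be illustrated with a minimal sketch. The following is an assumption-laden toy version, not the paper's exact procedure: a two-component mixture of linear regressions with an L1 (LASSO-type) penalty, where the M-step uses one sweep of coordinate descent with soft-thresholding and, echoing the paper's idea of penalties that depend on the mixture structure, the threshold for each component is scaled by its mixing proportion. Function and variable names here are illustrative.

```python
import numpy as np

def soft_threshold(z, t):
    # Soft-thresholding operator, the proximal map of an L1 penalty.
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def penalized_em_fmr(X, y, lam=0.05, n_iter=200, seed=0):
    """Toy penalized EM for a 2-component mixture of linear regressions.

    Illustrative sketch only: uses a plain LASSO penalty scaled by the
    mixing proportions, not the SCAD/adaptive penalties of the paper.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    beta = rng.normal(scale=0.1, size=(2, p))   # small random init breaks symmetry
    pi = np.array([0.5, 0.5])
    sigma2 = np.array([np.var(y), np.var(y)])
    for _ in range(n_iter):
        # E-step: posterior probability that each observation belongs to each component.
        dens = np.empty((n, 2))
        for k in range(2):
            resid = y - X @ beta[k]
            dens[:, k] = (pi[k] / np.sqrt(2 * np.pi * sigma2[k])
                          * np.exp(-0.5 * resid**2 / sigma2[k]))
        w = dens / dens.sum(axis=1, keepdims=True)
        # M-step: update mixing proportions, then weighted penalized least squares.
        pi = w.mean(axis=0)
        for k in range(2):
            for j in range(p):
                # Partial residual excluding covariate j, then soft-threshold its update.
                r = y - X @ beta[k] + X[:, j] * beta[k, j]
                num = np.sum(w[:, k] * X[:, j] * r)
                den = np.sum(w[:, k] * X[:, j] ** 2)
                beta[k, j] = soft_threshold(num, n * pi[k] * lam) / den
            resid = y - X @ beta[k]
            sigma2[k] = np.sum(w[:, k] * resid**2) / np.sum(w[:, k])
    return pi, beta, sigma2
```

Irrelevant covariates are shrunk exactly to zero by the soft-thresholding step, so variable selection and estimation happen in one pass; the paper's actual penalties (e.g. SCAD) reduce the bias that a plain LASSO penalty induces on large coefficients.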
Pages: 1025-1038
Page count: 14