Leveraging independence in high-dimensional mixed linear regression

被引:0
|
作者
Wang, Ning [1 ]
Deng, Kai [2 ]
Mai, Qing [2 ]
Zhang, Xin [2 ]
机构
[1] Beijing Normal Univ, Dept Stat, Zhuhai 519000, Peoples R China
[2] Florida State Univ, Dept Stat, 17 N Woodward Ave, Tallahassee, FL 32312 USA
基金
美国国家科学基金会;
关键词
EM algorithm; finite mixture model; group lasso; latent variable model; GAUSSIAN MIXTURES; EM ALGORITHM;
D O I
10.1093/biomtc/ujae103
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We address the challenge of estimating regression coefficients and selecting relevant predictors in the context of mixed linear regression in high dimensions, where the number of predictors greatly exceeds the sample size. Recent advancements in this field have centered on incorporating sparsity-inducing penalties into the expectation-maximization (EM) algorithm, which seeks to maximize the conditional likelihood of the response given the predictors. However, existing procedures often treat predictors as fixed or overlook their inherent variability. In this paper, we leverage the independence between the predictor and the latent indicator variable of mixtures to facilitate efficient computation and also achieve synergistic variable selection across all mixture components. We establish the non-asymptotic convergence rate of the proposed fast group-penalized EM estimator to the true regression parameters. The effectiveness of our method is demonstrated through extensive simulations and an application to the Cancer Cell Line Encyclopedia dataset for the prediction of anticancer drug sensitivity.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] THE TAP FREE ENERGY FOR HIGH-DIMENSIONAL LINEAR REGRESSION
    Qiu, Jiaze
    Sen, Subhabrata
    ANNALS OF APPLIED PROBABILITY, 2023, 33 (04): : 2643 - 2680
  • [22] Empirical likelihood for high-dimensional linear regression models
    Guo, Hong
    Zou, Changliang
    Wang, Zhaojun
    Chen, Bin
    METRIKA, 2014, 77 (07) : 921 - 945
  • [23] SPReM: Sparse Projection Regression Model For High-Dimensional Linear Regression
    Sun, Qiang
    Zhu, Hongtu
    Liu, Yufeng
    Ibrahim, Joseph G.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (509) : 289 - 302
  • [24] Tests for high-dimensional partially linear regression modelsTests for high-dimensional partially linear regression modelsH. Shi et al.
    Hongwei Shi
    Weichao Yang
    Bowen Sun
    Xu Guo
    Statistical Papers, 2025, 66 (3)
  • [25] Boosting for high-multivariate responses in high-dimensional linear regression
    Lutz, Roman Werner
    Buehlmann, Peter
    STATISTICA SINICA, 2006, 16 (02) : 471 - 494
  • [26] Robust transfer learning for high-dimensional regression with linear constraints
    Chen, Xuan
    Song, Yunquan
    Wang, Yuanfeng
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (11) : 2462 - 2482
  • [27] MODEL SELECTION FOR HIGH-DIMENSIONAL LINEAR REGRESSION WITH DEPENDENT OBSERVATIONS
    Ing, Ching-Kang
    ANNALS OF STATISTICS, 2020, 48 (04): : 1959 - 1980
  • [28] Variable screening in multivariate linear regression with high-dimensional covariates
    Bizuayehu, Shiferaw B.
    Li, Lu
    Xu, Jin
    STATISTICAL THEORY AND RELATED FIELDS, 2022, 6 (03) : 241 - 253
  • [29] HIGH-DIMENSIONAL LINEAR REGRESSION FOR DEPENDENT DATA WITH APPLICATIONS TO NOWCASTING
    Han, Yuefeng
    Tsay, Ruey S.
    STATISTICA SINICA, 2020, 30 (04) : 1797 - 1827
  • [30] Sparsity Oriented Importance Learning for High-Dimensional Linear Regression
    Ye, Chenglong
    Yang, Yi
    Yang, Yuhong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (524) : 1797 - 1812