Leveraging independence in high-dimensional mixed linear regression

被引:0
|
作者
Wang, Ning [1 ]
Deng, Kai [2 ]
Mai, Qing [2 ]
Zhang, Xin [2 ]
机构
[1] Beijing Normal Univ, Dept Stat, Zhuhai 519000, Peoples R China
[2] Florida State Univ, Dept Stat, 17 N Woodward Ave, Tallahassee, FL 32312 USA
基金
美国国家科学基金会;
关键词
EM algorithm; finite mixture model; group lasso; latent variable model; GAUSSIAN MIXTURES; EM ALGORITHM;
D O I
10.1093/biomtc/ujae103
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We address the challenge of estimating regression coefficients and selecting relevant predictors in the context of mixed linear regression in high dimensions, where the number of predictors greatly exceeds the sample size. Recent advancements in this field have centered on incorporating sparsity-inducing penalties into the expectation-maximization (EM) algorithm, which seeks to maximize the conditional likelihood of the response given the predictors. However, existing procedures often treat predictors as fixed or overlook their inherent variability. In this paper, we leverage the independence between the predictor and the latent indicator variable of mixtures to facilitate efficient computation and also achieve synergistic variable selection across all mixture components. We establish the non-asymptotic convergence rate of the proposed fast group-penalized EM estimator to the true regression parameters. The effectiveness of our method is demonstrated through extensive simulations and an application to the Cancer Cell Line Encyclopedia dataset for the prediction of anticancer drug sensitivity.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Variational Inference in high-dimensional linear regression
    Mukherjee, Sumit
    Sen, Subhabrata
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [2] ACCURACY ASSESSMENT FOR HIGH-DIMENSIONAL LINEAR REGRESSION
    Cai, T. Tony
    Guo, Zijian
    ANNALS OF STATISTICS, 2018, 46 (04): : 1807 - 1836
  • [3] Prediction in abundant high-dimensional linear regression
    Cook, R. Dennis
    Forzani, Liliana
    Rothman, Adam J.
    ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 3059 - 3088
  • [4] Elementary Estimators for High-Dimensional Linear Regression
    Yang, Eunho
    Lozano, Aurelie C.
    Ravikumar, Pradeep
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 388 - 396
  • [5] Variational Inference in high-dimensional linear regression
    Mukherjee, Sumit
    Sen, Subhabrata
    Journal of Machine Learning Research, 2022, 23
  • [6] A Note on High-Dimensional Linear Regression With Interactions
    Hao, Ning
    Zhang, Hao Helen
    AMERICAN STATISTICIAN, 2017, 71 (04): : 291 - 297
  • [7] High-Dimensional Mixed-Frequency IV Regression
    Babii, Andrii
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2022, 40 (04) : 1470 - 1483
  • [8] High-dimensional linear regression via implicit regularization
    Zhao, Peng
    Yang, Yun
    He, Qiao-Chu
    BIOMETRIKA, 2022, 109 (04) : 1033 - 1046
  • [9] A Model Selection Criterion for High-Dimensional Linear Regression
    Owrang, Arash
    Jansson, Magnus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (13) : 3436 - 3446
  • [10] New approach to Bayesian high-dimensional linear regression
    Jalali, Shirin
    Maleki, Arian
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2018, 7 (04) : 605 - 655