Penalized Quantile Regression for Distributed Big Data Using the Slack Variable Representation

被引:6
|
作者
Fan, Ye [1 ]
Lin, Nan [2 ]
Yin, Xianjun [1 ]
机构
[1] Cent Univ Finance & Econ, Sch Stat & Math, Beijing, Peoples R China
[2] Washington Univ, Dept Math & Stat, St Louis, MO 63130 USA
关键词
ADMM; Big data; Nonconvex penalty; Quantile regression; SELECTION;
D O I
10.1080/10618600.2020.1840996
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Penalized quantile regression is a widely used tool for analyzing high-dimensional data with heterogeneity. Although its estimation theory has been well studied in the literature, its computation still remains a challenge in big data, due to the nonsmoothness of the check loss function and the possible nonconvexity of the penalty term. In this article, we propose the QPADM-slack method, a parallel algorithm formulated via the alternating direction method of multipliers (ADMM) that supports penalized quantile regression in big data. Our proposal is different from the recent QPADM algorithm and uses the slack variable representation of the quantile regression problem. Simulation studies demonstrate that this new formulation is significantly faster than QPADM, especially when the data volume n or the dimension p is large, and has favorable estimation accuracy in big data analysis for both nondistributed and distributed environments. We further illustrate the practical performance of QPADM-slack by analyzing a news popularity dataset.
引用
收藏
页码:557 / 565
页数:9
相关论文
共 50 条
  • [41] Variable selection in censored quantile regression with high dimensional data
    Fan, Yali
    Tang, Yanlin
    Zhu, Zhongyi
    [J]. SCIENCE CHINA-MATHEMATICS, 2018, 61 (04) : 641 - 658
  • [42] Penalized empirical likelihood based variable selection for partially linear quantile regression models with missing responses
    Tang, Xinrong
    Zhao, Peixin
    [J]. HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2018, 47 (03): : 721 - 739
  • [43] Quantile Regression for Hypothesis Testing and Hypothesis Screening at the Dawn of Big Data
    Rehkopf, David H.
    [J]. EPIDEMIOLOGY, 2012, 23 (05) : 665 - 667
  • [44] Bayesian variable selection and estimation in quantile regression using a quantile-specific prior
    Mai Dao
    Min Wang
    Souparno Ghosh
    Keying Ye
    [J]. Computational Statistics, 2022, 37 : 1339 - 1368
  • [45] Bayesian variable selection and estimation in quantile regression using a quantile-specific prior
    Dao, Mai
    Wang, Min
    Ghosh, Souparno
    Ye, Keying
    [J]. COMPUTATIONAL STATISTICS, 2022, 37 (03) : 1339 - 1368
  • [46] Variable selection in additive quantile regression using nonconcave penalty
    Zhao, Kaifeng
    Lian, Heng
    [J]. STATISTICS, 2016, 50 (06) : 1276 - 1289
  • [47] Robust and smoothing variable selection for quantile regression models with longitudinal data
    Fu, Z. C.
    Fu, L. Y.
    Song, Y. N.
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (15) : 2600 - 2624
  • [48] Communication-efficient sparse composite quantile regression for distributed data
    Yang, Yaohong
    Wang, Lei
    [J]. METRIKA, 2023, 86 (03) : 261 - 283
  • [49] Communication-efficient sparse composite quantile regression for distributed data
    Yaohong Yang
    Lei Wang
    [J]. Metrika, 2023, 86 : 261 - 283
  • [50] Multi-round smoothed composite quantile regression for distributed data
    Fengrui Di
    Lei Wang
    [J]. Annals of the Institute of Statistical Mathematics, 2022, 74 : 869 - 893