l1-PENALIZED QUANTILE REGRESSION IN HIGH-DIMENSIONAL SPARSE MODELS

被引:352
|
作者
Belloni, Alexandre [1 ]
Chernozhukov, Victor [2 ,3 ]
机构
[1] Duke Univ, Fuqua Sch Business, Durham, NC 27708 USA
[2] MIT, Dept Econ, Cambridge, MA 02142 USA
[3] MIT, Ctr Operat Res, Cambridge, MA 02142 USA
来源
ANNALS OF STATISTICS | 2011年 / 39卷 / 01期
基金
美国国家科学基金会;
关键词
Median regression; quantile regression; sparse models; LASSO; AGGREGATION; ESTIMATORS; SELECTION; RECOVERY;
D O I
10.1214/10-AOS827
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider median regression and, more generally, a possibly infinite collection of quantile regressions in high-dimensional sparse models. In these models, the number of regressors p is very large, possibly larger than the sample size n, but only at most s regressors have a nonzero impact on each conditional quantile of the response variable, where s grows more slowly than n. Since ordinary quantile regression is not consistent in this case, we consider l(1)-penalized quantile regression (l(1)-QR), which penalizes the l(1)-norm of regression coefficients, as well as the post-penalized QR estimator (post-l(1)-QR), which applies ordinary QR to the model selected by l(1)-QR. First, we show that under general conditions l(1)-QR is consistent at the near-oracle rate. root s/n root log(p boolean OR n), uniformly in the compact set u subset of (0, 1) of quantile indices. In deriving this result, we propose a partly pivotal, data-driven choice of the penalty level and show that it satisfies the requirements for achieving this rate. Second, we show that under similar conditions post-l(1)-QR is consistent at the near-oracle rate root s/n root log(p boolean OR n), uniformly over u, even if the l(1)-QR-selected models miss some components of the true models, and the rate could be even closer to the oracle rate otherwise. Third, we characterize conditions under which l(1)-QR contains the true model as a submodel, and derive bounds on the dimension of the selected model, uniformly over u; we also provide conditions under which hard-thresholding selects the minimal true model, uniformly over u.
引用
收藏
页码:82 / 130
页数:49
相关论文
共 50 条
  • [31] Unconditional quantile regression with high-dimensional data
    Sasaki, Yuya
    Ura, Takuya
    Zhang, Yichong
    [J]. QUANTITATIVE ECONOMICS, 2022, 13 (03) : 955 - 978
  • [32] Inference for High-Dimensional Censored Quantile Regression
    Fei, Zhe
    Zheng, Qi
    Hong, Hyokyoung G.
    Li, Yi
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 898 - 912
  • [33] Efficient sparse portfolios based on composite quantile regression for high-dimensional index tracking
    Li, Ning
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (08) : 1466 - 1478
  • [34] SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES
    Zhu, Liping
    Huang, Mian
    Li, Runze
    [J]. STATISTICA SINICA, 2012, 22 (04) : 1379 - 1401
  • [35] High-dimensional model averaging for quantile regression
    Xie, Jinhan
    Ding, Xianwen
    Jiang, Bei
    Yan, Xiaodong
    Kong, Linglong
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2024, 52 (02): : 618 - 635
  • [36] Sparse High-Dimensional Isotonic Regression
    Gamarnik, David
    Gaudio, Julia
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [37] Efficient distributed optimization for large-scale high-dimensional sparse penalized Huber regression
    Pan, Yingli
    Xu, Kaidong
    Wei, Sha
    Wang, Xiaojuan
    Liu, Zhan
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (07) : 3106 - 3125
  • [38] PENALIZED LINEAR REGRESSION WITH HIGH-DIMENSIONAL PAIRWISE SCREENING
    Gong, Siliang
    Zhang, Kai
    Liu, Yufeng
    [J]. STATISTICA SINICA, 2021, 31 (01) : 391 - 420
  • [39] A lack-of-fit test for quantile regression models with high-dimensional covariates
    Conde-Amboage, Mercedes
    Sanchez-Sellero, Cesar
    Gonzalez-Manteiga, Wenceslao
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 88 : 128 - 138
  • [40] Variable selection in high-dimensional sparse multiresponse linear regression models
    Luo, Shan
    [J]. STATISTICAL PAPERS, 2020, 61 (03) : 1245 - 1267