A MODEL-AVERAGING METHOD FOR HIGH-DIMENSIONAL REGRESSION WITH MISSING RESPONSES AT RANDOM

被引:13
|
作者
Xie, Jinhan [1 ]
Yan, Xiaodong [2 ]
Tang, Niansheng [1 ]
机构
[1] Yunnan Univ, Key Lab Stat Modeling & Data Anal Yunnan Prov, Kunming 650500, Yunnan, Peoples R China
[2] Shandong Univ, Sch Econ, Jinan 250100, Peoples R China
基金
中国国家自然科学基金;
关键词
High-dimensional data; missing at random; model averaging; multiple imputation; screening; weighted delete-one cross-validation; GENERALIZED LINEAR-MODELS; EMPIRICAL LIKELIHOOD; VARIABLE SELECTION;
D O I
10.5705/ss.202018.0297
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This study considers the ultrahigh-dimensional prediction problem in the presence of responses missing at random. A two-step model-averaging procedure is proposed to improve the prediction accuracy of the conditional mean of the response variable. The first step specifies several candidate models, each with low-dimensional predictors. To implement this step, a new feature-screening method is developed to distinguish between the active and inactive predictors. The method uses the multiple-imputation sure independence screening (MI-SIS) procedure, and candidate models are formed by grouping covariates with similar size MI-SIS values. The second step develops a new criterion to find the optimal weights for averaging a set of candidate models using weighted delete-one cross-validation (WDCV). Under some regularity conditions, we show that the proposed screening statistic enjoys the ranking consistency property, and that the WDCV criterion asymptotically achieves the lowest possible prediction loss. Simulation studies and an example demonstrate the proposed methodology.
引用
收藏
页码:1005 / 1026
页数:22
相关论文
共 50 条
  • [1] A Model-Averaging Approach for High-Dimensional Regression
    Ando, Tomohiro
    Li, Ker-Chau
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (505) : 254 - 265
  • [2] High-dimensional model averaging for quantile regression
    Xie, Jinhan
    Ding, Xianwen
    Jiang, Bei
    Yan, Xiaodong
    Kong, Linglong
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2024, 52 (02): : 618 - 635
  • [3] Jackknife model averaging for high-dimensional quantile regression
    Wang, Miaomiao
    Zhang, Xinyu
    Wan, Alan T. K.
    You, Kang
    Zou, Guohua
    BIOMETRICS, 2023, 79 (01) : 178 - 189
  • [4] Mallows model averaging based on kernel regression imputation with responses missing at random
    Zhu, Hengkun
    Zou, Guohua
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2024, 231
  • [5] A model-averaging approach for smoothing spline regression
    Xu, Liwen
    Zhou, Jiabin
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2019, 48 (08) : 2438 - 2451
  • [6] Nested model averaging on solution path for high-dimensional linear regression
    Feng, Yang
    Liu, Qingfeng
    STAT, 2020, 9 (01):
  • [7] Model averaging for linear models with responses missing at random
    Yuting Wei
    Qihua Wang
    Wei Liu
    Annals of the Institute of Statistical Mathematics, 2021, 73 : 535 - 553
  • [8] Model averaging for linear models with responses missing at random
    Wei, Yuting
    Wang, Qihua
    Liu, Wei
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (03) : 535 - 553
  • [9] Functional Martingale Residual Process for High-Dimensional Cox Regression with Model Averaging
    He, Baihua
    Liu, Yanyan
    Wu, Yuanshan
    Yin, Guosheng
    Zhao, Xingqiu
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [10] Bayesian variable selection and model averaging in high-dimensional multinomial nonparametric regression
    Yau, P
    Kohn, R
    Wood, S
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (01) : 23 - 54