Ultra-high dimensional variable selection for doubly robust causal inference

被引:5
|
作者
Tang, Dingke [1 ]
Kong, Dehan [1 ]
Pan, Wenliang [2 ]
Wang, Linbo [1 ]
机构
[1] Univ Toronto, Dept Stat Sci, Toronto, ON M5S 3G3, Canada
[2] Sun Yat Sen Univ, Sch Math, Dept Stat Sci, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Alzheimer's disease; average causal effect; ball covariance; confounder selection; variable screening; PROPENSITY SCORE; ALZHEIMERS-DISEASE; MODEL SELECTION; ADAPTIVE LASSO; EFFICIENT; TAU; BIOMARKERS;
D O I
10.1111/biom.13625
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Causal inference has been increasingly reliant on observational studies with rich covariate information. To build tractable causal procedures, such as the doubly robust estimators, it is imperative to first extract important features from high or even ultra-high dimensional data. In this paper, we propose causal ball screening for confounder selection from modern ultra-high dimensional data sets. Unlike the familiar task of variable selection for prediction modeling, our confounder selection procedure aims to control for confounding while improving efficiency in the resulting causal effect estimate. Previous empirical and theoretical studies suggest excluding causes of the treatment that are not confounders. Motivated by these results, our goal is to keep all the predictors of the outcome in both the propensity score and outcome regression models. A distinctive feature of our proposal is that we use an outcome model-free procedure for propensity score model selection, thereby maintaining double robustness in the resulting causal effect estimator. Our theoretical analyses show that the proposed procedure enjoys a number of properties, including model selection consistency and pointwise normality. Synthetic and real data analysis show that our proposal performs favorably with existing methods in a range of realistic settings. Data used in preparation of this paper were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database.
引用
收藏
页码:903 / 914
页数:12
相关论文
共 50 条
  • [1] Robust adaptive variable selection in ultra-high dimensional linear regression models
    Ghosh, Abhik
    Jaenada, Maria
    Pardo, Leandro
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (03) : 571 - 603
  • [2] A robust variable screening procedure for ultra-high dimensional data
    Ghosh, Abhik
    Thoresen, Magne
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (08) : 1816 - 1832
  • [3] Bayesian Multiresolution Variable Selection for Ultra-High Dimensional Neuroimaging Data
    Zhao, Yize
    Kang, Jian
    Long, Qi
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (02) : 537 - 550
  • [4] Forward variable selection for ultra-high dimensional quantile regression models
    Honda, Toshio
    Lin, Chien-Tong
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2023, 75 (03) : 393 - 424
  • [5] Forward variable selection for ultra-high dimensional quantile regression models
    Toshio Honda
    Chien-Tong Lin
    [J]. Annals of the Institute of Statistical Mathematics, 2023, 75 : 393 - 424
  • [6] Forward Variable Selection for Sparse Ultra-High Dimensional Varying Coefficient Models
    Cheng, Ming-Yen
    Honda, Toshio
    Zhang, Jin-Ting
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (515) : 1209 - 1221
  • [7] A Bayesian view of doubly robust causal inference
    Saarela, O.
    Belzile, L. R.
    Stephens, D. A.
    [J]. BIOMETRIKA, 2016, 103 (03) : 667 - 681
  • [8] Enhanced Doubly Robust Procedure for Causal Inference
    Ao Yuan
    Anqi Yin
    Ming T. Tan
    [J]. Statistics in Biosciences, 2021, 13 : 454 - 478
  • [9] Relaxed doubly robust estimation in causal inference
    Xu, Tinghui
    Zhao, Jiwei
    [J]. STATISTICAL THEORY AND RELATED FIELDS, 2024, 8 (01) : 69 - 79
  • [10] Enhanced Doubly Robust Procedure for Causal Inference
    Yuan, Ao
    Yin, Anqi
    Tan, Ming T.
    [J]. STATISTICS IN BIOSCIENCES, 2021, 13 (03) : 454 - 478