Assessing variable importance in survival analysis using machine learning

被引:0
|
作者
Wolock, C. J. [1 ]
Gilbert, P. B. [2 ]
Simon, N. [3 ]
Carone, M. [3 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, 432 Guardian Dr, Philadelpia, PA 19104 USA
[2] Fred Hutchinson Canc Ctr, Vaccine & Infect Dis Div, 1100 Fairview Ave North,POB 19024, Seattle, WA 98109 USA
[3] Univ Washington, Dept Biostat, 3980 15th Ave NE, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
Censoring; Debiased machine learning; Feature importance; Time-to-event outcome; PREDICTIVE ACCURACY; EXPLAINED VARIATION; HIV; MODELS; ACQUISITION; MEN; SEX;
D O I
10.1093/biomet/asae061
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Given a collection of features available for inclusion in a predictive model, it may be of interest to quantify the relative importance of a subset of features for the prediction task at hand. For example, in HIV vaccine trials, participant baseline characteristics are used to predict the probability of HIV acquisition over the intended follow-up period, and investigators may wish to understand how much certain types of predictors, such as behavioural factors, contribute to overall predictiveness. Time-to-event outcomes such as time to HIV acquisition are often subject to right censoring, and existing methods for assessing variable importance are typically not intended to be used in this setting. We describe a broad class of algorithm-agnostic variable importance measures for prediction in the context of survival data. We propose a nonparametric efficient estimation procedure that incorporates flexible learning of nuisance parameters, yields asymptotically valid inference and enjoys double robustness. We assess the performance of our proposed procedure via numerical simulations and analyse data from the HVTN 702 vaccine trial to inform enrolment strategies for future HIV vaccine trials.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Predictive Modeling of Korean Traditional Paper Characteristics Using Machine Learning Approaches (Part 2): Prediction of Carbonyl Content and Analysis of Variable Importance Using Random Forest
    Kim K.-J.
    Kim J.-H.
    Park G.
    Jeong M.-J.
    Palpu Chongi Gisul/Journal of Korea Technical Association of the Pulp and Paper Industry, 2023, 55 (05): : 13 - 23
  • [32] Modeling Pipe Break Data Using Survival Analysis with Machine Learning Imputation Methods
    Xu, Hao
    Sinha, Sunil K.
    JOURNAL OF PERFORMANCE OF CONSTRUCTED FACILITIES, 2021, 35 (05)
  • [33] Prostate cancer prognosis using machine learning: A critical review of survival analysis methods
    Ahuja, Garvita
    Kaur, Ishleen
    Lamba, Puneet Singh
    Virmani, Deepali
    Jain, Achin
    Chakraborty, Somenath
    Mallik, Saurav
    PATHOLOGY RESEARCH AND PRACTICE, 2024, 264
  • [34] Dynamic Predictive Models With Visualized Machine Learning for Assessing Chondrosarcoma Overall Survival
    Li, Wenle
    Wang, Gui
    Wu, Rilige
    Dong, Shengtao
    Wang, Haosheng
    Xu, Chan
    Wang, Bing
    Li, Wanying
    Hu, Zhaohui
    Chen, Qi
    Yin, Chengliang
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [36] Robust Machine Learning Variable Importance Analyses of Medical Conditions for Health Care Spending
    Rose, Sherri
    HEALTH SERVICES RESEARCH, 2018, 53 (05) : 3836 - 3854
  • [37] Machine learning assessment of the importance of unirradiated yield strength as a variable in embrittlement trend forecasting
    Ferreno, Diego
    Erickson, Marjorie
    Kirk, Mark
    Sainz-Aja, Jose A.
    INTERNATIONAL JOURNAL OF PRESSURE VESSELS AND PIPING, 2025, 214
  • [38] Assessing Variable Importance for Best Subset Selection
    Seedorff, Jacob
    Cavanaugh, Joseph E.
    ENTROPY, 2024, 26 (09)
  • [39] Predicting compressive strength of grouted masonry using machine learning models with feature importance analysis
    Sathiparan, Navaratnarajah
    MATERIALS TODAY COMMUNICATIONS, 2024, 41
  • [40] Snowplow Truck Performance Assessment and Feature Importance Analysis Using Machine-Learning Techniques
    Yi, Zhiyan
    Liu, Xiaoyue Cathy
    Wei, Ran
    Grubesic, Tony H.
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2021, 147 (02)