Assessing variable importance in survival analysis using machine learning

被引:0
|
作者
Wolock, C. J. [1 ]
Gilbert, P. B. [2 ]
Simon, N. [3 ]
Carone, M. [3 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, 432 Guardian Dr, Philadelpia, PA 19104 USA
[2] Fred Hutchinson Canc Ctr, Vaccine & Infect Dis Div, 1100 Fairview Ave North,POB 19024, Seattle, WA 98109 USA
[3] Univ Washington, Dept Biostat, 3980 15th Ave NE, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
Censoring; Debiased machine learning; Feature importance; Time-to-event outcome; PREDICTIVE ACCURACY; EXPLAINED VARIATION; HIV; MODELS; ACQUISITION; MEN; SEX;
D O I
10.1093/biomet/asae061
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Given a collection of features available for inclusion in a predictive model, it may be of interest to quantify the relative importance of a subset of features for the prediction task at hand. For example, in HIV vaccine trials, participant baseline characteristics are used to predict the probability of HIV acquisition over the intended follow-up period, and investigators may wish to understand how much certain types of predictors, such as behavioural factors, contribute to overall predictiveness. Time-to-event outcomes such as time to HIV acquisition are often subject to right censoring, and existing methods for assessing variable importance are typically not intended to be used in this setting. We describe a broad class of algorithm-agnostic variable importance measures for prediction in the context of survival data. We propose a nonparametric efficient estimation procedure that incorporates flexible learning of nuisance parameters, yields asymptotically valid inference and enjoys double robustness. We assess the performance of our proposed procedure via numerical simulations and analyse data from the HVTN 702 vaccine trial to inform enrolment strategies for future HIV vaccine trials.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Rejoinder to "Nonparametric variable importance assessment using machine learning techniques"
    Williamson, Brian D.
    Gilbert, Peter B.
    Carone, Marco
    Simon, Noah
    BIOMETRICS, 2021, 77 (01) : 28 - 30
  • [2] Visualizing Variable Importance and Variable Interaction Effects in Machine Learning Models
    Inglis, Alan
    Parnell, Andrew
    Hurley, Catherine B.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (03) : 766 - 778
  • [3] Shapley variable importance cloud for interpretable machine learning
    Ning, Yilin
    Ong, Marcus Eng Hock
    Chakraborty, Bibhas
    Goldstein, Benjamin Alan
    Ting, Daniel Shu Wei
    Vaughan, Roger
    Liu, Nan
    PATTERNS, 2022, 3 (04):
  • [4] Controlling factors in the variability of soil magnetic measures by machine learning and variable importance analysis
    Azizi, Kamran
    Ayoubi, Shamsollah
    Dematte, Jose A. M.
    JOURNAL OF APPLIED GEOPHYSICS, 2023, 210
  • [5] Using EHRs and Machine Learning for Heart Failure Survival Analysis
    Panahiazar, Maryam
    Taslimitehrani, Vahid
    Pereira, Naveen
    Pathak, Jyotishman
    MEDINFO 2015: EHEALTH-ENABLED HEALTH, 2015, 216 : 40 - 44
  • [6] Survival analysis using machine learning in transplantation: a practical introduction
    Andrea Garcia-Lopez
    Maritza Jiménez-Gómez
    Andrea Gomez-Montero
    Juan Camilo Gonzalez-Sierra
    Santiago Cabas
    Fernando Giron-Luque
    BMC Medical Informatics and Decision Making, 25 (1)
  • [7] Survival Analysis of Oncological Patients Using Machine Learning Method
    Al Fryan, Latefa Hamad
    Alazzam, Malik Bader
    HEALTHCARE, 2023, 11 (01)
  • [8] Analysis of analysis: Using machine learning to evaluate the importance of music parameters for Schenkerian analysis
    Kirlin, Phillip B.
    Yust, Jason
    JOURNAL OF MATHEMATICS AND MUSIC, 2016, 10 (02) : 127 - 148
  • [9] vivid: An R package for Variable Importance and Variable Interactions Displays for Machine Learning Models
    Inglis, Alan
    Parnell, Andrew
    Hurley, Catherine
    R JOURNAL, 2023, 15 (02): : 344 - 361
  • [10] Machine Learning for Survival Analysis: A Survey
    Wang, Ping
    Li, Yan
    Reddy, Chandan K.
    ACM COMPUTING SURVEYS, 2019, 51 (06)