Assessing variable importance in survival analysis using machine learning

被引:0
|
作者
Wolock, C. J. [1 ]
Gilbert, P. B. [2 ]
Simon, N. [3 ]
Carone, M. [3 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, 432 Guardian Dr, Philadelpia, PA 19104 USA
[2] Fred Hutchinson Canc Ctr, Vaccine & Infect Dis Div, 1100 Fairview Ave North,POB 19024, Seattle, WA 98109 USA
[3] Univ Washington, Dept Biostat, 3980 15th Ave NE, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
Censoring; Debiased machine learning; Feature importance; Time-to-event outcome; PREDICTIVE ACCURACY; EXPLAINED VARIATION; HIV; MODELS; ACQUISITION; MEN; SEX;
D O I
10.1093/biomet/asae061
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Given a collection of features available for inclusion in a predictive model, it may be of interest to quantify the relative importance of a subset of features for the prediction task at hand. For example, in HIV vaccine trials, participant baseline characteristics are used to predict the probability of HIV acquisition over the intended follow-up period, and investigators may wish to understand how much certain types of predictors, such as behavioural factors, contribute to overall predictiveness. Time-to-event outcomes such as time to HIV acquisition are often subject to right censoring, and existing methods for assessing variable importance are typically not intended to be used in this setting. We describe a broad class of algorithm-agnostic variable importance measures for prediction in the context of survival data. We propose a nonparametric efficient estimation procedure that incorporates flexible learning of nuisance parameters, yields asymptotically valid inference and enjoys double robustness. We assess the performance of our proposed procedure via numerical simulations and analyse data from the HVTN 702 vaccine trial to inform enrolment strategies for future HIV vaccine trials.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Sign-Consistency Based Variable Importance for Machine Learning in Brain Imaging
    Vanessa Gómez-Verdejo
    Emilio Parrado-Hernández
    Jussi Tohka
    Neuroinformatics, 2019, 17 : 593 - 609
  • [22] Sign-Consistency Based Variable Importance for Machine Learning in Brain Imaging
    Gomez-Verdejo, Vanessa
    Parrado-Hernandez, Emilio
    Tohka, Jussi
    NEUROINFORMATICS, 2019, 17 (04) : 593 - 609
  • [23] Machine learning insights: probing the variable importance of ex-ante information
    Albada, Ali
    Abusham, Eimad Eldin
    Ong, Chui Zi
    Qatiti, Khalid Al
    MANAGERIAL FINANCE, 2025, 51 (01) : 12 - 31
  • [24] Ranking the Importance of Variables in a Nonparametric Frontier Analysis Using Unsupervised Machine Learning Techniques
    Moragues, Raul
    Aparicio, Juan
    Esteve, Miriam
    MATHEMATICS, 2023, 11 (11)
  • [25] Comparative Analysis of Machine Learning and Deep Learning Algorithms for Assessing Agricultural Product Quality Using NIRS
    Ren, Jiwen
    Xiong, Yuming
    Chen, Xinyu
    Hao, Yong
    SENSORS, 2024, 24 (16)
  • [26] ASSESSING CREDIT CARD APPLICATIONS USING MACHINE LEARNING
    CARTER, C
    CATLETT, J
    IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1987, 2 (03): : 71 - 79
  • [27] Assessing Ships' Environmental Performance Using Machine Learning
    Skarlatos, Kyriakos
    Fousteris, Andreas
    Georgakellos, Dimitrios
    Economou, Polychronis
    Bersimis, Sotirios
    ENERGIES, 2023, 16 (06)
  • [28] Assessing cyber-incidents using machine learning
    Gore R.
    Diallo S.Y.
    Padilla J.
    Ezell B.
    Gore, Ross (ross.gore@gmail.com), 2018, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (10) : 341 - 360
  • [29] Learning temporal weights of clinical events using variable importance
    Zhao, Jing
    Henriksson, Aron
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2016, 16
  • [30] Learning temporal weights of clinical events using variable importance
    Jing Zhao
    Aron Henriksson
    BMC Medical Informatics and Decision Making, 16