Individual risk prediction: Comparing random forests with Cox proportional-hazards model by a simulation study

被引:6
|
作者
Baralou, Valia [1 ]
Kalpourtzi, Natasa [1 ]
Touloumi, Giota [1 ]
机构
[1] Natl & Kapodistrian Univ Athens, Med Sch, Dept Hyg Epidemiol & Med Stat, Athens 11527, Greece
关键词
Cox model; machine learning; random survival forest; survival analysis; RANDOM SURVIVAL FORESTS; CARDIOVASCULAR-DISEASE; LIFE-STYLE; REGRESSION; SCORE;
D O I
10.1002/bimj.202100380
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
With big data becoming widely available in healthcare, machine learning algorithms such as random forest (RF) that ignores time-to-event information and random survival forest (RSF) that handles right-censored data are used for individual risk prediction alternatively to the Cox proportional hazards (Cox-PH) model. We aimed to systematically compare RF and RSF with Cox-PH. RSF with three split criteria [log-rank (RSF-LR), log-rank score (RSF-LRS), maximally selected rank statistics (RSF-MSR)]; RF, Cox-PH, and Cox-PH with splines (Cox-S) were evaluated through a simulation study based on real data. One hundred eighty scenarios were investigated assuming different associations between the predictors and the outcome (linear/linear and interactions/nonlinear/nonlinear and interactions), training sample sizes (500/1000/5000), censoring rates (50%/75%/93%), hazard functions (increasing/decreasing/constant), and number of predictors (seven, 15 including noise variables). Methods' performance was evaluated with time-dependent area under curve and integrated Brier score. In all scenarios, RF had the worst performance. In scenarios with a low number of events (<= 70), Cox-PH was at least noninferior to RSF, whereas under linearity assumption it outperformed RSF. Under the presence of interactions, RSF performed better than Cox-PH as the number of events increased whereas Cox-S reached at least similar performance with RSF under nonlinear effects. RSF-LRS performed slightly worse than RSF-LR and RSF-MSR when including noise variables and interaction effects. When applied to real data, models incorporating survival time performed better. Although RSF algorithms are a promising alternative to conventional Cox-PH as data complexity increases, they require a higher number of events for training. In time-to-event analysis, algorithms that consider survival time should be used.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A Simulation Study Comparing Two Methods of Handling Missing Covariate Values when Fitting a Cox Proportional-Hazards Regression Model
    Satty, Ali
    [J]. STATISTIKA-STATISTICS AND ECONOMY JOURNAL, 2014, 94 (01) : 64 - 72
  • [2] SEQUENTIAL-METHODS FOR COX PROPORTIONAL-HAZARDS MODEL
    JENNISON, C
    TURNBULL, BW
    [J]. BIOMETRICS, 1982, 38 (04) : 1111 - 1111
  • [3] Time-dependent covariates in the Cox proportional-hazards regression model
    Fisher, LD
    Lin, DY
    [J]. ANNUAL REVIEW OF PUBLIC HEALTH, 1999, 20 : 145 - 157
  • [4] Jackknifed random weighting for Cox proportional hazards model
    LI Xiao 1
    2 Department of Finance and Statistics
    [J]. Science China Mathematics, 2012, 55 (04) : 770 - 781
  • [5] Jackknifed random weighting for Cox proportional hazards model
    Li Xiao
    Wu YaoHua
    Tu DongSheng
    [J]. SCIENCE CHINA-MATHEMATICS, 2012, 55 (04) : 775 - 786
  • [6] Jackknifed random weighting for Cox proportional hazards model
    Xiao Li
    YaoHua Wu
    DongSheng Tu
    [J]. Science China Mathematics, 2012, 55 : 775 - 786
  • [7] USE OF JACKKNIFE TECHNIQUES IN DETERMINING A BEST SUBSET OF COVARIATES FOR THE COX PROPORTIONAL-HAZARDS MODEL
    FAGERSTROM, R
    SMITH, H
    SACKS, H
    COHEN, B
    [J]. CONTROLLED CLINICAL TRIALS, 1981, 2 (01): : 88 - 88
  • [8] Random weighting method for Cox’s proportional hazards model
    WenQuan Cui
    Kai Li
    YaNing Yang
    YueHua Wu
    [J]. Science in China Series A: Mathematics, 2008, 51 : 1843 - 1854
  • [9] Random weighting method for Cox’s proportional hazards model
    CUI WenQuan1
    [J]. Science China Mathematics, 2008, (10) : 1843 - 1854
  • [10] A Cox Proportional-Hazards Model Based on an Improved Aquila Optimizer with Whale Optimization Algorithm Operators
    Ewees, Ahmed A.
    Algamal, Zakariya Yahya
    Abualigah, Laith
    Al-qaness, Mohammed A. A.
    Yousri, Dalia
    Ghoniem, Rania M.
    Abd Elaziz, Mohamed
    [J]. MATHEMATICS, 2022, 10 (08)