NOVEL HEAD AND NECK CANCER SURVIVAL ANALYSIS APPROACH: RANDOM SURVIVAL FORESTS VERSUS COX PROPORTIONAL HAZARDS REGRESSION

被引:50
|
作者
Datema, Frank R. [1 ]
Moya, Ana [2 ]
Krause, Peter [3 ]
Baeck, Thomas [3 ]
Willmes, Lars [4 ]
Langeveld, Ton [5 ]
de Jong, Robert J. Baatenburg [1 ]
Blom, Henk M. [6 ]
机构
[1] Erasmus MC, Dept Otorhinolaryngol Head & Neck Surg, Rotterdam, Netherlands
[2] WAZ Mediengrp GmbH, Essen, Germany
[3] Div Intelligent Solut GmbH, Dortmund, Germany
[4] Intul Solut GmbH, Dortmund, Germany
[5] Leiden Univ, Dept Otorhinolaryngol Head & Neck Surg, Med Ctr, NL-2300 RA Leiden, Netherlands
[6] Haga Ziekenhuis, Dept Otorhinolaryngol, The Hague, Netherlands
关键词
Random survival forests; Cox regression; squamous cell carcinoma; survival prediction; relative importance; CLINICAL BIOSTATISTICS; COMORBIDITY; IMPACT;
D O I
10.1002/hed.21698
中图分类号
R76 [耳鼻咽喉科学];
学科分类号
100213 ;
摘要
Background. Electronic patient files generate an enormous amount of medical data. These data can be used for research, such as prognostic modeling. Automatization of statistical prognostication processes allows automatic updating of models when new data is gathered. The increase of power behind an automated prognostic model makes its predictive capability more reliable. Cox proportional hazard regression is most frequently used in prognostication. Automatization of a Cox model is possible, but we expect the updating process to be time-consuming. A possible solution lies in an alternative modeling technique called random survival forests (RSFs). RSF is easily automated and is known to handle the proportionality assumption coherently and automatically. Performance of RSF has not yet been tested on a large head and neck oncological dataset. This study investigates performance of head and neck overall survival of RSF models. Performances are compared to a Cox model as the "gold standard." RSF might be an interesting alternative modeling approach for automatization when performances are similar. Methods. RSF models were created in R (Cox also in SPSS). Four RSF splitting rules were used: log-rank, conservation of events, log-rank score, and log-rank approximation. Models were based on historical data of 1371 patients with primary head-and-neck cancer, diagnosed between 1981 and 1998. Models contain 8 covariates: tumor site, T classification, N classification, M classification, age, sex, prior malignancies, and comorbidity. Model performances were determined by Harrell's concordance error rate, in which 33% of the original data served as a validation sample. Results. RSF and Cox models delivered similar error rates. The Cox model performed slightly better (error rate, 0.2826). The log-rank splitting approach gave the best RSF performance (error rate, 0.2873). In accord with Cox and RSF models, high T classification, high N classification, and severe comorbidity are very important covariates in the model, whereas sex, mild comorbidity, and a supraglottic larynx tumor are less important. A discrepancy arose regarding the importance of M1 classification (see Discussion). Conclusion. Both approaches delivered similar error rates. The Cox model gives a clinically understandable output on covariate impact, whereas RSF becomes more of a "black box." RSF complements the Cox model by giving more insight and confidence toward relative importance of model covariates. RSF can be recommended as the approach of choice in automating survival analyses. (C) 2011 Wiley Periodicals, Inc. Head Neck 34: 50-58, 2012
引用
收藏
页码:50 / 58
页数:9
相关论文
共 50 条
  • [1] Survival analysis: Cox proportional hazards model
    Jeyaseelan, L
    Walter, SD
    Shankar, V
    John, GT
    NATIONAL MEDICAL JOURNAL OF INDIA, 1999, 12 (05): : 230 - 233
  • [2] A general framework for random effects survival analysis in the cox proportional hazards setting
    Sargent, DJ
    BIOMETRICS, 1998, 54 (04) : 1486 - 1497
  • [3] The comparisons of random survival forests and Cox regression analysis with simulation and an application related to breast cancer
    Omurlu, Imran Kurt
    Ture, Mevlut
    Tokatli, Fuesun
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 8582 - 8588
  • [4] Head and Neck Cancer Survival Outcome Prediction Based On NRG Oncology RTOG 0522 with Random Forests and Random Survival Forests
    Huang, M.
    Cheng, C.
    Geng, H.
    Zhong, H.
    Wang, J.
    Lin, A.
    Guttmann, D.
    van Soest, J.
    Dekker, A.
    Bilker, W.
    Zhang, Z.
    Rosenthal, D.
    Axelrod, R.
    Galvin, J.
    Frank, S.
    Thorstad, W.
    Huth, B.
    Hsu, A.
    Trotti, A.
    Zhang, Q.
    Xiao, Y.
    MEDICAL PHYSICS, 2017, 44 (06)
  • [5] Survival analysis—part 2: Cox proportional hazards model
    Salil Vasudeo Deo
    Vaishali Deo
    Varun Sundaram
    Indian Journal of Thoracic and Cardiovascular Surgery, 2021, 37 : 229 - 233
  • [6] Predicting Postoperative Lung Cancer Recurrence and Survival Using Cox Proportional Hazards Regression and Machine Learning
    Pu, Lucy
    Dhupar, Rajeev
    Meng, Xin
    CANCERS, 2025, 17 (01)
  • [7] Penalized semiparametric Cox regression model on XGBoost and random survival forests
    Wang, Yating
    Su, Jinxia
    Zhao, Xuejing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3095 - 3103
  • [8] High-dimensional, massive sample-size Cox proportional hazards regression for survival analysis
    Mittal, Sushil
    Madigan, David
    Burd, Randall S.
    Suchard, Marc A.
    BIOSTATISTICS, 2014, 15 (02) : 207 - 221
  • [9] Utilizing shared frailty with the Cox proportional hazards regression: Post discharge survival analysis of CHF patients
    Ben-Assuli, Ofir
    Ramon-Gonen, Roni
    Heart, Tsipi
    Jacobi, Arie
    Klempfner, Robert
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 140
  • [10] Survival analysis under the Cox proportional hazards model with pooled covariates
    Saha-Chaudhuri, Paramita
    Juwara, Lamin
    STATISTICS IN MEDICINE, 2021, 40 (04) : 998 - 1020