NOVEL HEAD AND NECK CANCER SURVIVAL ANALYSIS APPROACH: RANDOM SURVIVAL FORESTS VERSUS COX PROPORTIONAL HAZARDS REGRESSION

被引:50
|
作者
Datema, Frank R. [1 ]
Moya, Ana [2 ]
Krause, Peter [3 ]
Baeck, Thomas [3 ]
Willmes, Lars [4 ]
Langeveld, Ton [5 ]
de Jong, Robert J. Baatenburg [1 ]
Blom, Henk M. [6 ]
机构
[1] Erasmus MC, Dept Otorhinolaryngol Head & Neck Surg, Rotterdam, Netherlands
[2] WAZ Mediengrp GmbH, Essen, Germany
[3] Div Intelligent Solut GmbH, Dortmund, Germany
[4] Intul Solut GmbH, Dortmund, Germany
[5] Leiden Univ, Dept Otorhinolaryngol Head & Neck Surg, Med Ctr, NL-2300 RA Leiden, Netherlands
[6] Haga Ziekenhuis, Dept Otorhinolaryngol, The Hague, Netherlands
关键词
Random survival forests; Cox regression; squamous cell carcinoma; survival prediction; relative importance; CLINICAL BIOSTATISTICS; COMORBIDITY; IMPACT;
D O I
10.1002/hed.21698
中图分类号
R76 [耳鼻咽喉科学];
学科分类号
100213 ;
摘要
Background. Electronic patient files generate an enormous amount of medical data. These data can be used for research, such as prognostic modeling. Automatization of statistical prognostication processes allows automatic updating of models when new data is gathered. The increase of power behind an automated prognostic model makes its predictive capability more reliable. Cox proportional hazard regression is most frequently used in prognostication. Automatization of a Cox model is possible, but we expect the updating process to be time-consuming. A possible solution lies in an alternative modeling technique called random survival forests (RSFs). RSF is easily automated and is known to handle the proportionality assumption coherently and automatically. Performance of RSF has not yet been tested on a large head and neck oncological dataset. This study investigates performance of head and neck overall survival of RSF models. Performances are compared to a Cox model as the "gold standard." RSF might be an interesting alternative modeling approach for automatization when performances are similar. Methods. RSF models were created in R (Cox also in SPSS). Four RSF splitting rules were used: log-rank, conservation of events, log-rank score, and log-rank approximation. Models were based on historical data of 1371 patients with primary head-and-neck cancer, diagnosed between 1981 and 1998. Models contain 8 covariates: tumor site, T classification, N classification, M classification, age, sex, prior malignancies, and comorbidity. Model performances were determined by Harrell's concordance error rate, in which 33% of the original data served as a validation sample. Results. RSF and Cox models delivered similar error rates. The Cox model performed slightly better (error rate, 0.2826). The log-rank splitting approach gave the best RSF performance (error rate, 0.2873). In accord with Cox and RSF models, high T classification, high N classification, and severe comorbidity are very important covariates in the model, whereas sex, mild comorbidity, and a supraglottic larynx tumor are less important. A discrepancy arose regarding the importance of M1 classification (see Discussion). Conclusion. Both approaches delivered similar error rates. The Cox model gives a clinically understandable output on covariate impact, whereas RSF becomes more of a "black box." RSF complements the Cox model by giving more insight and confidence toward relative importance of model covariates. RSF can be recommended as the approach of choice in automating survival analyses. (C) 2011 Wiley Periodicals, Inc. Head Neck 34: 50-58, 2012
引用
收藏
页码:50 / 58
页数:9
相关论文
共 50 条
  • [31] Proportional hazards and artificial neural network models to predict survival in patients undergoing resection of head and neck squamous cell cancer.
    Medow, MA
    Weed, HG
    JOURNAL OF INVESTIGATIVE MEDICINE, 1998, 46 (07) : 279A - 279A
  • [32] Proportional Hazards Regression for the Analysis of Clustered Survival Data from Case-Cohort Studies
    Zhang, Hui
    Schaubel, Douglas E.
    Kalbfleisch, John D.
    BIOMETRICS, 2011, 67 (01) : 18 - 28
  • [33] A real survival analysis application via variable selection methods for Cox's proportional hazards model
    Androulakis, Emmanouil
    Koukouvinos, Christos
    Mylona, Kalliopi
    Vonta, Filia
    JOURNAL OF APPLIED STATISTICS, 2010, 37 (08) : 1399 - 1406
  • [34] A TRANSCRIPTOME ANALYSIS BY LASSO PENALIZED COX REGRESSION FOR PANCREATIC CANCER SURVIVAL
    Wu, Tong Tong
    Gong, Haijun
    Clarke, Edmund M.
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2011, 9 : 63 - 73
  • [35] DC-COX: Data collaboration Cox proportional hazards model for privacy-preserving survival analysis on multiple parties
    Imakura, Akira
    Tsunoda, Ryoya
    Kagawa, Rina
    Yamagata, Kunihiro
    Sakurai, Tetsuya
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 137
  • [36] The prognostic value of machine learning techniques versus cox regression model for head and neck cancer
    Peng, Jiajia
    Lu, Yongmei
    Chen, Li
    Qiu, Ke
    Chen, Fei
    Liu, Jun
    Xu, Wei
    Zhang, Wei
    Zhao, Yu
    Yu, Zhonghua
    Ren, Jianjun
    METHODS, 2022, 205 : 123 - 132
  • [37] Automated sarcopenia assessment in the neck and survival analysis in head and neck cancer patients
    Warr, H.
    Murray, O.
    McSweeney, D.
    McWilliam, A.
    Green, A.
    RADIOTHERAPY AND ONCOLOGY, 2021, 161 : S614 - S615
  • [38] A Bayesian Approach to Sparse Cox Regression in High-Dimentional Survival Analysis
    Krasotkina, Olga
    Mottl, Vadim
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2015, 2015, 9166 : 425 - 437
  • [39] An online framework for survival analysis: reframing Cox proportional hazards model for large data sets and neural networks
    Tarkhan, Aliasghar
    Simon, Noah
    BIOSTATISTICS, 2023, 25 (01) : 134 - 153
  • [40] APPLICATION OF THE PRINCIPAL COMPONENTS METHOD AND THE PROPORTIONAL HAZARDS REGRESSION-MODEL TO ANALYSIS OF SURVIVAL-DATA
    DANIELYAN, SA
    ZHARINOV, GM
    OSIPOVA, TT
    BIOMETRICAL JOURNAL, 1986, 28 (01) : 73 - 79