Assessment of performance of survival prediction models for cancer prognosis

被引:54
|
作者
Chen, Hung-Chia [1 ]
Kodell, Ralph L. [3 ]
Cheng, Kuang Fu [2 ]
Chen, James J. [1 ,2 ]
机构
[1] US FDA, Natl Ctr Toxicol Res, Div Bioinformat & Biostat, Jefferson, AR 72079 USA
[2] China Med Univ, Sch Publ Hlth, Ctr Biostat, Taichung, Taiwan
[3] Univ Arkansas Med Sci, Dept Biostat, Little Rock, AR 72205 USA
来源
关键词
CELL-LUNG-CANCER; GENE-EXPRESSION SIGNATURES; HIGH-DIMENSIONAL DATA; CROSS-VALIDATION; BREAST-CANCER; RISK STRATIFICATION; MICROARRAY DATA; LYMPHOMA; CLASSIFIERS; RECURRENCE;
D O I
10.1186/1471-2288-12-102
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Cancer survival studies are commonly analyzed using survival-time prediction models for cancer prognosis. A number of different performance metrics are used to ascertain the concordance between the predicted risk score of each patient and the actual survival time, but these metrics can sometimes conflict. Alternatively, patients are sometimes divided into two classes according to a survival-time threshold, and binary classifiers are applied to predict each patient's class. Although this approach has several drawbacks, it does provide natural performance metrics such as positive and negative predictive values to enable unambiguous assessments. Methods: We compare the survival-time prediction and survival-time threshold approaches to analyzing cancer survival studies. We review and compare common performance metrics for the two approaches. We present new randomization tests and cross-validation methods to enable unambiguous statistical inferences for several performance metrics used with the survival-time prediction approach. We consider five survival prediction models consisting of one clinical model, two gene expression models, and two models from combinations of clinical and gene expression models. Results: A public breast cancer dataset was used to compare several performance metrics using five prediction models. 1) For some prediction models, the hazard ratio from fitting a Cox proportional hazards model was significant, but the two-group comparison was insignificant, and vice versa. 2) The randomization test and cross-validation were generally consistent with the p-values obtained from the standard performance metrics. 3) Binary classifiers highly depended on how the risk groups were defined; a slight change of the survival threshold for assignment of classes led to very different prediction results. Conclusions: 1) Different performance metrics for evaluation of a survival prediction model may give different conclusions in its discriminatory ability. 2) Evaluation using a high-risk versus low-risk group comparison depends on the selected risk-score threshold; a plot of p-values from all possible thresholds can show the sensitivity of the threshold selection. 3) A randomization test of the significance of Somers' rank correlation can be used for further evaluation of performance of a prediction model. 4) The cross-validated power of survival prediction models decreases as the training and test sets become less balanced.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Evaluating Dynamic Discrimination Performance of Risk Prediction Models for Survival Outcomes
    Jing Zhang
    Jing Ning
    Ruosha Li
    Statistics in Biosciences, 2023, 15 : 353 - 371
  • [22] Evaluating Dynamic Discrimination Performance of Risk Prediction Models for Survival Outcomes
    Zhang, Jing
    Ning, Jing
    Li, Ruosha
    STATISTICS IN BIOSCIENCES, 2023, 15 (02) : 353 - 371
  • [23] On the assessment of lumped parameter models for gear pump performance prediction
    Battarra, Mattia
    Mucchi, Emiliano
    SIMULATION MODELLING PRACTICE AND THEORY, 2020, 99
  • [24] Performance of prediction models on survival outcomes of colorectal cancer with surgical resection: A systematic review and meta-analysis
    He, Yazhou
    Ong, Yuhan
    Li, Xue
    Din, Farhat V. N.
    Brown, Ewan
    Timofeeva, Maria
    Wang, Ziqiang
    Farrington, Susan M.
    Campbell, Harry
    Dunlop, Malcolm G.
    Theodoratou, Evropi
    SURGICAL ONCOLOGY-OXFORD, 2019, 29 : 196 - 202
  • [25] Performance Assessment of Machine Learning Based Models for Diabetes Prediction
    Deo, Ridhi
    Panigrahi, Suranjan
    2019 IEEE HEALTHCARE INNOVATIONS AND POINT OF CARE TECHNOLOGIES (HI-POCT), 2019, : 147 - 150
  • [26] Prognosis of survival in advanced cancer
    de Arriba Mendez, J. J.
    REVISTA CLINICA ESPANOLA, 2007, 207 (07): : 348 - 351
  • [27] Performance of clinician prediction of survival in oncology outpatients with advanced cancer
    Kim, Yu Jung
    Yoon, Seok Joon
    Suh, Sang-Yeon
    Hiratsuka, Yusuke
    Kang, Beodeul
    Lee, Si Won
    Ahn, Hong-Yup
    Suh, Koung Jin
    Kim, Ji-Won
    Kim, Se Hyun
    Kim, Jin Won
    Lee, Keun-Wook
    Kim, Jee Hyun
    Lee, Jong Seok
    PLOS ONE, 2022, 17 (04):
  • [28] Comparative analysis on Prediction Models with various Data Preprocessings in the Prognosis of Cervical Cancer
    Abdullah
    Bin Ashraf, Faisal
    Momo, Nusrat Suzana
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [29] Prediction of Pancreatic Cancer Survival through Automated Selection of Predictive Models
    Floyd, Stuart
    Ruiz, Carolina
    Alvarez, Sergio A.
    Tseng, Jennifer
    Whalen, Giles
    BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, 2011, 127 : 29 - +
  • [30] Construction of a New Ferroptosis-related Prognosis Model for Survival Prediction in Colorectal Cancer
    Chen, Lin
    Ge, Mengxiao
    Mo, Shaocong
    Shi, Menglin
    Zhang, Jun
    Liu, Jie
    CURRENT MEDICINAL CHEMISTRY, 2024,