Assessment of performance of survival prediction models for cancer prognosis

被引:54
|
作者
Chen, Hung-Chia [1 ]
Kodell, Ralph L. [3 ]
Cheng, Kuang Fu [2 ]
Chen, James J. [1 ,2 ]
机构
[1] US FDA, Natl Ctr Toxicol Res, Div Bioinformat & Biostat, Jefferson, AR 72079 USA
[2] China Med Univ, Sch Publ Hlth, Ctr Biostat, Taichung, Taiwan
[3] Univ Arkansas Med Sci, Dept Biostat, Little Rock, AR 72205 USA
来源
关键词
CELL-LUNG-CANCER; GENE-EXPRESSION SIGNATURES; HIGH-DIMENSIONAL DATA; CROSS-VALIDATION; BREAST-CANCER; RISK STRATIFICATION; MICROARRAY DATA; LYMPHOMA; CLASSIFIERS; RECURRENCE;
D O I
10.1186/1471-2288-12-102
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Cancer survival studies are commonly analyzed using survival-time prediction models for cancer prognosis. A number of different performance metrics are used to ascertain the concordance between the predicted risk score of each patient and the actual survival time, but these metrics can sometimes conflict. Alternatively, patients are sometimes divided into two classes according to a survival-time threshold, and binary classifiers are applied to predict each patient's class. Although this approach has several drawbacks, it does provide natural performance metrics such as positive and negative predictive values to enable unambiguous assessments. Methods: We compare the survival-time prediction and survival-time threshold approaches to analyzing cancer survival studies. We review and compare common performance metrics for the two approaches. We present new randomization tests and cross-validation methods to enable unambiguous statistical inferences for several performance metrics used with the survival-time prediction approach. We consider five survival prediction models consisting of one clinical model, two gene expression models, and two models from combinations of clinical and gene expression models. Results: A public breast cancer dataset was used to compare several performance metrics using five prediction models. 1) For some prediction models, the hazard ratio from fitting a Cox proportional hazards model was significant, but the two-group comparison was insignificant, and vice versa. 2) The randomization test and cross-validation were generally consistent with the p-values obtained from the standard performance metrics. 3) Binary classifiers highly depended on how the risk groups were defined; a slight change of the survival threshold for assignment of classes led to very different prediction results. Conclusions: 1) Different performance metrics for evaluation of a survival prediction model may give different conclusions in its discriminatory ability. 2) Evaluation using a high-risk versus low-risk group comparison depends on the selected risk-score threshold; a plot of p-values from all possible thresholds can show the sensitivity of the threshold selection. 3) A randomization test of the significance of Somers' rank correlation can be used for further evaluation of performance of a prediction model. 4) The cross-validated power of survival prediction models decreases as the training and test sets become less balanced.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Assessment of performance of survival prediction models for cancer prognosis
    Hung-Chia Chen
    Ralph L Kodell
    Kuang Fu Cheng
    James J Chen
    BMC Medical Research Methodology, 12
  • [2] Application of statistical models based on survival analysis in the assessment of cancer prognosis
    Zhang X.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [3] Assessment of survival prediction models based on microarray data
    Schumacher, Martin
    Binder, Harald
    Gerds, Thomas
    BIOINFORMATICS, 2007, 23 (14) : 1768 - 1774
  • [4] Prediction of Pavement Performance Using Multistate Survival Models
    Altarabsheh, Ahmad
    Altarabsheh, Rawan
    Altarabsheh, Sara
    Asi, Ibrahim
    JOURNAL OF TRANSPORTATION ENGINEERING PART B-PAVEMENTS, 2021, 147 (01)
  • [5] Prediction models for breast cancer prognosis among Asian women
    Fan, Run
    Chen, Yufan
    Nechuta, Sarah
    Cai, Hui
    Gu, Kai
    Shi, Liang
    Bao, Pingping
    Shyr, Yu
    Shu, Xiao-Ou
    Ye, Fei
    CANCER, 2021, 127 (11) : 1758 - 1769
  • [6] Machine learning models in breast cancer survival prediction
    Montazeri, Mitra
    Montazeri, Mohadeseh
    Montazeri, Mahdieh
    Beigzadeh, Amin
    TECHNOLOGY AND HEALTH CARE, 2016, 24 (01) : 31 - 42
  • [7] Spine metastasis in patients with prostate cancer: Survival prognosis assessment
    Amelot, Aymeric
    Terrier, Louis-Marie
    Le Nail, Louis-Romee
    Cristini, Joseph
    Cook, Ann-Rose
    Buffenoir, Kevin
    Pascal-Moussellard, Hugues
    Carpentier, Alexandre
    Dubory, Arnaud
    Mathon, Bertrand
    PROSTATE, 2021, 81 (02): : 91 - 101
  • [8] Machine learning-based prediction of survival prognosis in cervical cancer
    Dongyan Ding
    Tingyuan Lang
    Dongling Zou
    Jiawei Tan
    Jia Chen
    Lei Zhou
    Dong Wang
    Rong Li
    Yunzhe Li
    Jingshu Liu
    Cui Ma
    Qi Zhou
    BMC Bioinformatics, 22
  • [9] Assessment of tumour hypoxia for prediction of response to therapy and cancer prognosis
    Jubb, Adrian M.
    Buffa, Francesca M.
    Harris, Adrian L.
    JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2010, 14 (1-2) : 18 - 29
  • [10] Machine learning-based prediction of survival prognosis in cervical cancer
    Ding, Dongyan
    Lang, Tingyuan
    Zou, Dongling
    Tan, Jiawei
    Chen, Jia
    Zhou, Lei
    Wang, Dong
    Li, Rong
    Li, Yunzhe
    Liu, Jingshu
    Ma, Cui
    Zhou, Qi
    BMC BIOINFORMATICS, 2021, 22 (01)