Assessment of performance of survival prediction models for cancer prognosis

被引:54
|
作者
Chen, Hung-Chia [1 ]
Kodell, Ralph L. [3 ]
Cheng, Kuang Fu [2 ]
Chen, James J. [1 ,2 ]
机构
[1] US FDA, Natl Ctr Toxicol Res, Div Bioinformat & Biostat, Jefferson, AR 72079 USA
[2] China Med Univ, Sch Publ Hlth, Ctr Biostat, Taichung, Taiwan
[3] Univ Arkansas Med Sci, Dept Biostat, Little Rock, AR 72205 USA
来源
关键词
CELL-LUNG-CANCER; GENE-EXPRESSION SIGNATURES; HIGH-DIMENSIONAL DATA; CROSS-VALIDATION; BREAST-CANCER; RISK STRATIFICATION; MICROARRAY DATA; LYMPHOMA; CLASSIFIERS; RECURRENCE;
D O I
10.1186/1471-2288-12-102
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Cancer survival studies are commonly analyzed using survival-time prediction models for cancer prognosis. A number of different performance metrics are used to ascertain the concordance between the predicted risk score of each patient and the actual survival time, but these metrics can sometimes conflict. Alternatively, patients are sometimes divided into two classes according to a survival-time threshold, and binary classifiers are applied to predict each patient's class. Although this approach has several drawbacks, it does provide natural performance metrics such as positive and negative predictive values to enable unambiguous assessments. Methods: We compare the survival-time prediction and survival-time threshold approaches to analyzing cancer survival studies. We review and compare common performance metrics for the two approaches. We present new randomization tests and cross-validation methods to enable unambiguous statistical inferences for several performance metrics used with the survival-time prediction approach. We consider five survival prediction models consisting of one clinical model, two gene expression models, and two models from combinations of clinical and gene expression models. Results: A public breast cancer dataset was used to compare several performance metrics using five prediction models. 1) For some prediction models, the hazard ratio from fitting a Cox proportional hazards model was significant, but the two-group comparison was insignificant, and vice versa. 2) The randomization test and cross-validation were generally consistent with the p-values obtained from the standard performance metrics. 3) Binary classifiers highly depended on how the risk groups were defined; a slight change of the survival threshold for assignment of classes led to very different prediction results. Conclusions: 1) Different performance metrics for evaluation of a survival prediction model may give different conclusions in its discriminatory ability. 2) Evaluation using a high-risk versus low-risk group comparison depends on the selected risk-score threshold; a plot of p-values from all possible thresholds can show the sensitivity of the threshold selection. 3) A randomization test of the significance of Somers' rank correlation can be used for further evaluation of performance of a prediction model. 4) The cross-validated power of survival prediction models decreases as the training and test sets become less balanced.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Online performance and proactive maintenance assessment of data driven prediction models
    Shen, Yingjun
    Wang, Taohong
    Song, Zhe
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (08) : 3959 - 3993
  • [42] Performance of Hazard Prediction and Assessment Capability Urban Models for the UDINEE Project
    Miner, Sean
    Mazzola, Thomas
    Herring, Steven
    Fry, Richard
    Meris, Ronald
    BOUNDARY-LAYER METEOROLOGY, 2019, 171 (03) : 423 - 437
  • [43] Performance of Hazard Prediction and Assessment Capability Urban Models for the UDINEE Project
    Sean Miner
    Thomas Mazzola
    Steven Herring
    Richard Fry
    Ronald Meris
    Boundary-Layer Meteorology, 2019, 171 : 423 - 437
  • [44] Superior prognosis prediction performance of deep learning for gastric cancer compared to Yonsei prognosis prediction model using Cox regression.
    Hyung, Woo Jin
    Son, Taeil
    Park, Minseok
    Lee, Hansang
    Kim, Youn Nam
    Kim, Hyoung-Il
    Kim, Jong Won
    Cheong, Jae-Ho
    Choi, Seung Ho
    Noh, Sung Hoon
    Kim, Junmo
    JOURNAL OF CLINICAL ONCOLOGY, 2017, 35 (04)
  • [45] Endometrial cancer prognosis prediction using correlation models based on CDK family genes
    Gu, Xianhua
    Shen, Honghong
    Bai, Wenqi
    Xiang, Zheng
    Li, Xinwei
    Zhang, Rong
    Shi, Fan
    Li, Huiyuan
    Zhu, Guangzheng
    Guo, Suyang
    FRONTIERS IN GENETICS, 2022, 13
  • [46] Editorial: Clinically prediction models for gastrointestinal cancer diagnosis and prognosis in the era of precision oncology
    Gao, Yinyan
    Wu, Irene X. Y.
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [47] Survival and prognosis for cancer of the submandibular gland
    Bhattacharyya, N
    JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 2004, 62 (04) : 427 - 430
  • [49] Prognostic Prediction Models for Liver Metastasis and Overall Survival in Colorectal Cancer Patients
    Miyoshi, Norikatsu
    Ohue, Masayuki
    Yasui, Masayoshi
    Takahashi, Yusuke
    Fujino, Shiki
    Wada, Yuma
    Sugimura, Keijiro
    Tomokuni, Akira
    Akita, Hirofumi
    Kobayashi, Shogo
    Takahashi, Hidenori
    Omori, Takeshi
    Miyata, Hiroshi
    Yano, Masahiko
    INTERNATIONAL SURGERY, 2021, 105 (1-3) : 442 - 448
  • [50] HFBSurv: hierarchical multimodal fusion with factorized bilinear models for cancer survival prediction
    Li, Ruiqing
    Wu, Xingqi
    Li, Ao
    Wang, Minghui
    BIOINFORMATICS, 2022, 38 (09) : 2587 - 2594