Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data

被引:15
|
作者
Mohammed, Mohanad [1 ,2 ]
Mboya, Innocent B. [1 ,3 ]
Mwambi, Henry [1 ]
Elbashir, Murtada K. [4 ]
Omolo, Bernard [1 ,5 ,6 ]
机构
[1] Univ KwaZulu Natal, Sch Math Stat & Comp Sci, Pietermaritzburg, South Africa
[2] Univ Gezira, Fac Math & Comp Sci, Wad Madani, Sudan
[3] Kilimanjaro Christian Med Univ Coll KCMUCo, Dept Epidemiol & Biostat, Moshi, Tanzania
[4] Jouf Univ, Coll Comp & Informat Sci, Sakaka, Saudi Arabia
[5] Univ South Carolina Upstate, Div Math & Comp Sci, Spartanburg, SC USA
[6] Univ Witwatersrand, Fac Hlth Sci, Sch Publ Hlth, Johannesburg, South Africa
来源
PLOS ONE | 2021年 / 16卷 / 12期
关键词
MULTIPLE IMPUTATION; BIOMARKERS;
D O I
10.1371/journal.pone.0261625
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and find a robust signature for predicting CRC overall survival. We used stepwise regression to develop Cox PH model to analyse 54 common differentially expressed genes from three mutations. RSF is applied using log-rank and log-rank-score based on 5000 survival trees, and therefore, variables important obtained to find the genes that are most influential for CRC survival. We compared the predictive performance of the Cox PH model and RSF for early CRC detection and diagnosis. The results indicate that SLC9A8, IER5, ARSJ, ANKRD27, and PIPOX genes were significantly associated with the CRC overall survival. In addition, age, sex, and stages are also affecting the CRC overall survival. The RSF model using log-rank is better than log-rank-score, while log-rank-score needed more trees to stabilize. Overall, the imputation of missing values enhanced the model's predictive performance. In addition, Cox PH predictive performance was better than RSF.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Assessment of the fitness of Cox and parametric regression models of survival distribution for Iranian breast cancer patients' data
    Mohseny, Maryam
    Shekarriz-Foumani, Reza
    Amiri, Parastoo
    Vejdani, Marjan
    Farshidmehr, Pezhman
    Mahmoudabadi, Hossein Zabihi
    Amanpour, Farzaneh
    Mohaghegh, Pegah
    Tajdini, Farzad
    Sayarifard, Azadeh
    Davoudi-Monfared, Esmat
    JOURNAL OF ADVANCED PHARMACEUTICAL TECHNOLOGY & RESEARCH, 2019, 10 (01) : 39 - 44
  • [22] Head and Neck Cancer Survival Outcome Prediction Based On NRG Oncology RTOG 0522 with Random Forests and Random Survival Forests
    Huang, M.
    Cheng, C.
    Geng, H.
    Zhong, H.
    Wang, J.
    Lin, A.
    Guttmann, D.
    van Soest, J.
    Dekker, A.
    Bilker, W.
    Zhang, Z.
    Rosenthal, D.
    Axelrod, R.
    Galvin, J.
    Frank, S.
    Thorstad, W.
    Huth, B.
    Hsu, A.
    Trotti, A.
    Zhang, Q.
    Xiao, Y.
    MEDICAL PHYSICS, 2017, 44 (06)
  • [23] Nonparametric binary regression models with spherical predictors based on the random forests kernel
    Qin, Xu
    Gao, Huiqun
    COMPUTATIONAL STATISTICS, 2024, 39 (06) : 3031 - 3048
  • [24] Cox Models Survival Analysis Based on Breast Cancer Treatments
    Abadi, Alireza
    Yavari, Parvin
    Dehghani-Arani, Monireh
    Alavi-Majd, Hamid
    Ghasemi, Erfan
    Amanpour, Farzaneh
    Bajdik, Chris
    IRANIAN JOURNAL OF CANCER PREVENTION, 2014, 7 (03) : 124 - 129
  • [25] Predicting survival outcomes in ovarian cancer using gene expression data
    Ahn, TaeJin
    Kang, Nayeon
    Kim, Yonggab
    Kim, Se Ik
    Song, Yong-Sang
    Park, Taesung
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 21 (04) : 339 - 351
  • [26] SURVIVAL ANALYSIS WITH COX REGRESSION MODELS: VALIDATING A WEB-BASED CALCULATOR
    McGhan, W. F.
    Willey, V. J.
    Zaveri, V
    VALUE IN HEALTH, 2010, 13 (07) : A551 - A552
  • [27] Tumor COX2 expression does not affect colorectal cancer survival
    Rebecca Doherty
    Nature Clinical Practice Oncology, 2005, 2 (10): : 485 - 485
  • [28] Survival analysis for lung cancer patients: A comparison of Cox regression and machine learning models
    Germer, Sebastian
    Rudolph, Christiane
    Labohm, Louisa
    Katalinic, Alexander
    Rath, Natalie
    Rausch, Katharina
    Holleczek, Bernd
    Handels, Heinz
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2024, 191
  • [29] Comparing Individualized Survival Predictions From Random Survival Forests and Multistate Models in the Presence of Missing Data: A Case Study of Patients With Oropharyngeal Cancer
    Abbott, Madeline R.
    Beesley, Lauren J.
    Bellile, Emily L.
    Shuman, Andrew G.
    Rozek, Laura S.
    Taylor, Jeremy M. G.
    CANCER INFORMATICS, 2023, 22
  • [30] Comparing Individualized Survival Predictions From Random Survival Forests and Multistate Models in the Presence of Missing Data: A Case Study of Patients With Oropharyngeal Cancer
    Abbott, Madeline R.
    Beesley, Lauren J.
    Bellile, Emily L.
    Shuman, Andrew G.
    Rozek, Laura S.
    Taylor, Jeremy M. G.
    CANCER INFORMATICS, 2023, 22