Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data

被引:15
|
作者
Mohammed, Mohanad [1 ,2 ]
Mboya, Innocent B. [1 ,3 ]
Mwambi, Henry [1 ]
Elbashir, Murtada K. [4 ]
Omolo, Bernard [1 ,5 ,6 ]
机构
[1] Univ KwaZulu Natal, Sch Math Stat & Comp Sci, Pietermaritzburg, South Africa
[2] Univ Gezira, Fac Math & Comp Sci, Wad Madani, Sudan
[3] Kilimanjaro Christian Med Univ Coll KCMUCo, Dept Epidemiol & Biostat, Moshi, Tanzania
[4] Jouf Univ, Coll Comp & Informat Sci, Sakaka, Saudi Arabia
[5] Univ South Carolina Upstate, Div Math & Comp Sci, Spartanburg, SC USA
[6] Univ Witwatersrand, Fac Hlth Sci, Sch Publ Hlth, Johannesburg, South Africa
来源
PLOS ONE | 2021年 / 16卷 / 12期
关键词
MULTIPLE IMPUTATION; BIOMARKERS;
D O I
10.1371/journal.pone.0261625
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and find a robust signature for predicting CRC overall survival. We used stepwise regression to develop Cox PH model to analyse 54 common differentially expressed genes from three mutations. RSF is applied using log-rank and log-rank-score based on 5000 survival trees, and therefore, variables important obtained to find the genes that are most influential for CRC survival. We compared the predictive performance of the Cox PH model and RSF for early CRC detection and diagnosis. The results indicate that SLC9A8, IER5, ARSJ, ANKRD27, and PIPOX genes were significantly associated with the CRC overall survival. In addition, age, sex, and stages are also affecting the CRC overall survival. The RSF model using log-rank is better than log-rank-score, while log-rank-score needed more trees to stabilize. Overall, the imputation of missing values enhanced the model's predictive performance. In addition, Cox PH predictive performance was better than RSF.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Penalized semiparametric Cox regression model on XGBoost and random survival forests
    Wang, Yating
    Su, Jinxia
    Zhao, Xuejing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3095 - 3103
  • [2] NOVEL HEAD AND NECK CANCER SURVIVAL ANALYSIS APPROACH: RANDOM SURVIVAL FORESTS VERSUS COX PROPORTIONAL HAZARDS REGRESSION
    Datema, Frank R.
    Moya, Ana
    Krause, Peter
    Baeck, Thomas
    Willmes, Lars
    Langeveld, Ton
    de Jong, Robert J. Baatenburg
    Blom, Henk M.
    HEAD AND NECK-JOURNAL FOR THE SCIENCES AND SPECIALTIES OF THE HEAD AND NECK, 2012, 34 (01): : 50 - 58
  • [3] Analysis of censored survival data using random regression models
    Veerkamp, RF
    Brotherstone, S
    Engel, B
    Meuwissen, THE
    ANIMAL SCIENCE, 2001, 72 : 1 - 10
  • [4] Cox survival analysis of microarray gene expression data using correlation principal component regression
    Zhao, Qiang
    Sun, Jianguo
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2007, 6
  • [5] The comparisons of random survival forests and Cox regression analysis with simulation and an application related to breast cancer
    Omurlu, Imran Kurt
    Ture, Mevlut
    Tokatli, Fuesun
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 8582 - 8588
  • [6] Cox regression models with functional covariates for survival data
    Gellar, Jonathan E.
    Colantuoni, Elizabeth
    Needham, Dale M.
    Crainiceanu, Ciprian M.
    STATISTICAL MODELLING, 2015, 15 (03) : 256 - 278
  • [7] COX REGRESSION MODELS WITH NONPROPORTIONAL HAZARDS APPLIED TO LUNG CANCER SURVIVAL DATA
    Ata, Nihal
    Sozer, M. Tekin
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2007, 36 (02): : 157 - 167
  • [8] Establishment of survival models for primary prostate cancer and colorectal cancer based on the random survival forest
    Ma, Bingqing
    Chen, Biao
    Cai, Chengjun
    Zhang, Jinxiang
    ASIAN JOURNAL OF SURGERY, 2023, 46 (12) : 5787 - 5788
  • [9] Rectal Cancer Outcome Prediction Based On Institutional Data with Random Forests and Random Survival Forests
    Huang, M.
    Zhong, H.
    Liu, D.
    Gabriel, P.
    Ben-Josef, E.
    Yin, L.
    Geng, H.
    Cheng, C.
    Bilker, W.
    Xiao, Y.
    MEDICAL PHYSICS, 2017, 44 (06)
  • [10] The use of random forests regression for estimating prognosis with survival data
    Royston, P
    CONTROLLED CLINICAL TRIALS, 2003, 24 : 198S - 198S