A Novel Cox Proportional Hazards Model for High-Dimensional Genomic Data in Cancer Prognosis

Cited by: 0
Authors
Huang, Hai-Hui [1, 2]
Liang, Yong [1, 2]
Affiliations
[1] Macau Univ Sci & Technol, Fac Informat Technol, Macau 999078, Peoples R China
[2] Macau Univ Sci & Technol, State Key Lab Qual Res Chinese Med, Macau 999078, Peoples R China
Keywords
Biological system modeling; Predictive models; Hazards; Mathematical model; Adaptation models; Computational modeling; Genomics; Cox model; regularization; variable selection; gene expression; VARIABLE SELECTION; GENE-EXPRESSION; BREAST-CANCER; REGULARIZATION; REGRESSION; SURVIVAL; LASSO;
DOI
10.1109/TCBB.2019.2961667
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
The Cox proportional hazards model is a popular method for studying the relationship between features and survival time. Because genomic data are high-dimensional, existing Cox models trained on a specific dataset often generalize poorly to other, independent datasets. In this paper, we propose a novel strategy for the Cox model. The strategy comprises a new learning technique, self-paced learning (SPL), and a new gene selection method, the SCAD-Net penalty. SPL is adopted to help build a more accurate predictor through its built-in mechanism of learning from easy samples first and then adaptively learning from hard samples. The SCAD-Net penalty addresses the SCAD method's lack of an inherent mechanism for fusing prior graphical information. We combine SPL and the SCAD-Net penalty with the Cox model (SSNC). Simulations show that SSNC outperforms the benchmark methods in terms of prediction and gene selection. A large-scale experiment across several cancer datasets shows that SSNC not only achieves higher prediction accuracy but also identifies markers that show satisfactory stability on an independent validation dataset. Demo code for the proposed method is provided in the supplemental file.
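The building blocks named in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' SSNC implementation: it uses the standard SCAD penalty of Fan and Li, a Cox negative log partial likelihood without tie correction, and the common hard-threshold form of self-paced weights. The function names, the per-sample loss definition, and the `age` threshold are hypothetical choices made here, and the network term of SCAD-Net is not reproduced because the abstract does not specify its form.

```python
import numpy as np

def scad_penalty(beta, lam, a=3.7):
    """Elementwise standard SCAD penalty (requires a > 2).
    Note: this is plain SCAD, not the paper's SCAD-Net."""
    b = np.abs(np.asarray(beta, dtype=float))
    linear = lam * b                                     # |b| <= lam
    quad = (2 * a * lam * b - b**2 - lam**2) / (2 * (a - 1))  # lam < |b| <= a*lam
    const = lam**2 * (a + 1) / 2                         # |b| > a*lam
    return np.where(b <= lam, linear, np.where(b <= a * lam, quad, const))

def cox_sample_losses(X, time, event, beta):
    """Per-sample terms of the Cox negative log partial likelihood.
    Assumption: censored samples (event == 0) are assigned a loss of 0."""
    eta = X @ beta
    # at_risk[i, j] is True when sample j is still at risk at time[i]
    at_risk = time[None, :] >= time[:, None]
    log_risk = np.log((at_risk * np.exp(eta)[None, :]).sum(axis=1))
    return event * (log_risk - eta)

def spl_weights(losses, age):
    """Hard self-paced weights: keep samples whose loss is below `age`.
    Raising `age` across iterations admits progressively harder samples."""
    return (np.asarray(losses) < age).astype(float)

# Toy data: three patients, two features, the last patient censored.
X = np.array([[0.2, -1.0], [1.5, 0.3], [-0.7, 0.8]])
time = np.array([1.0, 2.0, 3.0])
event = np.array([1.0, 1.0, 0.0])
beta = np.zeros(2)

losses = cox_sample_losses(X, time, event, beta)
weights = spl_weights(losses, age=1.0)
objective = (weights * losses).sum() + scad_penalty(beta, lam=0.1).sum()
```

In a full training loop, one would alternate between updating `beta` on the currently selected samples and recomputing the weights with a larger `age`. Under the zero-loss assumption above, censored samples are always selected; a different per-sample loss would change that behaviour.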
Pages: 1821-1830
Page count: 10