Competing Risks Data Analysis with High-dimensional Covariates:An Application in Bladder Cancer

被引:0
|
作者
Leili Tapak [1 ]
Massoud Saidijam [2 ]
Majid Sadeghifar [3 ]
Jalal Poorolajal [1 ,4 ]
Hossein Mahjub [1 ,5 ]
机构
[1] Department of Biostatistics and Epidemiology,School of Public Health,Hamadan University of Medical Sciences
[2] Research Center for Molecular Medicine,Department of Molecular Medicine and Genetics,School of Medicine,Hamadan University of Medical Sciences
[3] Department of Statistics,Bu-Ali Sina University
[4] Modeling of Noncommunicable Diseases Research Center,School of Public Health,Hamadan University of Medical Sciences
[5] Research Center for Health Sciences,School of Public Health,Hamadan University of Medical Sciences
关键词
Microarray; Elastic net; Lasso; Competing risks; Subdistribution hazard; Cause-specific hazard;
D O I
暂无
中图分类号
R737.14 [膀胱肿瘤];
学科分类号
100214 ;
摘要
Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in highdimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward addressing competing risks where there is more than one failure risks. This study compared three typical variable selection techniques including Lasso, elastic net, and likelihood-based boosting for high-dimensional time-to-event data with competing risks. The performance of these methods was evaluated via a simulation study by analyzing a real dataset related to bladder cancer patients using time-dependent receiver operator characteristic(ROC) curve and bootstrap.632+ prediction error curves. The elastic net penalization method was shown to outperform Lasso and boosting. Based on the elastic net, 33 genes out of 1381 genes related to bladder cancer were selected. By fitting to the Fine and Gray model, eight genes were highly significant(P < 0.001). Among them, expression of RTN4, SON, IGF1 R, SNRPE, PTGR1, PLEK, and ETFDH was associated with a decrease in survival time, whereas SMARCAD1 expression was associated with an increase in survival time. This study indicates that the elastic net has a higher capacity than the Lasso and boosting for the prediction of survival time in bladder cancer patients.Moreover, genes selected by all methods improved the predictive power of the model based on only clinical variables, indicating the value of information contained in the microarray features.
引用
收藏
页码:169 / 176
页数:8
相关论文
共 50 条
  • [1] Competing Risks Data Analysis with High-dimensional Covariates:An Application in Bladder Cancer
    Leili Tapak
    Massoud Saidijam
    Majid Sadeghifar
    Jalal Poorolajal
    Hossein Mahjub
    [J]. Genomics,Proteomics & Bioinformatics., 2015, (03) - 176
  • [2] Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
    Tapak, Leili
    Saidijam, Massoud
    Sadeghifar, Majid
    Poorolajal, Jalal
    Mahjub, Hossein
    [J]. GENOMICS PROTEOMICS & BIOINFORMATICS, 2015, 13 (03) : 169 - 176
  • [3] Penalized estimation for competing risks regression with applications to high-dimensional covariates
    Ambrogi, Federico
    Scheike, Thomas H.
    [J]. BIOSTATISTICS, 2016, 17 (04) : 708 - 721
  • [4] Inference under Fine-Gray competing risks model with high-dimensional covariates
    Hou, Jue
    Bradic, Jelena
    Xu, Ronghui
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (02): : 4449 - 4507
  • [5] Boosting for high-dimensional time-to-event data with competing risks
    Binder, Harald
    Allignol, Arthur
    Schumacher, Martin
    Beyersmann, Jan
    [J]. BIOINFORMATICS, 2009, 25 (07) : 890 - 896
  • [6] Survival Analysis with High-Dimensional Covariates: An Application in Microarray Studies
    Engler, David
    Li, Yi
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
  • [7] Survival analysis with high-dimensional covariates
    Witten, Daniela M.
    Tibshirani, Robert
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (01) : 29 - 51
  • [8] PGEE: An R Package for Analysis of Longitudinal Data with High-Dimensional Covariates
    Inan, Gul
    Wang, Lan
    [J]. R JOURNAL, 2017, 9 (01): : 393 - 402
  • [9] High-dimensional variable selection and prediction under competing risks with application to SEER-Medicare linked data
    Hou, Jiayi
    Paravati, Anthony
    Hou, Jue
    Xu, Ronghui
    Murphy, James
    [J]. STATISTICS IN MEDICINE, 2018, 37 (24) : 3486 - 3502
  • [10] Missing covariates in competing risks analysis
    Bartlett, Jonathan W.
    Taylor, Jeremy M. G.
    [J]. BIOSTATISTICS, 2016, 17 (04) : 751 - 763