A transformer model for cause-specific hazard prediction

被引:0
|
作者
Oliver, Matthieu [1 ,2 ]
Allou, Nicolas [2 ,3 ]
Devineau, Marjolaine [3 ]
Allyn, Jerome [2 ,3 ,4 ]
Ferdynus, Cyril [2 ,4 ]
机构
[1] Reunion Univ Hosp, Methodol Support Unit, St Denis, La Reunion, France
[2] Reunion Univ Hosp, Clin Informat Dept, St Denis, La Reunion, France
[3] Reunion Univ Hosp, Intens Care Unit, St Denis, La Reunion, France
[4] INSERM, Clin Res Dept, CIC 1410, St Pierre, La Reunion, France
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
Transformer; Competing risks; Cause-specific hazard; Synthetic data; English longitudinal study of ageing;
D O I
10.1186/s12859-024-05799-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Backgroud Modelling discrete-time cause-specific hazards in the presence of competing events and non-proportional hazards is a challenging task in many domains. Survival analysis in longitudinal cohorts often requires such models; notably when the data is gathered at discrete points in time and the predicted events display complex dynamics. Current models often rely on strong assumptions of proportional hazards, that is rarely verified in practice; or do not handle sequential data in a meaningful way. This study proposes a Transformer architecture for the prediction of cause-specific hazards in discrete-time competing risks. Contrary to Multilayer perceptrons that were already used for this task (DeepHit), the Transformer architecture is especially suited for handling complex relationships in sequential data, having displayed state-of-the-art performance in numerous tasks with few underlying assumptions on the task at hand.Results Using synthetic datasets of 2000-50,000 patients, we showed that our Transformer model surpassed the CoxPH, PyDTS, and DeepHit models for the prediction of cause-specific hazard, especially when the proportional assumption did not hold. The error along simulated time outlined the ability of our model to anticipate the evolution of cause-specific hazards at later time steps where few events are observed. It was also superior to current models for prediction of dementia and other psychiatric conditions in the English longitudinal study of ageing cohort using the integrated brier score and the time-dependent concordance index. We also displayed the explainability of our model's prediction using the integrated gradients method.Conclusions Our model provided state-of-the-art prediction of cause-specific hazards, without adopting prior parametric assumptions on the hazard rates. It outperformed other models in non-proportional hazards settings for both the synthetic dataset and the longitudinal cohort study. We also observed that basic models such as CoxPH were more suited to extremely simple settings than deep learning models. Our model is therefore especially suited for survival analysis on longitudinal cohorts with complex dynamics of the covariate-to-outcome relationship, which are common in clinical practice. The integrated gradients provided the importance scores of input variables, which indicated variables guiding the model in its prediction. This model is ready to be utilized for time-to-event prediction in longitudinal cohorts.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Risk prediction for breast Cancer in Han Chinese women based on a cause-specific Hazard model
    Lu Wang
    Liyuan Liu
    Zhen Lou
    Lijie Ding
    Hui Guan
    Fei Wang
    Lixiang Yu
    Yujuan Xiang
    Fei Zhou
    Fuzhong Xue
    Zhigang Yu
    [J]. BMC Cancer, 19
  • [2] Risk prediction for breast Cancer in Han Chinese women based on a cause-specific Hazard model
    Wang, Lu
    Liu, Liyuan
    Lou, Zhen
    Ding, Lijie
    Guan, Hui
    Wang, Fei
    Yu, Lixiang
    Xiang, Yujuan
    Zhou, Fei
    Xue, Fuzhong
    Yu, Zhigang
    [J]. BMC CANCER, 2019, 19 (1)
  • [3] A semiparametric model for the cause-specific hazard under risk proportionality
    Lo, Simon M. S.
    Wilke, Ralf A.
    Emura, Takeshi
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 195
  • [4] A cause-specific hazard spatial frailty model for competing risks data
    Hesam, Saeed
    Mahmoudi, Mahmood
    Foroushani, Abbas Rahimi
    Yaseri, Mehdi
    Mansournia, Mohammad Ali
    [J]. SPATIAL STATISTICS, 2018, 26 : 101 - 124
  • [5] On concerns with cause-specific incidence and subdistribution hazard
    Nakamura, Tsuyoshi
    Yamada, Tomomi
    [J]. JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2024,
  • [6] A class of tests for the equality of k cause-specific hazard rates in a competing risks model
    Lam, KF
    [J]. BIOMETRIKA, 1998, 85 (01) : 179 - 188
  • [7] Analysis of the time-varying Cox model for the cause-specific hazard functions with missing causes
    Heng, Fei
    Sun, Yanqing
    Hyun, Seunggeun
    Gilbert, Peter B.
    [J]. LIFETIME DATA ANALYSIS, 2020, 26 (04) : 731 - 760
  • [8] Analysis of the time-varying Cox model for the cause-specific hazard functions with missing causes
    Fei Heng
    Yanqing Sun
    Seunggeun Hyun
    Peter B. Gilbert
    [J]. Lifetime Data Analysis, 2020, 26 : 731 - 760
  • [9] Kernel regression for cause-specific hazard models with nonparametric covariate functions
    Qi, Xiaomeng
    Yu, Zhangsheng
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2023, 35 (03) : 642 - 667
  • [10] Common Factor Cause-Specific Mortality Model
    Zittersteyn, Geert
    Alonso-Garcia, Jennifer
    [J]. RISKS, 2021, 9 (12)