Machine learning algorithms for identifying predictive variables of mortality risk following dementia diagnosis: a longitudinal cohort study

Cited by: 0
Authors
Shayan Mostafaei
Minh Tuan Hoang
Pol Grau Jurado
Hong Xu
Lluis Zacarias-Pons
Maria Eriksdotter
Saikat Chatterjee
Sara Garcia-Ptacek
Affiliations
[1] Karolinska Institute, Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society
[2] Karolinska Institute, Department of Medical Epidemiology and Biostatistics
[3] Institut Universitari d’Investigació en Atenció Primària Jordi Gol i Gurina (IDIAP Jordi Gol), Vascular Health Research Group of Girona (ISV-Girona)
[4] Network for Research on Chronicity, Primary Care, and Health Promotion (RICAPPS)
[5] Karolinska University Hospital, Aging and Inflammation Theme
[6] KTH Royal Institute of Technology, Division of Information Science and Engineering, School of Electrical Engineering and Computer Science
Abstract
Machine learning (ML) may have advantages over traditional statistical models in identifying risk factors. Using ML algorithms, we aimed to identify the most important variables associated with mortality after dementia diagnosis in the Swedish Registry for Cognitive/Dementia Disorders (SveDem). From SveDem, we selected a longitudinal cohort of 28,023 patients diagnosed with dementia. Sixty variables were considered as potential predictors of mortality risk, including age at dementia diagnosis, dementia type, sex, body mass index (BMI), mini-mental state examination (MMSE) score, time from referral to initiation of work-up, time from initiation of work-up to diagnosis, dementia medications, comorbidities, and some specific medications for chronic comorbidities (e.g., cardiovascular disease). We applied sparsity-inducing penalties to three ML algorithms and identified twenty important variables for the binary classification of mortality risk and fifteen variables for predicting time to death. The area under the receiver operating characteristic curve (AUROC) was used to evaluate the classification algorithms. An unsupervised clustering algorithm was then applied to the twenty selected variables and found two main clusters that accurately matched the surviving and deceased patient groups. A support vector machine with an appropriate sparsity penalty classified mortality risk with accuracy = 0.7077, AUROC = 0.7375, sensitivity = 0.6436, and specificity = 0.740. Across the three ML algorithms, most of the twenty identified variables were consistent with the literature and with our previous studies on SveDem. We also found new variables not previously reported in the literature as associated with mortality in dementia.
Among elements of the diagnostic process, the ML algorithms identified performance of the basic dementia diagnostic work-up, time from referral to initiation of work-up, and time from initiation of work-up to diagnosis. The median follow-up time was 1053 days (IQR = 516–1771) in surviving patients and 1125 days (IQR = 605–1770) in deceased patients. For prediction of time to death, the CoxBoost model identified 15 variables and ranked them in order of importance. The most important were age at diagnosis, MMSE score, sex, BMI, and Charlson Comorbidity Index, with selection scores of 23%, 15%, 14%, 12%, and 10%, respectively. This study demonstrates the potential of sparsity-inducing ML algorithms to improve our understanding of mortality risk factors in dementia patients and their application in clinical settings. Moreover, ML methods can complement traditional statistical methods.
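
For readers unfamiliar with the classification metrics quoted in the abstract (accuracy, sensitivity, specificity, AUROC), the following is a minimal sketch of how they are conventionally computed. It is not the authors' pipeline: the function names, the 0.5 decision threshold, and the eight-patient toy data are all assumptions made purely for illustration, and the resulting numbers have no relation to the study's results.

```python
from typing import Dict, Sequence


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity and specificity from binary labels (1 = died, 0 = survived)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),  # share of deaths correctly flagged
        "specificity": tn / (tn + fp),  # share of survivors correctly cleared
    }


def auroc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a random positive outscores a random negative."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    # Ties between a positive and a negative score count as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: true outcomes and model risk scores for eight patients.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1, 0.6]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

metrics = classification_metrics(y_true, y_pred)
auc = auroc(y_true, scores)
print(metrics, auc)
```

Sensitivity and specificity depend on the chosen threshold, whereas AUROC summarizes ranking quality across all thresholds, which is why a model can report a higher AUROC than accuracy, as in the abstract.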