Machine learning algorithms for identifying predictive variables of mortality risk following dementia diagnosis: a longitudinal cohort study

被引:0
|
作者
Shayan Mostafaei
Minh Tuan Hoang
Pol Grau Jurado
Hong Xu
Lluis Zacarias-Pons
Maria Eriksdotter
Saikat Chatterjee
Sara Garcia-Ptacek
机构
[1] Karolinska Institute,Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society
[2] Karolinska Institute,Department of Medical Epidemiology and Biostatistics
[3] Institut Universitari d’Investigació en Atenció Primària Jordi Gol i Gurina (IDIAP Jordi Gol),Vascular Health Research Group of Girona (ISV
[4] Primary Care,Girona)
[5] and Health Promotion (RICAPPS),Network for Research on Chronicity
[6] Karolinska University Hospital,Aging and Inflammation Theme
[7] KTH Royal Institute of Technology,Division of Information Science and Engineering, School of Electrical Engineering and Computer Science
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning (ML) could have advantages over traditional statistical models in identifying risk factors. Using ML algorithms, our objective was to identify the most important variables associated with mortality after dementia diagnosis in the Swedish Registry for Cognitive/Dementia Disorders (SveDem). From SveDem, a longitudinal cohort of 28,023 dementia-diagnosed patients was selected for this study. Sixty variables were considered as potential predictors of mortality risk, such as age at dementia diagnosis, dementia type, sex, body mass index (BMI), mini-mental state examination (MMSE) score, time from referral to initiation of work-up, time from initiation of work-up to diagnosis, dementia medications, comorbidities, and some specific medications for chronic comorbidities (e.g., cardiovascular disease). We applied sparsity-inducing penalties for three ML algorithms and identified twenty important variables for the binary classification task in mortality risk prediction and fifteen variables to predict time to death. Area-under-ROC curve (AUC) measure was used to evaluate the classification algorithms. Then, an unsupervised clustering algorithm was applied on the set of twenty-selected variables to find two main clusters which accurately matched surviving and dead patient clusters. A support-vector-machines with an appropriate sparsity penalty provided the classification of mortality risk with accuracy = 0.7077, AUROC = 0.7375, sensitivity = 0.6436, and specificity = 0.740. Across three ML algorithms, the majority of the identified twenty variables were compatible with literature and with our previous studies on SveDem. We also found new variables which were not previously reported in literature as associated with mortality in dementia. Performance of basic dementia diagnostic work-up, time from referral to initiation of work-up, and time from initiation of work-up to diagnosis were found to be elements of the diagnostic process identified by the ML algorithms. The median follow-up time was 1053 (IQR = 516–1771) days in surviving and 1125 (IQR = 605–1770) days in dead patients. For prediction of time to death, the CoxBoost model identified 15 variables and classified them in order of importance. These highly important variables were age at diagnosis, MMSE score, sex, BMI, and Charlson Comorbidity Index with selection scores of 23%, 15%, 14%, 12% and 10%, respectively. This study demonstrates the potential of sparsity-inducing ML algorithms in improving our understanding of mortality risk factors in dementia patients and their application in clinical settings. Moreover, ML methods can be used as a complement to traditional statistical methods.
引用
收藏
相关论文
共 50 条
  • [11] Comparative Study of Machine Learning Algorithms towards Predictive Analytics
    Petchiappan M.
    Aravindhen J.
    Recent Advances in Computer Science and Communications, 2023, 16 (06) : 69 - 79
  • [12] Machine learning models identify predictive features of patient mortality across dementia types
    Zhang, Jimmy
    Song, Luo
    Miller, Zachary
    Chan, Kwun C. G.
    Huang, Kuan-lin
    COMMUNICATIONS MEDICINE, 2024, 4 (01):
  • [13] Machine learning models identify predictive features of patient mortality across dementia types
    Jimmy Zhang
    Luo Song
    Zachary Miller
    Kwun C. G. Chan
    Kuan-lin Huang
    Communications Medicine, 4
  • [14] A Machine Learning Approach for Identifying Amino Acid Signatures in the HIV Env Gene Predictive of Dementia
    Holman, Alexander G.
    Gabuzda, Dana
    PLOS ONE, 2012, 7 (11):
  • [15] Identifying dementia using medical data linkage in a longitudinal cohort study: Lothian Birth Cohort 1936
    Mullin, Donncha S.
    Stirland, Lucy E.
    Buchanan, Emily
    Convery, Catherine-Anne
    Cox, Simon R.
    Deary, Ian J.
    Giuntoli, Cinzia
    Greer, Holly
    Page, Danielle
    Robertson, Elizabeth
    Shenkin, Susan D.
    Szalek, Anna
    Taylor, Adele
    Weatherdon, Georgina
    Wilkinson, Tim
    Russ, Tom C.
    BMC PSYCHIATRY, 2023, 23 (01)
  • [16] Identifying dementia using medical data linkage in a longitudinal cohort study: Lothian Birth Cohort 1936
    Donncha S. Mullin
    Lucy E. Stirland
    Emily Buchanan
    Catherine-Anne Convery
    Simon R. Cox
    Ian J. Deary
    Cinzia Giuntoli
    Holly Greer
    Danielle Page
    Elizabeth Robertson
    Susan D. Shenkin
    Anna Szalek
    Adele Taylor
    Georgina Weatherdon
    Tim Wilkinson
    Tom C. Russ
    BMC Psychiatry, 23
  • [17] Prediction of mortality following pediatric heart transplant using machine learning algorithms
    Miller, Rebecca
    Tumin, Dmitry
    Cooper, Jennifer
    Hayes, Don, Jr.
    Tobias, Joseph D.
    PEDIATRIC TRANSPLANTATION, 2019, 23 (03)
  • [18] DAILY WATER INTAKE AND RISK OF MORTALITY: LONGITUDINAL COHORT STUDY
    Palmer, Suetonia
    Germaine, Wong
    Iff, Samuel
    Craig, Jonathan
    Mitchell, Paul
    Wang, Jie Jin
    Strippoli, Giovanni
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2012, 27 : 121 - 121
  • [19] Predictive Model to Identify the Long Time Survivor in Patients with Glioblastoma: A Cohort Study Integrating Machine Learning Algorithms
    Yang, Xi-Lin
    Zeng, Zheng
    Wang, Chen
    Sheng, Yun-Long
    Wang, Guang-Yu
    Zhang, Fu-Quan
    Lian, Xin
    JOURNAL OF MOLECULAR NEUROSCIENCE, 2024, 74 (02)
  • [20] Establishment of a machine learning predictive model for non-alcoholic fatty liver disease: A longitudinal cohort study
    Cao, Tengrui
    Zhu, Qian
    Tong, Chao
    Halengbieke, Aheyeerke
    Ni, Xuetong
    Tang, Jianmin
    Han, Yumei
    Li, Qiang
    Yang, Xinghua
    NUTRITION METABOLISM AND CARDIOVASCULAR DISEASES, 2024, 34 (06) : 1456 - 1466