Establishment of a machine learning predictive model for non-alcoholic fatty liver disease: A longitudinal cohort study

被引:2
|
作者
Cao, Tengrui [1 ,2 ]
Zhu, Qian [1 ,2 ,3 ]
Tong, Chao [4 ]
Halengbieke, Aheyeerke [1 ,2 ]
Ni, Xuetong [1 ,2 ]
Tang, Jianmin [1 ,2 ]
Han, Yumei [5 ]
Li, Qiang [5 ]
Yang, Xinghua [1 ,2 ]
机构
[1] Capital Med Univ, Sch Publ Hlth, 10 Xitoutiao, Beijing 100069, Peoples R China
[2] Beijing Municipal Key Lab Clin Epidemiol, 10 Xitoutiao, Beijing 100069, Peoples R China
[3] Chinese Acad Med Sci & Peking Union Med Coll, Natl Canc Ctr, Natl Clin Res Ctr Canc, Canc Hosp,Off Canc Registry, Beijing 100021, Peoples R China
[4] Beijing Ctr Dis Prevent & Control, Beijing 100013, Peoples R China
[5] Beijing Phys Examinat Ctr, Sci & Educ Sect, 59 Beiwei Rd, Beijing 100050, Peoples R China
基金
北京市自然科学基金; 国家重点研发计划;
关键词
Non-alcoholic fatty liver disease; Predictive model; eXtreme gradient boosting; Machine learning; DIAGNOSIS; INDEX; NAFLD; TESTS;
D O I
10.1016/j.numecd.2024.02.004
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background and aims: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver disease, which lacks effective drug treatments. This study aimed to construct an eXtreme Gradient Boosting (XGBoost) prediction model to identify or evaluate potential NAFLD patients. Methods and results: We conducted a longitudinal study of 22,140 individuals from the Beijing Health Management Cohort. Variable filtering was performed using the least absolute shrinkage and selection operator. Random Over Sampling Examples was used to address imbalanced data. Next, the XGBoost model and the other three machine learning (ML) models were built using balanced data. Finally, the variable importance of the XGBoost model was ranked. Among four ML algorithms, we got that the XGBoost model outperformed the other models with the following results: accuracy of 0.835, sensitivity of 0.835, specificity of 0.834, Youden index of 0.669, precision of 0.831, recall of 0.835, F-1 score of 0.833, and an area under the curve of 0.914. The top five variables with the greatest impact on the onset of NAFLD were aspartate aminotransferase, cardiometabolic index, body mass index, alanine aminotransferase, and triglyceride-glucose index. Conclusion: The predictive model based on the XGBoost algorithm enables early prediction of the onset of NAFLD. Additionally, assessing variable importance provides valuable insights into the prevention and treatment of NAFLD. (c) 2024 The Italian Diabetes Society, the Italian Society for the Study of Atherosclerosis, the Italian Society of Human Nutrition and the Department of Clinical Medicine and Surgery, Federico II University. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1456 / 1466
页数:11
相关论文
共 50 条
  • [1] A predictive model for the diagnosis of non-alcoholic fatty liver disease based on an integrated machine learning method
    Ma, Xuefeng
    Yang, Chao
    Liang, Kun
    Sun, Baokai
    Jin, Wenwen
    Chen, Lizhen
    Dong, Mengzhen
    Liu, Shousheng
    Xin, Yongning
    Zhuang, Likun
    AMERICAN JOURNAL OF TRANSLATIONAL RESEARCH, 2021, 13 (11): : 12704 - 12713
  • [2] Establishment of a mice nutritional non-alcoholic fatty liver disease model
    Cong, Weina
    Tao, Rongya
    Ye, Fei
    ACTA PHARMACOLOGICA SINICA, 2006, 27 : 298 - 298
  • [3] Development of machine learning model to detect fibrotic non-alcoholic steatohepatitis in patients with non-alcoholic fatty liver disease
    Aggarwal, Manik
    Rozenbaum, Daniel
    Bansal, Agam
    Garg, Rajat
    Bansal, Parikshit
    McCullough, Arthur
    DIGESTIVE AND LIVER DISEASE, 2021, 53 (12) : 1669 - 1672
  • [4] Predictive analysis on severity of Non-Alcoholic Fatty Liver Disease (NAFLD) using Machine Learning Algorithms
    Aslam, Muhamamd Haseeb
    Hussain, Syed Fawad
    Ali, Raja Hashim
    2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 95 - 100
  • [5] Smoking and the Risk of Non-Alcoholic Fatty Liver Disease: A Cohort Study
    Jung, Hyun-Suk
    Chang, Yoosoo
    Kwon, Min-Jung
    Sung, Eunju
    Yun, Kyung Eun
    Cho, Yong Kyun
    Shin, Hocheol
    Ryu, Seungho
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2019, 114 (03): : 453 - 463
  • [6] Non-alcoholic fatty liver disease and the development of nephrolithiasis: A cohort study
    Kim, Seolhye
    Chang, Yoosoo
    Sung, Eunju
    Kim, Cheol Hwan
    Yun, Kyung Eun
    Jung, Hyun-Suk
    Shin, Hocheol
    Ryu, Seungho
    PLOS ONE, 2017, 12 (10):
  • [7] Alcoholic and non-alcoholic fatty liver disease and liver-related mortality: A cohort study
    Cho, Yong Kyun
    Kim, Keonhwa
    Chang, Yoosoo
    Cho, Juhee
    Jung, Hyun-Suk
    Yun, Kyung Eun
    Ahn, Jiin
    Ryu, Seungho
    JOURNAL OF HEPATOLOGY, 2019, 70 (01) : E189 - E189
  • [8] MACHINE LEARNING MODEL OUTPERFORMS NONINVASIVE TESTS TO DETECT FIBROTIC NON-ALCOHOLIC STEATOHEPATITIS IN PATIENTS WITH NON-ALCOHOLIC FATTY LIVER DISEASE
    Aggarwal, Manik
    McCullough, Arthur J.
    HEPATOLOGY, 2021, 74 : 50A - 51A
  • [9] Machine learning to predict progression of non-alcoholic fatty liver to non-alcoholic steatohepatitis or fibrosis
    Ghandian, Sina
    Thapa, Rahul
    Garikipati, Anurag
    Barnes, Gina
    Green-Saxena, Abigail
    Calvert, Jacob
    Mao, Qingqing
    Das, Ritankar
    JGH OPEN, 2022, 6 (03): : 196 - 204
  • [10] Non-alcoholic fatty liver disease and hypertension in the community based cohort study
    Kim, G. S.
    Kim, B. O.
    Seo, J. K.
    Lee, H. Y.
    Jung, I. H.
    Byun, Y. S.
    EUROPEAN HEART JOURNAL, 2018, 39 : 256 - 256