Establishment of a machine learning predictive model for non-alcoholic fatty liver disease: A longitudinal cohort study

Cited by: 2
Authors
Cao, Tengrui [1, 2]
Zhu, Qian [1, 2, 3]
Tong, Chao [4]
Halengbieke, Aheyeerke [1, 2]
Ni, Xuetong [1, 2]
Tang, Jianmin [1, 2]
Han, Yumei [5]
Li, Qiang [5]
Yang, Xinghua [1, 2]
Affiliations
[1] Capital Med Univ, Sch Publ Hlth, 10 Xitoutiao, Beijing 100069, Peoples R China
[2] Beijing Municipal Key Lab Clin Epidemiol, 10 Xitoutiao, Beijing 100069, Peoples R China
[3] Chinese Acad Med Sci & Peking Union Med Coll, Natl Canc Ctr, Natl Clin Res Ctr Canc, Canc Hosp,Off Canc Registry, Beijing 100021, Peoples R China
[4] Beijing Ctr Dis Prevent & Control, Beijing 100013, Peoples R China
[5] Beijing Phys Examinat Ctr, Sci & Educ Sect, 59 Beiwei Rd, Beijing 100050, Peoples R China
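Funding
Beijing Natural Science Foundation; National Key Research and Development Program of China
Keywords
Non-alcoholic fatty liver disease; Predictive model; eXtreme gradient boosting; Machine learning; DIAGNOSIS; INDEX; NAFLD; TESTS
DOI
10.1016/j.numecd.2024.02.004
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract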
Background and aims: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver disease for which effective drug treatments are lacking. This study aimed to construct an eXtreme Gradient Boosting (XGBoost) prediction model to identify or evaluate potential NAFLD patients. Methods and results: We conducted a longitudinal study of 22,140 individuals from the Beijing Health Management Cohort. Variables were filtered using the least absolute shrinkage and selection operator. Random Over Sampling Examples was used to address the class imbalance in the data. The XGBoost model and three other machine learning (ML) models were then built on the balanced data. Finally, the variables of the XGBoost model were ranked by importance. Among the four ML algorithms, the XGBoost model outperformed the others, with an accuracy of 0.835, sensitivity of 0.835, specificity of 0.834, Youden index of 0.669, precision of 0.831, recall of 0.835, F-1 score of 0.833, and an area under the curve of 0.914. The five variables with the greatest impact on the onset of NAFLD were aspartate aminotransferase, cardiometabolic index, body mass index, alanine aminotransferase, and triglyceride-glucose index. Conclusion: The predictive model based on the XGBoost algorithm enables early prediction of the onset of NAFLD. Additionally, assessing variable importance provides valuable insights into the prevention and treatment of NAFLD. © 2024 The Italian Diabetes Society, the Italian Society for the Study of Atherosclerosis, the Italian Society of Human Nutrition and the Department of Clinical Medicine and Surgery, Federico II University. Published by Elsevier B.V. All rights reserved.
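The evaluation metrics listed in the abstract are all functions of a binary confusion matrix, and two of them (Youden index and F-1 score) follow directly from the others. The sketch below uses the standard textbook definitions; it is not the authors' code. The class `Confusion`, the function names, and the example counts are hypothetical, chosen only for illustration. The final lines check that the rounded values reported in the abstract are mutually consistent.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Binary confusion-matrix counts (hypothetical container for illustration)."""
    tp: int  # true positives
    fp: int  # false positives
    tn: int  # true negatives
    fn: int  # false negatives


def youden_index(sensitivity: float, specificity: float) -> float:
    # Youden's J = sensitivity + specificity - 1
    return sensitivity + specificity - 1.0


def f1_score(precision: float, recall: float) -> float:
    # F-1 is the harmonic mean of precision and recall
    return 2.0 * precision * recall / (precision + recall)


def summarize(c: Confusion) -> dict:
    """Compute the standard metrics from raw counts."""
    sensitivity = c.tp / (c.tp + c.fn)  # identical to recall
    specificity = c.tn / (c.tn + c.fp)
    precision = c.tp / (c.tp + c.fp)
    accuracy = (c.tp + c.tn) / (c.tp + c.fp + c.tn + c.fn)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "youden": youden_index(sensitivity, specificity),
        "f1": f1_score(precision, sensitivity),
    }


# Made-up counts, NOT data from the study.
example = summarize(Confusion(tp=80, fp=10, tn=90, fn=20))

# Consistency check on the rounded values reported in the abstract.
reported_youden = round(youden_index(0.835, 0.834), 3)
reported_f1 = round(f1_score(0.831, 0.835), 3)
print(reported_youden, reported_f1)
```

Recomputing the derived metrics from the reported sensitivity, specificity, and precision gives 0.669 and 0.833, matching the abstract.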
Pages: 1456-1466
Page count: 11