Predicting the 5-Year Risk of Nonalcoholic Fatty Liver Disease Using Machine Learning Models: Prospective Cohort Study

被引:2
|
作者
Huang, Guoqing [1 ,2 ]
Jin, Qiankai [1 ,2 ]
Mao, Yushan [1 ]
机构
[1] Ningbo Univ, Affiliated Hosp 1, Dept Endocrinol, 247 Renmin Rd, Ningbo 315000, Peoples R China
[2] Ningbo Univ, Hlth Sci Ctr, Ningbo, Peoples R China
关键词
nonalcoholic fatty liver disease; machine learning; independent risk factors; prediction model; model; fatty liver; prevention; liver; prognostic; China; development; validation; risk model; clinical applicability; NORMAL-WEIGHT; EPIDEMIOLOGY; PREVALENCE;
D O I
10.2196/46891
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Nonalcoholic fatty liver disease (NAFLD) has emerged as a worldwide public health issue. Identifying and targeting populations at a heightened risk of developing NAFLD over a 5-year period can help reduce and delay adverse hepatic prognostic events. Objective: This study aimed to investigate the 5-year incidence of NAFLD in the Chinese population. It also aimed to establish and validate a machine learning model for predicting the 5-year NAFLD risk. Methods: The study population was derived from a 5-year prospective cohort study. A total of 6196 individuals without NAFLD who underwent health checkups in 2010 at Zhenhai Lianhua Hospital in Ningbo, China, were enrolled in this study. Extreme gradient boosting (XGBoost)-recursive feature elimination, combined with the least absolute shrinkage and selection operator (LASSO), was used to screen for characteristic predictors. A total of 6 machine learning models, namely logistic regression, decision tree, support vector machine, random forest, categorical boosting, and XGBoost, were utilized in the construction of a 5-year risk model for NAFLD. Hyperparameter optimization of the predictive model was performed in the training set, and a further evaluation of the model performance was carried out in the internal and external validation sets. Results: The 5-year incidence of NAFLD was 18.64% (n=1155) in the study population. We screened 11 predictors for risk prediction model construction. After the hyperparameter optimization, CatBoost demonstrated the best prediction performance in the training set, with an area under the receiver operating characteristic (AUROC) curve of 0.810 (95% CI 0.768-0.852). Logistic regression showed the best prediction performance in the internal and external validation sets, with AUROC curves of 0.778 (95% CI 0.759-0.794) and 0.806 (95% CI 0.788-0.821), respectively. The development of web-based calculators has enhanced the clinical feasibility of the risk prediction model. Conclusions: Developing and validating machine learning models can aid in predicting which populations are at the highest risk of developing NAFLD over a 5-year period, thereby helping delay and reduce the occurrence of adverse liver prognostic events.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] LONG-TERM USE OF ANTIBIOTICS AND THE RISK OF NONALCOHOLIC FATTY LIVER DISEASE: A PROSPECTIVE COHORT STUDY AMONG WOMEN
    Kim, Mi Na
    Lo, Chun-Han
    Corey, Kathleen E.
    Luo, Xiao
    Zhang, Xuehong
    Chan, Andrew T.
    Simon, Tracey G.
    HEPATOLOGY, 2020, 72 : 979 - 980
  • [32] Predicting diabetes in adults: identifying important features in unbalanced data over a 5-year cohort study using machine learning algorithm
    Maryam Talebi Moghaddam
    Yones Jahani
    Zahra Arefzadeh
    Azizallah Dehghan
    Mohsen Khaleghi
    Mehdi Sharafi
    Ghasem Nikfar
    BMC Medical Research Methodology, 24 (1)
  • [33] Disease Course of Primary Dupuytren Disease: 5-Year Results of a Prospective Cohort Study
    Broekstra, Dieuwke C.
    Lanting, Rosanne
    Werker, Paul M. N.
    van den Heuvel, Edwin R.
    PLASTIC AND RECONSTRUCTIVE SURGERY, 2022, 149 (06) : 1371 - 1378
  • [34] Exercise, weight maintenance, and nonalcoholic fatty liver disease risk: a Chinese cohort study
    Yang, Chao
    Yan, Peijing
    Deng, Jiaqi
    Li, Yujuan
    Jiang, Xia
    Zhang, Ben
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [35] Nonalcoholic Fatty Liver Disease and Risk of Dementia A Population-Based Cohort Study
    Shang, Ying
    Widman, Linnea
    Hagstrom, Hannes
    NEUROLOGY, 2022, 99 (06) : E574 - E582
  • [36] Effect of Alcohol Consumption on Survival in Nonalcoholic Fatty Liver Disease: A National Prospective Cohort Study
    Hajifathalian, Kaveh
    Sagvand, Babak Torabi
    McCullough, Arthur J.
    HEPATOLOGY, 2019, 70 (02) : 511 - 521
  • [37] Diabesity phenotype in relation to the incidence and resolution of nonalcoholic fatty liver disease: A prospective cohort study
    Kong, Lijie
    Ye, Chaojie
    Wang, Yiying
    Dou, Chun
    Zheng, Jie
    Wang, Shuangyuan
    Lin, Hong
    Zhao, Zhiyun
    Li, Mian
    Xu, Yu
    Chen, Yuhong
    Lu, Jieli
    Xu, Min
    Wang, Weiqing
    Ning, Guang
    Bi, Yufang
    Wang, Tiange
    JOURNAL OF DIABETES, 2024, 16 (01)
  • [38] Predicting risk of erectile dysfunction in patients with nonalcoholic fatty liver disease
    Yilmaz, Mehmet
    Odabas, Oner
    Karaaslan, Mustafa
    Guler, Omer Faruk
    Toprak, Tuncay
    Bicer, Sait
    Tonyali, Senol
    ANDROLOGIA, 2021, 53 (07)
  • [39] Increased risk of nonalcoholic fatty liver disease with occupational stress in Chinese policemen A 4-year cohort study
    Li, Chen
    Xing, Jing-Jing
    Shan, An-Qi
    Leng, Ling
    Liu, Jin-Chuan
    Yue, Song
    Yu, Hao
    Chen, Xi
    Tian, Feng-Shi
    Tang, Nai-Jun
    MEDICINE, 2016, 95 (46)
  • [40] Comparison of Machine Learning Models and the Fatty Liver Index in Predicting Lean Fatty Liver
    Su, Pei-Yuan
    Chen, Yang-Yuan
    Lin, Chun-Yu
    Su, Wei-Wen
    Huang, Siou-Ping
    Yen, Hsu-Heng
    DIAGNOSTICS, 2023, 13 (08)