Predicting the 5-Year Risk of Nonalcoholic Fatty Liver Disease Using Machine Learning Models: Prospective Cohort Study

被引:2
|
作者
Huang, Guoqing [1 ,2 ]
Jin, Qiankai [1 ,2 ]
Mao, Yushan [1 ]
机构
[1] Ningbo Univ, Affiliated Hosp 1, Dept Endocrinol, 247 Renmin Rd, Ningbo 315000, Peoples R China
[2] Ningbo Univ, Hlth Sci Ctr, Ningbo, Peoples R China
关键词
nonalcoholic fatty liver disease; machine learning; independent risk factors; prediction model; model; fatty liver; prevention; liver; prognostic; China; development; validation; risk model; clinical applicability; NORMAL-WEIGHT; EPIDEMIOLOGY; PREVALENCE;
D O I
10.2196/46891
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Nonalcoholic fatty liver disease (NAFLD) has emerged as a worldwide public health issue. Identifying and targeting populations at a heightened risk of developing NAFLD over a 5-year period can help reduce and delay adverse hepatic prognostic events. Objective: This study aimed to investigate the 5-year incidence of NAFLD in the Chinese population. It also aimed to establish and validate a machine learning model for predicting the 5-year NAFLD risk. Methods: The study population was derived from a 5-year prospective cohort study. A total of 6196 individuals without NAFLD who underwent health checkups in 2010 at Zhenhai Lianhua Hospital in Ningbo, China, were enrolled in this study. Extreme gradient boosting (XGBoost)-recursive feature elimination, combined with the least absolute shrinkage and selection operator (LASSO), was used to screen for characteristic predictors. A total of 6 machine learning models, namely logistic regression, decision tree, support vector machine, random forest, categorical boosting, and XGBoost, were utilized in the construction of a 5-year risk model for NAFLD. Hyperparameter optimization of the predictive model was performed in the training set, and a further evaluation of the model performance was carried out in the internal and external validation sets. Results: The 5-year incidence of NAFLD was 18.64% (n=1155) in the study population. We screened 11 predictors for risk prediction model construction. After the hyperparameter optimization, CatBoost demonstrated the best prediction performance in the training set, with an area under the receiver operating characteristic (AUROC) curve of 0.810 (95% CI 0.768-0.852). Logistic regression showed the best prediction performance in the internal and external validation sets, with AUROC curves of 0.778 (95% CI 0.759-0.794) and 0.806 (95% CI 0.788-0.821), respectively. The development of web-based calculators has enhanced the clinical feasibility of the risk prediction model. Conclusions: Developing and validating machine learning models can aid in predicting which populations are at the highest risk of developing NAFLD over a 5-year period, thereby helping delay and reduce the occurrence of adverse liver prognostic events.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Prospective Study of Outcomes in Adults with Nonalcoholic Fatty Liver Disease
    Sanyal, Arun J.
    Van Natta, Mark L.
    Clark, Jeanne
    Neuschwander-Tetri, Brent A.
    Diehl, AnnaMae
    Dasarathy, Srinivasan
    Loomba, Rohit
    Chalasani, Naga
    Kowdley, Kris
    Hameed, Bilal
    Wilson, Laura A.
    Yates, Katherine P.
    Belt, Patricia
    Lazo, Mariana
    Kleiner, David E.
    Behling, Cynthia
    Tonascia, James
    NEW ENGLAND JOURNAL OF MEDICINE, 2021, 385 (17): : 1559 - 1569
  • [42] Nonalcoholic fatty liver disease is an independent risk factor for ischemic stroke after revascularization in patients with Moyamoya disease: a prospective cohort study
    Zhang, Bojian
    Li, Junsheng
    Zeng, Chaofan
    Tao, Chuming
    He, Qiheng
    Liu, Chenglong
    Zheng, Zhiyao
    Zhao, Zhikang
    Mou, Siqi
    Sun, Wei
    Wang, Jia
    Zhang, Qian
    Wang, Rong
    Zhang, Yan
    Ge, Peicong
    Zhang, Dong
    LIPIDS IN HEALTH AND DISEASE, 2024, 23 (01)
  • [43] Nonalcoholic fatty liver disease is an independent risk factor for ischemic stroke after revascularization in patients with Moyamoya disease: a prospective cohort study
    Bojian Zhang
    Junsheng Li
    Chaofan Zeng
    Chuming Tao
    Qiheng He
    Chenglong Liu
    Zhiyao Zheng
    Zhikang Zhao
    Siqi Mou
    Wei Sun
    Jia Wang
    Qian Zhang
    Rong Wang
    Yan Zhang
    Peicong Ge
    Dong Zhang
    Lipids in Health and Disease, 23
  • [44] Predicting 5-year progression to vision-threatening complications using machine learning
    Damopoulos, Dimitrios
    Mendes, Luis
    Santos, Torcato
    Neubert, Ales
    Ferrara, Daniela
    Cunha-Vaz, Jose G.
    Benmansour, Fethallah
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2022, 63 (07)
  • [45] RSV Severe Infection Risk Stratification in a French 5-Year Birth Cohort Using Machine-learning
    Horvat, Come
    Chauvel, Cecile
    Casalegno, Jean-Sebastien
    Benchaib, Mehdi
    Ploin, Dominique
    Nunes, Marta C.
    PEDIATRIC INFECTIOUS DISEASE JOURNAL, 2024, 43 (09) : 819 - 824
  • [46] Predicting breast cancer 5-year survival using machine learning: A systematic review
    Li, Jiaxin
    Zhou, Zijun
    Dong, Jianyu
    Fu, Ying
    Li, Yuan
    Luan, Ze
    Peng, Xin
    PLOS ONE, 2021, 16 (04):
  • [47] Machine learning approaches for predicting 5-year breast cancer survival: A multicenter study
    Nguyen, Quynh Thi Nhu
    Nguyen, Phung-Anh
    Wang, Chun-Jung
    Phuc, Phan Thanh
    Lin, Ruo-Kai
    Hung, Chin-Sheng
    Kuo, Nei-Hui
    Cheng, Yu-Wen
    Lin, Shwu-Jiuan
    Hsieh, Zong-You
    Cheng, Chi-Tsun
    Hsu, Min-Huei
    Hsu, Jason C. C.
    CANCER SCIENCE, 2023, 114 (10) : 4063 - 4072
  • [48] Is fatty liver in diabetes a risk factor for HCC and cardiovascular disease? Prospective 10-year cohort study
    Seike, Masataka
    Honda, Koichi
    Oribe, Junya
    Endo, Mizuki
    Yoshihara, Mie
    Tokoro, Masanori
    Iwao, Masao
    Syo, Hiroki
    Murakami, Kazunari
    HEPATOLOGY, 2014, 60 : 620A - 620A
  • [49] The clinical availability of non alcoholic fatty liver disease as an early predictor of the metabolic syndrome in Korean men: 5-Year's prospective cohort study
    Ryoo, Jae-Hong
    Choi, Joong-Myung
    Moon, Soo Young
    Suh, Young Ju
    Shin, Ju-Young
    Shin, Ho Cheol
    Park, Sung Keun
    ATHEROSCLEROSIS, 2013, 227 (02) : 398 - 403
  • [50] Comparison of Animal Models for the Study of Nonalcoholic Fatty Liver Disease
    Zheng, Qing
    Zhu, Min
    Zeng, Xin
    Liu, Wen
    Fu, Fudong
    Li, Xiaoyu
    Liao, Guangneng
    Lu, Yanrong
    Chen, Younan
    LABORATORY INVESTIGATION, 2023, 103 (07)