Prediction and analysis of risk factors for diabetic retinopathy based on machine learning and interpretable models

被引:0
|
作者
Wang, Xu [1 ]
Wang, Weijie [1 ]
Ren, Huiling [1 ]
Li, Xiaoying [1 ]
Wen, Yili [1 ]
机构
[1] Chinese Acad Med Sci & Peking Union Med Coll, Inst Med Informat, Med Lib, 69 Dongdan North Str, Beijing, Peoples R China
关键词
Diabetic retinopathy; Machine learning; CatBoost; SHAP; Prediction model;
D O I
10.1016/j.heliyon.2024.e29497
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objective: Diabetic retinopathy is one of the major complications of diabetes. In this study, a diabetic retinopathy risk prediction model integrating machine learning models and SHAP was established to increase the accuracy of risk prediction for diabetic retinopathy, explain the rationality of the findings from model prediction and improve the reliability of prediction results. Methods: Data were preprocessed for missing values and outliers, features selected through information gain, a diabetic retinopathy risk prediction model established using the CatBoost and the outputs of the mode interpreted using the SHAP model. Results: One thousand early warning data of diabetes complications derived from diabetes complication early warning dataset from the National Clinical Medical Sciences Data Center were used in this study. The CatBoost-based model for diabetic retinopathy prediction performed the best in the comparative model test. ALB_CR, HbA1c, UPR_24, NEPHROPATHY and SCR were positively correlated with diabetic retinopathy, while CP, HB, ALB, DBILI and CRP were negatively correlated with diabetic retinopathy. The relationships between HEIGHT, WEIGHT and ESR characteristics and diabetic retinopathy were not significant. Conclusion: The risk factors for diabetic retinopathy include poor renal function, elevated blood glucose level, liver disease, hematonosis and dysarteriotony, among others. Diabetic retinopathy can be prevented by monitoring and effectively controlling relevant indices. In this study, the influence relationships between the features were also analyzed to further explore the potential factors of diabetic retinopathy, which can provide new methods and new ideas for the early prevention and clinical diagnosis of subsequent diabetic retinopathy.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Interpretable Machine Learning Models for Modal Split Prediction in Transportation Systems
    Brenner, Aron
    Wu, Manxi
    Amin, Saurabh
    [J]. 2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 901 - 908
  • [22] Identification of Risk Factors and Machine Learning-Based Prediction Models for Knee Osteoarthritis Patients
    Kokkotis, Christos
    Moustakidis, Serafeim
    Giakas, Giannis
    Tsaopoulos, Dimitrios
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (19):
  • [23] DREAM: Diabetic Retinopathy Analysis Using Machine Learning
    Roychowdhury, Sohini
    Koozekanani, Dara D.
    Parhi, Keshab K.
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2014, 18 (05) : 1717 - 1728
  • [24] MULTIPLE FACTORS IN THE PREDICTION OF RISK OF PROLIFERATIVE DIABETIC-RETINOPATHY
    RAND, LI
    KROLEWSKI, AS
    AIELLO, LM
    WARRAM, JH
    BAKER, RS
    MAKI, T
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1985, 313 (23): : 1433 - 1438
  • [25] Accurate band gap prediction based on an interpretable ?-machine learning
    Zhang, Lingyao
    Su, Tianhao
    Li, Musen
    Jia, Fanhao
    Hu, Shuobo
    Zhang, Peihong
    Ren, Wei
    [J]. MATERIALS TODAY COMMUNICATIONS, 2022, 33
  • [26] Prediction of the Fatigue Strength of Steel Based on Interpretable Machine Learning
    Liu, Chengcheng
    Wang, Xuandong
    Cai, Weidong
    Yang, Jiahui
    Su, Hang
    [J]. MATERIALS, 2023, 16 (23)
  • [27] Prediction and Analysis of Heat Transfer Characteristics of Supercritical Fluids Based on Interpretable Machine Learning
    Li, Haozhe
    Song, Meiqi
    Liu, Xiaojing
    [J]. Hedongli Gongcheng/Nuclear Power Engineering, 2024, 45 (06): : 63 - 74
  • [28] Integration of IDPC Clustering Analysis and Interpretable Machine Learning for Survival Risk Prediction of Patients with ESCC
    Dan Ling
    Anhao Liu
    Junwei Sun
    Yanfeng Wang
    Lidong Wang
    Xin Song
    Xueke Zhao
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 480 - 498
  • [29] Integration of IDPC Clustering Analysis and Interpretable Machine Learning for Survival Risk Prediction of Patients with ESCC
    Ling, Dan
    Liu, Anhao
    Sun, Junwei
    Wang, Yanfeng
    Wang, Lidong
    Song, Xin
    Zhao, Xueke
    [J]. INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (03) : 480 - 498
  • [30] Machine Learning Models for Prediction of Diabetic Microvascular Complications
    Kanbour, Sarah
    Harris, Catharine
    Lalani, Benjamin
    Wolf, Risa M.
    Fitipaldi, Hugo
    Gomez, Maria F.
    Mathioudakis, Nestoras
    [J]. JOURNAL OF DIABETES SCIENCE AND TECHNOLOGY, 2024, : 273 - 286