Machine learning-based prediction of diabetic patients using blood routine data

被引:0
|
作者
Li, Honghao [1 ]
Su, Dongqing [1 ]
Zhang, Xinpeng [1 ]
He, Yuanyuan [1 ]
Luo, Xu [1 ]
Xiong, Yuqiang [1 ]
Zou, Min [1 ]
Wei, Huiyan [2 ]
Wen, Shaoran [3 ]
Xi, Qilemuge [3 ]
Zuo, Yongchun [3 ,4 ]
Yang, Lei [1 ]
机构
[1] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin 150081, Peoples R China
[2] Harbin Med Univ, Biotechnol Expt Ctr, Harbin 150081, Peoples R China
[3] Inner Mongolia Univ, Coll Life Sci, State Key Lab Reprod Regulat & Breeding Grassland, Hohhot 010070, Peoples R China
[4] Inner Mongolia Int Mongolian Hosp, Hohhot 010065, Peoples R China
关键词
Diabetes; Blood routine test; Machine learning; Nomogram;
D O I
10.1016/j.ymeth.2024.07.001
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Diabetes stands as one of the most prevalent chronic diseases globally. The conventional methods for diagnosing diabetes are frequently overlooked until individuals manifest noticeable symptoms of the condition. This study aimed to address this gap by collecting comprehensive datasets, including 1000 instances of blood routine data from diabetes patients and an equivalent dataset from healthy individuals. To differentiate diabetes patients from their healthy counterparts, a computational framework was established, encompassing eXtreme Gradient Boosting (XGBoost), random forest, support vector machine, and elastic net algorithms. Notably, the XGBoost model emerged as the most effective, exhibiting superior predictive results with an area under the receiver operating characteristic curve (AUC) of 99.90% in the training set and 98.51% in the testing set. Moreover, the model showcased commendable performance during external validation, achieving an overall accuracy of 81.54%. The probability generated by the model serves as a risk score for diabetes susceptibility. Further interpretability was achieved through the utilization of the Shapley additive explanations (SHAP) algorithm, identifying pivotal indicators such as mean corpuscular hemoglobin concentration (MCHC), lymphocyte ratio (LY%), standard deviation of red blood cell distribution width (RDW-SD), and mean corpuscular hemoglobin (MCH). This enhances our understanding of the predictive mechanisms underlying diabetes. To facilitate the application in clinical and real-life settings, a nomogram was created based on the logistic regression algorithm, which can provide a preliminary assessment of the likelihood of an individual having diabetes. Overall, this research contributes valuable insights into the predictive modeling of diabetes, offering potential applications in clinical practice for more effective and timely diagnoses.
引用
收藏
页码:156 / 162
页数:7
相关论文
共 50 条
  • [21] Noninvasive prediction of Blood Lactate through a machine learning-based approach
    Shu-Chun Huang
    Richard Casaburi
    Ming-Feng Liao
    Kuo-Cheng Liu
    Yu-Jen Chen
    Tieh-Cheng Fu
    Hong-Ren Su
    [J]. Scientific Reports, 9
  • [22] Machine learning-based prediction of intraoperative hypoxemia for pediatric patients
    Park, Jung-Bin
    Lee, Ho-Jong
    Yang, Hyun-Lim
    Kim, Eun-Hee
    Lee, Hyung-Chul
    Jung, Chul-Woo
    Kim, Hee-Soo
    [J]. PLOS ONE, 2023, 18 (03):
  • [23] Machine learning-based prediction models for accidental hypothermia patients
    Okada, Yohei
    Matsuyama, Tasuku
    Morita, Sachiko
    Ehara, Naoki
    Miyamae, Nobuhiro
    Jo, Takaaki
    Sumida, Yasuyuki
    Okada, Nobunaga
    Watanabe, Makoto
    Nozawa, Masahiro
    Tsuruoka, Ayumu
    Fujimoto, Yoshihiro
    Okumura, Yoshiki
    Kitamura, Tetsuhisa
    Iiduka, Ryoji
    Ohtsuru, Shigeru
    [J]. JOURNAL OF INTENSIVE CARE, 2021, 9 (01)
  • [24] Machine learning-based prediction of mortality in pediatric trauma patients
    Deleon, M. P.
    Murula, A.
    Moreira, A.
    [J]. AMERICAN JOURNAL OF THE MEDICAL SCIENCES, 2024, 367 : S317 - S317
  • [25] Machine Learning-Based prediction of Post-Treatment ambulatory blood pressure in patients with hypertension
    Hae, Hyeonyong
    Kang, Soo-Jin
    Kim, Tae Oh
    Lee, Pil Hyung
    Lee, Seung-Whan
    Kim, Young-Hak
    Lee, Cheol Whan
    Park, Seong-Wook
    [J]. BLOOD PRESSURE, 2023, 32 (01)
  • [26] Prediction of inpatient pressure ulcers based on routine healthcare data using machine learning methodology
    Felix Walther
    Luise Heinrich
    Jochen Schmitt
    Maria Eberlein-Gonska
    Martin Roessler
    [J]. Scientific Reports, 12
  • [27] Prediction of inpatient pressure ulcers based on routine healthcare data using machine learning methodology
    Walther, Felix
    Heinrich, Luise
    Schmitt, Jochen
    Eberlein-Gonska, Maria
    Roessler, Martin
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [28] Machine Learning-Based Precipitation Prediction Using Cloud Properties
    Yakubu, Abdulaziz Tunde
    Abayomi, Abdultaofeek
    Chetty, Naven
    [J]. HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 243 - 252
  • [29] Prediction of Acute Myocardial Infarction Using a Machine Learning-Based Approach From Data at Admission
    Park, Ji Young
    Noh, Yungkyun
    Choi, Byoung Geol
    Rha, Seung Woon
    [J]. JACC-CARDIOVASCULAR INTERVENTIONS, 2020, 13 (04) : S13 - S13
  • [30] Machine Learning-Based Prediction of COVID-19 Prognosis Using Clinical and Hematologic Data
    Kamel, Fatemah O.
    Magadmi, Rania
    Qutub, Sulafah
    Badawi, Maha
    Badawi, Mazen
    Madani, Tariq A.
    Alhothali, Areej
    Abozinadah, Ehab A.
    Bakhshwin, Duaa M.
    Jamal, Maha H.
    Burzangi, Abdulhadi S.
    Bazuhair, Mohammed
    Alqutub, Hussamaldin
    Alqutub, Abdulaziz
    Felemban, Sameera M.
    Al-Sayes, Fatin
    Adam, Soheir
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (12)