Predicting the Risk of Chronic Kidney Disease (CKD) Using Machine Learning Algorithm

被引:27
|
作者
Wang, Weilun [1 ]
Chakraborty, Goutam [1 ,2 ]
Chakraborty, Basabi [1 ]
机构
[1] Iwate Prefectural Univ, Fac Software Informat Sci, Takizawa, Iwate 0200693, Japan
[2] Sendai Fdn Appl Informat Sci, Sendai, Miyagi 9800012, Japan
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 01期
关键词
chronic kidney disease; creatine; ensemble learning; regression; unbalanced data;
D O I
10.3390/app11010202
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: Creatinine is a type of metabolite of blood that is strongly correlated to glomerular filtration rate (GFR). As measuring GFR is difficult, creatinine value is used for indirectly determining GFR and then the stage of chronic kidney disease (CKD). Adding a creatinine test into routine health examination could detect CKD. As more items for comprehensive examination means higher cost, creatinine testing is not included in the routine health examination in many countries. An algorithm based on common test results, without creatinine test, to evaluate the risk of CKD will increase the chance of its early detection and treatment. Methods: In this study, we used open source data containing 1 million samples. These data contain 23 health-related features, including common diagnostic test results provided by National Health Insurance Sharing Service (NHISS). A low GFR indicates possible chronic kidney disease (CKD). As is commonly accepted in the medical community, a GFR of 60 mL/min is used as the threshold, below which is considered to have CKD. In this study, the first step aims to build a regression model to predict the value of creatinine from 23 features, and then combine the predicted value of creatinine with the original 23 features to evaluate the risk of CKD. We will show by simulation that by the proposed method we can achieve better prediction results compared to direct prediction from 23 features. The data is extremely unbalanced for predicting the target variable creatinine. We used undersampling method and proposed a new cost-sensitive mean-squared error (MSE) loss function to deal with the problem. Regrading model selection, this work used three machine learning models: a bagging tree model named Random Forest, a boosting tree model named XGBoost, and a neural network based model named ResNet. To improve the result of the creatinine predictor, we averaged results from eight predictors, a method known as ensemble learning. Finally, the predicted creatinine and the original 23 features is used to predict the risk of CKD. Results: We optimized results of R-Squared (R2) value to select the appropriate undersampling strategy and the regression model for the regression stage of creatinine prediction. Ensembled model achieved the best performance of R2 of 0.5590. The six factors from 23 are selected from the top of the list of how strongly they affect the creatinine value. They are sex, age, hemoglobin, the level of urine protein, waist circumference, and habit of smoking. Using the predicted value of creatinine, an area under Receiver Operating Characteristic curve (AUC) of 0.76 is achieved while classifying samples for CKD. Conclusions: Using commonly available health parameters, the proposed system can assess the risk of CKD for public health. High-risk subjects can be screened and advised to take a creatinine test for further confirmation. In this way, we can reduce the impact of CKD on public health and facilitate early detection for many, where a blanket test of creatinine is not available for all.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [1] Predicting Chronic Kidney Disease Using Machine Learning Algorithms
    Farjana, Afia
    Liza, Fatema Tabassum
    Pandit, Parth Pratim
    Das, Madhab Chandra
    Hasan, Mahadi
    Tabassum, Fariha
    Hossen, Md. Helal
    [J]. 2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 1267 - 1271
  • [2] Interpretable machine learning for predicting chronic kidney disease progression risk
    Zheng, Jin-Xin
    Li, Xin
    Zhu, Jiang
    Guan, Shi-Yang
    Zhang, Shun-Xian
    Wang, Wei-Ming
    [J]. DIGITAL HEALTH, 2024, 10
  • [3] Classification and Detection of Chronic Kidney Disease (CKD) Using Machine Learning Algorithms
    Abuomar, O.
    Sogbe, P.
    [J]. INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 18 - 25
  • [4] Implementing a Machine-Learning Model to Predict Risk of Chronic Kidney Disease (CKD) Progression
    Tangri, Navdeep
    Singh, Rakesh
    Betts, Keith A.
    Du, Yuxian
    Gao, Sophie
    Katta, Arvind
    Farag, Youssef
    Fatoba, Samuel T.
    Liu, Hongjiao
    Chen, Jingyi
    Ferguson, Thomas
    Whitlock, Reid
    Leon, Silvia J.
    Singh, Ajay K.
    [J]. DIABETES, 2024, 73
  • [5] Chronic Kidney Disease Prediction using Machine Learning Ensemble Algorithm
    Nikhila
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, AND INTELLIGENT SYSTEMS (ICCCIS), 2021, : 476 - 480
  • [6] Predicting early risk of chronic kidney disease in cats using routine clinical laboratory tests and machine learning
    Bradley, Richard
    Tagkopoulos, Ilias
    Kim, Minseung
    Kokkinos, Yiannis
    Panagiotakos, Theodoros
    Kennedy, James
    De Meyer, Geert
    Watson, Phillip
    Elliott, Jonathan
    [J]. JOURNAL OF VETERINARY INTERNAL MEDICINE, 2019, 33 (06) : 2644 - 2656
  • [7] Predicting Chronic Kidney Disease Using Hybrid Machine Learning Based on Apache Spark
    Abdel-Fattah, Manal A.
    Othman, Nermin Abdelhakim
    Goher, Nagwa
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [8] Chronic Kidney Disease Prediction Using Machine Learning
    Kaur, Chamandeep
    Kumar, M. Sunil
    Anjum, Afsana
    Binda, M. B.
    Mallu, Maheswara Reddy
    Al Ansari, Mohammed Saleh
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (02) : 384 - 391
  • [9] Machine Learning Techniques for Chronic Kidney Disease Risk Prediction
    Dritsas, Elias
    Trigka, Maria
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (03)
  • [10] Predicting cardiovascular mortality in chronic kidney disease (CKD) patients
    Sun, Wenhui
    Liu, Dahai
    Gong, Ping
    Shi, Xiaoyu
    Wang, Yong
    Wang, Ping
    Gong, Weihua
    [J]. ANNALS OF TRANSPLANTATION, 2014, 19