Predicting Chronic Kidney Disease Using Hybrid Machine Learning Based on Apache Spark

被引:15
|
作者
Abdel-Fattah, Manal A. [1 ]
Othman, Nermin Abdelhakim [1 ,2 ]
Goher, Nagwa [1 ,3 ]
机构
[1] Helwan Univ, Fac Comp & Artificial Intelligence, Dept Informat Syst, Helwan, Egypt
[2] British Univ, Fac Informat & Comp Sci, Cairo, Egypt
[3] Nahda Univ Beni Suef, Fac Comp Sci, Dept Informat Syst, Bani Suwayf, Egypt
关键词
BIG DATA;
D O I
10.1155/2022/9898831
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Chronic kidney disease (CKD) has become a widespread disease among people. It is related to various serious risks like cardiovascular disease, heightened risk, and end-stage renal disease, which can be feasibly avoidable by early detection and treatment of people in danger of this disease. The machine learning algorithm is a source of significant assistance for medical scientists to diagnose the disease accurately in its outset stage. Recently, Big Data platforms are integrated with machine learning algorithms to add value to healthcare. Therefore, this paper proposes hybrid machine learning techniques that include feature selection methods and machine learning classification algorithms based on big data platforms (Apache Spark) that were used to detect chronic kidney disease (CKD). The feature selection techniques, namely, Relief-F and chi-squared feature selection method, were applied to select the important features. Six machine learning classification algorithms were used in this research: decision tree (DT), logistic regression (LR), Naive Bayes (NB), Random Forest (RF), support vector machine (SVM), and Gradient-Boosted Trees (GBT Classifier) as ensemble learning algorithms. Four methods of evaluation, namely, accuracy, precision, recall, and F1-measure, were applied to validate the results. For each algorithm, the results of cross-validation and the testing results have been computed based on full features, the features selected by Relief-F, and the features selected by chi-squared feature selection method. The results showed that SVM, DT, and GBT Classifiers with the selected features had achieved the best performance at 100% accuracy. Overall, Relief-F's selected features are better than full features and the features selected by chi-square.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] A machine learning-based early diagnosis model for chronic kidney disease using SPegasos
    Norouzi, Monire
    Kahriman, Elif Altintas
    [J]. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01):
  • [42] Research on Visual Machine Learning Algorithms Based on Apache Spark in Big Data Environment
    Wang, Jialin
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 124 : 144 - 144
  • [43] Machine Learning for Dynamically Predicting the Onset of Renal Replacement Therapy in Chronic Kidney Disease Patients Using Claims Data
    Lopez-Martinez, Daniel
    Chen, Christina
    Chen, Ming-Jun
    [J]. APPLICATIONS OF MEDICAL ARTIFICIAL INTELLIGENCE, AMAI 2022, 2022, 13540 : 18 - 28
  • [44] Predicting the Progression of Chronic Kidney Disease: A Systematic Review of Artificial Intelligence and Machine Learning Approaches
    Khalid, Fizza
    Alsadoun, Lara
    Khilji, Faria
    Mushtaq, Maham
    Eze-odurukwe, Anthony
    Mushtaq, Muhammad Muaz
    Ali, Husnain
    Farman, Rana Omer
    Ali, Syed Momin
    Fatima, Rida
    Bokhari, Syed Faqeer Hussain
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (05)
  • [45] A machine learning driven monogram for predicting chronic kidney disease stages 3-5
    Ghosh, Samit Kumar
    Khandoker, Ahsan H.
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [46] Amharic based Knowledge-Based System for Diagnosis and Treatment of Chronic Kidney Disease using Machine Learning
    Mohammed, Siraj
    Beshah, Tibebe
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 252 - 260
  • [47] Machine learning to predict end stage kidney disease in chronic kidney disease
    Bai, Qiong
    Su, Chunyan
    Tang, Wen
    Li, Yike
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [48] Machine learning to predict end stage kidney disease in chronic kidney disease
    Qiong Bai
    Chunyan Su
    Wen Tang
    Yike Li
    [J]. Scientific Reports, 12
  • [49] Performance evaluation of DNN with other machine learning techniques in a cluster using Apache Spark and MLlib
    JayaLakshmi, A. N. M.
    Kishore, K. V. Krishna
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (01) : 1311 - 1319
  • [50] Large-Scale Music Genre Analysis and Classification Using Machine Learning with Apache Spark
    Chaudhury, Mousumi
    Karami, Amin
    Ghazanfar, Mustansar Ali
    [J]. ELECTRONICS, 2022, 11 (16)