Analyzing Predictive Algorithms in Data Mining for Cardiovascular Disease using WEKA Tool

被引:0
|
作者
Aman [1 ]
Chhillar, Rajender Singh [1 ]
机构
[1] Maharshi Dayanand Univ, Dept Comp Sci & Applicat, Rohtak, Haryana, India
关键词
Logistic regression (LR); support vector machine (SVM); Statlog; Cleveland; WEKA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cardiovascular Disease (CVD) is the foremost cause of death worldwide that generates a high percentage of Electronic Health Records (EHRs). Analyzing these complex patterns from EHRs is a tedious process. To address this problem, Medical Institutions requires effective Predictive Algorithms for the Prognosis and Diagnosis of the Patients. Under this work, the current state-of-the-art studied to identify leading Predictive Algorithms. Further, these algorithms namely Support Vector Machine (SVM), Naive Bayes (NB), Decision Tree (DT), Random Forest (RF), Artificial Neural Network (ANN), Logistic Regression (LR), AdaBoost and k-Nearest Neighbors (k-NN) analyzed against the two datasets on open-source WEKA software. This work used two similar structured datasets i.e., Statlog Dataset and Cleveland Dataset. For Pre-Processing of Datasets, The missing values were replaced with the Mean value and later 10 Fold Cross-Validation was utilized for the evaluation. The result of the performance analysis showed that SVM outperforms other algorithms against both datasets. SVM showed an accuracy of 84.156% against the Cleveland dataset and 84.074% against the Statlog dataset. LR showed a ROC Area of 0.9 against both datasets. The findings of the work will help Health Institutions to understand the importance and usage of Predictive Algorithms for the automatic prediction of CVD based on the symptoms.
引用
收藏
页码:144 / 150
页数:7
相关论文
共 50 条
  • [21] Data Mining for Cardiovascular Disease Prediction
    Bárbara Martins
    Diana Ferreira
    Cristiana Neto
    António Abelha
    José Machado
    Journal of Medical Systems, 2021, 45
  • [22] Data Mining for Cardiovascular Disease Prediction
    Martins, Barbara
    Ferreira, Diana
    Neto, Cristiana
    Abelha, Antonio
    Machado, Jose
    JOURNAL OF MEDICAL SYSTEMS, 2021, 45 (01)
  • [23] A Comparative Analysis of Data Mining Techniques on Breast Cancer Diagnosis Data using WEKA Toolbox
    Alshammari, Majdah
    Mezher, Mohammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (08) : 224 - 229
  • [24] Transportation data analyzing by using data mining method
    Luo, Qi
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 766 - 767
  • [25] A Predictive Model for Heart Disease Detection Using Data Mining Techniques
    Premsmith, Jakkrit
    Ketmaneechairat, Hathairat
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2021, 12 (01) : 14 - 20
  • [26] Predict Chronic Kidney Disease Using Data Mining Algorithms In Hadoop
    Kaur, Guneet
    Sharma, Er. Ajay
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 973 - 979
  • [27] Analyzing Machine Performance Using Data Mining
    Pospisil, Milan
    Bartik, Vladimir
    Hruska, Tomas
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [28] Comparison of Machine Learning Algorithms and Fruit Classification using Orange Data Mining Tool
    Vaishnav, Devashree
    Rao, B. Rama
    PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), 2018, : 603 - 607
  • [29] Prediction of mortality in patients with cardiovascular disease using data mining methods
    Imamovic, Damir
    Babovic, Elmir
    Bijedic, Nina
    2020 19TH INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2020,
  • [30] EVALUATION OF PREDICTIVE DATA MINING ALGORITHMS IN STUDENT ACADEMIC PERFORMANCE
    Jidagam, Rohith
    Rizk, Nouhad
    INTED2016: 10TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2016, : 6314 - 6324