Analyzing Predictive Algorithms in Data Mining for Cardiovascular Disease using WEKA Tool

被引:0
|
作者
Aman [1 ]
Chhillar, Rajender Singh [1 ]
机构
[1] Maharshi Dayanand Univ, Dept Comp Sci & Applicat, Rohtak, Haryana, India
关键词
Logistic regression (LR); support vector machine (SVM); Statlog; Cleveland; WEKA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cardiovascular Disease (CVD) is the foremost cause of death worldwide that generates a high percentage of Electronic Health Records (EHRs). Analyzing these complex patterns from EHRs is a tedious process. To address this problem, Medical Institutions requires effective Predictive Algorithms for the Prognosis and Diagnosis of the Patients. Under this work, the current state-of-the-art studied to identify leading Predictive Algorithms. Further, these algorithms namely Support Vector Machine (SVM), Naive Bayes (NB), Decision Tree (DT), Random Forest (RF), Artificial Neural Network (ANN), Logistic Regression (LR), AdaBoost and k-Nearest Neighbors (k-NN) analyzed against the two datasets on open-source WEKA software. This work used two similar structured datasets i.e., Statlog Dataset and Cleveland Dataset. For Pre-Processing of Datasets, The missing values were replaced with the Mean value and later 10 Fold Cross-Validation was utilized for the evaluation. The result of the performance analysis showed that SVM outperforms other algorithms against both datasets. SVM showed an accuracy of 84.156% against the Cleveland dataset and 84.074% against the Statlog dataset. LR showed a ROC Area of 0.9 against both datasets. The findings of the work will help Health Institutions to understand the importance and usage of Predictive Algorithms for the automatic prediction of CVD based on the symptoms.
引用
收藏
页码:144 / 150
页数:7
相关论文
共 50 条
  • [41] Assortment Planning Using Data Mining Algorithms
    Guen, Ajlan Nihat
    Badur, Bertan
    2008 PORTLAND INTERNATIONAL CONFERENCE ON MANAGEMENT OF ENGINEERING & TECHNOLOGY, VOLS 1-5, 2008, : 2312 - 2322
  • [42] Data Mining Algorithms explained using R
    Basu, Sumanta
    BIOMETRICS, 2018, 74 (04) : 1519 - 1520
  • [43] Development of Data Mining Algorithms for Identifying the Best Anthropometric Predictors for Cardiovascular Disease: MASHAD Cohort Study
    Mansoori, Amin
    Hosseini, Zeinab Sadat
    Ahari, Rana Kolahi
    Poudineh, Mohadeseh
    Rad, Elias Sadooghi
    Zo, Mostafa Mahmoudi
    Izadi, Faezeh Salmani
    Hoseinpour, Mahdieh
    Miralizadeh, Amirreza
    Mashhadi, Yalda Alizadeh
    Hormozi, Maryam
    Firoozeh, Mohadeseh Taj
    Hajhoseini, Omolbanin
    Ferns, Gordon
    Esmaily, Habibollah
    Mobarhan, Majid Ghayour
    HIGH BLOOD PRESSURE & CARDIOVASCULAR PREVENTION, 2023, 30 (03) : 243 - 253
  • [44] Development of Data Mining Algorithms for Identifying the Best Anthropometric Predictors for Cardiovascular Disease: MASHAD Cohort Study
    Amin Mansoori
    Zeinab Sadat Hosseini
    Rana Kolahi Ahari
    Mohadeseh Poudineh
    Elias Sadooghi Rad
    Mostafa Mahmoudi Zo
    Faezeh Salmani Izadi
    Mahdieh Hoseinpour
    Amirreza Miralizadeh
    Yalda Alizadeh Mashhadi
    Maryam Hormozi
    Mohadeseh Taj Firoozeh
    Omolbanin Hajhoseini
    Gordon Ferns
    Habibollah Esmaily
    Majid Ghayour Mobarhan
    High Blood Pressure & Cardiovascular Prevention, 2023, 30 : 243 - 253
  • [45] Accuracy Comparison of Predictive Algorithms of Data Mining: Application in Education Sector
    Sharma, Mamta
    Mavani, Monali
    ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL, 2011, 125 : 189 - 194
  • [46] Analyzing Different Domains using Data Mining Techniques
    Mandan, Nelshan
    Agrawal, Kanika
    Kumar, Sunny
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 99 - 104
  • [47] Using Data Mining for Analyzing Experiential Marketing in Blogs
    Chen, Fu-Mei
    Li, Yan-Ze
    Sheu, Jyh-Jian
    Yang, Wei-Pang
    JOURNAL OF INTERNET TECHNOLOGY, 2008, 9 (04): : 421 - 430
  • [48] Data mining for classification of power quality problems using WEKA and the effect of attributes on classification accuracy
    Asha Kiranmai S.
    Jaya Laxmi A.
    Protection and Control of Modern Power Systems, 2018, 3 (01)
  • [49] Evaluation of Predictive Data Mining Algorithms in Soil Data Classification for Optimized Crop Recommendation
    Arooj, Ansif
    Riaz, Mohsin
    Akram, Malik Naeem
    2018 INTERNATIONAL CONFERENCE ON ADVANCEMENTS IN COMPUTATIONAL SCIENCES (ICACS), 2018, : 8 - +
  • [50] Analyzing the real time electricity data using data mining techniques
    Aki, Aravindh
    Reddy, Krishna Mohan D.
    Reddy, Koushik Y.
    Kavitha, C. R.
    Sasikala, T.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), 2017, : 545 - 549