Analyzing Predictive Algorithms in Data Mining for Cardiovascular Disease using WEKA Tool

被引:0
|
作者
Aman [1 ]
Chhillar, Rajender Singh [1 ]
机构
[1] Maharshi Dayanand Univ, Dept Comp Sci & Applicat, Rohtak, Haryana, India
关键词
Logistic regression (LR); support vector machine (SVM); Statlog; Cleveland; WEKA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cardiovascular Disease (CVD) is the foremost cause of death worldwide that generates a high percentage of Electronic Health Records (EHRs). Analyzing these complex patterns from EHRs is a tedious process. To address this problem, Medical Institutions requires effective Predictive Algorithms for the Prognosis and Diagnosis of the Patients. Under this work, the current state-of-the-art studied to identify leading Predictive Algorithms. Further, these algorithms namely Support Vector Machine (SVM), Naive Bayes (NB), Decision Tree (DT), Random Forest (RF), Artificial Neural Network (ANN), Logistic Regression (LR), AdaBoost and k-Nearest Neighbors (k-NN) analyzed against the two datasets on open-source WEKA software. This work used two similar structured datasets i.e., Statlog Dataset and Cleveland Dataset. For Pre-Processing of Datasets, The missing values were replaced with the Mean value and later 10 Fold Cross-Validation was utilized for the evaluation. The result of the performance analysis showed that SVM outperforms other algorithms against both datasets. SVM showed an accuracy of 84.156% against the Cleveland dataset and 84.074% against the Statlog dataset. LR showed a ROC Area of 0.9 against both datasets. The findings of the work will help Health Institutions to understand the importance and usage of Predictive Algorithms for the automatic prediction of CVD based on the symptoms.
引用
收藏
页码:144 / 150
页数:7
相关论文
共 50 条
  • [31] Data mining predictive algorithms for estimating soil water content
    Emami, Somayeh
    Rezaverdinejad, Vahid
    Dehghanisanij, Hossein
    Emami, Hojjat
    Elbeltagi, Ahmed
    SOFT COMPUTING, 2024, 28 (06) : 4915 - 4931
  • [32] Rules for comparing predictive data mining algorithms by error rate
    D. Koliastasis
    D.K. Despotis
    OPSEARCH, 2004, 41 (3) : 178 - 187
  • [33] Data mining predictive algorithms for estimating soil water content
    Somayeh Emami
    Vahid Rezaverdinejad
    Hossein Dehghanisanij
    Hojjat Emami
    Ahmed Elbeltagi
    Soft Computing, 2024, 28 : 4915 - 4931
  • [34] Disease prediction in data mining using association rule mining and keyword based clustering algorithms
    Ramasamy S.
    Nirmala K.
    International Journal of Computers and Applications, 2020, 42 (01) : 1 - 8
  • [35] A survey of data mining algorithms used in cardiovascular disease diagnosis from multi-lead ECG data
    Moses, Diana
    Deisy, C.
    KUWAIT JOURNAL OF SCIENCE, 2015, 42 (02) : 206 - 235
  • [36] Analyzing Athletes' Physical Performance and Trends in Athletics Competitions Using Time Series Data Mining Algorithms
    Ding, Yi
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (09) : 736 - 746
  • [37] Comparison of Data Mining Algorithms for Predicting the Cancer Disease Using Python']Python
    Mehdi, Mehtab
    Pahwa, Kanika
    Sharma, Bharti
    PROCEEDINGS OF THE 2019 8TH INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART-2019), 2019, : 155 - 160
  • [38] A new data mining tool for analyzing Coumarin-based prodrugs
    Fang, H
    Li, J
    Sun, Y
    Wang, B
    Zhang, YQ
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY VI, 2004, 5433 : 142 - 152
  • [39] Predictive security model using data mining
    Alampalayam, SP
    Kumar, A
    GLOBECOM '04: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6, 2004, : 2208 - 2212
  • [40] Predictive Analytics Using Data Mining Technique
    Gulati, Hina
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 713 - 716