Prediction of Heart Attacks using Data Mining Techniques

被引:1
|
作者
Abdelghani, Bassam A. [1 ]
Fadal, Sophia [1 ]
Bedoor, Shadi [1 ]
Banitaan, Shadi [1 ]
机构
[1] Univ Detroit Mercy, Dept Econ, Dept Elect & Comp Engn & Comp Sci, Detroit, MI 48221 USA
关键词
Data Mining; Heart Attack; Logistic Regression; Random Forest; eXtreme Gradient Boosting; ALGORITHM;
D O I
10.1109/ICMLA55696.2022.00159
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining tools have proven to be highly effective in recognizing and predicting early signs of heart disease, especially in countries where doctors lack extensive cardiovascular knowledge. This study introduces a new comprehensive combined dataset consisting of Cleveland, Hungarian, Swiss, Long Beach VA, and Stalog (Heart) datasets. The combined dataset contains 11 unique attributes and 918 observations. A total of eight different algorithms were used to analyze this newly combined dataset, including Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbor, Gradient Boosting, AdaBoosting, and XGBoosting. Using the Cleveland data set alone, the best prediction result was obtained by the LR algorithm with an accuracy of 82.6%, while the RF algorithm provided the best prediction accuracy of 89.9% using the combined dataset. In addition, we have identified the five most important features of the combined dataset using a deep exploratory data analysis. As a result of refining our dataset to include only the most relevant features, the XGB algorithm provided the highest prediction accuracy of 89.1%. Even though this decreases accuracy by 0.8%, it reduces time and money spent on non-essential tests.
引用
收藏
页码:951 / 956
页数:6
相关论文
共 50 条
  • [1] HEART DISEASE PREDICTION USING DATA MINING TECHNIQUES
    Rairikar, Abhishek
    Kulkarni, Vedant
    Sabale, Vikas
    Kale, Harshavardhan
    Lamgunde, Anuradha
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL (I2C2), 2017,
  • [2] Prediction of heart disease using data mining techniques
    Ritika Chadha
    Shubhankar Mayank
    [J]. CSI Transactions on ICT, 2016, 4 (2-4) : 193 - 198
  • [3] Heart disease Prediction Using Data Mining Techniques
    [J]. 1600, Institute of Electrical and Electronics Engineers Inc.
  • [4] Chronic Heart Disease Prediction Using Data Mining Techniques
    Nalluri, Sravani
    Saraswathi, R. Vijaya
    Ramasubbareddy, Somula
    Govinda, K.
    Swetha, E.
    [J]. DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 903 - 912
  • [5] Human Heart Disease Prediction System using Data Mining Techniques
    Princy, Theresa R.
    Thomas, J.
    [J]. PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT 2016), 2016,
  • [6] Intelligent heart disease prediction system using data mining techniques
    Palaniappan, Sellappan
    Awang, Raflah
    [J]. 2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 108 - 115
  • [7] Prediction of Heart Disease Using Classification Based Data Mining Techniques
    Joshi, Sujata
    Nair, Mydhili K.
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 503 - 511
  • [8] Hybrid Approach for Heart Disease Prediction Using Data Mining Techniques
    Tarawneh, Monther
    Embarak, Ossama
    [J]. ADVANCES IN INTERNET, DATA AND WEB TECHNOLOGIES, 2019, 29 : 447 - 454
  • [9] Intelligent Heart Disease Prediction System Using Data Mining Techniques
    Palaniappan, Sellappan
    Awang, Rafiah
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (08): : 343 - 350
  • [10] Analysis of Data Mining Techniques for Heart Disease Prediction
    Sultana, Marjia
    Haider, Afrin
    Uddin, Mohammad Shorif
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT), 2016,