The Role of Data Pre-processing Techniques in Improving Machine Learning Accuracy for Predicting Coronary Heart Disease

被引:0
|
作者
Sami, Osamah [1 ]
Elsheikh, Yousef [1 ]
Almasalha, Fadi [1 ]
机构
[1] Appl Sci Private Univ, Fac Informat Technol, Amman 11931, Jordan
关键词
Coronary heart disease; heart; machine learning; data preprocessing; classification technique; DIAGNOSIS;
D O I
10.14569/IJACSA.2021.0120695
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
These days, in light of the rapid developments, people work day and night to live at a good level. This often causes them to not pay much attention to a healthy lifestyle, such as what they eat or even what physical activities they do. These people are often the most likely to suffer from coronary heart disease. The heart is a small organ responsible for pumping oxygen-rich blood to the rest of the human body through the coronary arteries. Accordingly, any blockage or narrowing in one of these coronary arteries may cause blood not to be pumped to the heart and from it to the rest of the body, and thus cause what is known as heart attacks. From here, the importance of early prediction of coronary heart disease has emerged, as it can help these people change their lifestyle and eating habits to become healthier and thus prevent coronary heart disease and avoid death. This paper improve the accuracy of machine learning techniques in predicting coronary heart disease using data preprocessing techniques. Data preprocessing is a technique used to improve the efficiency of a machine learning model by improving the quality of the feature. The popular Framingham Heart Study dataset was used for validation purposes. The results of the research paper indicate that the use of data preprocessing techniques had a role in improving the predictive accuracy of poorly efficient classifiers, and shows satisfactory performance in determining the risk of coronary heart disease. For example, the Decision Tree classifier led to a predictive accuracy of coronary heart disease of 91.39% with an increase of 1.39% over the previous work, the Random Forest classifier led to a predictive accuracy of 92.80% with an increase of 2.7% over the previous work, the K-Nearest Neighbor classifier led to a predictive accuracy of 92.68% with an increase of 2.58% over the previous work, the Multilayer Perceptron Neural Network (MLP) classifier led to a predictive accuracy of 92.64% with an increase of 2.64% over the previous work, and the Na<spacing diaeresis>ive Bayes classifier led to a predictive accuracy of 90.56% with an increase of 0.66% over the previous work.
引用
收藏
页码:812 / 820
页数:9
相关论文
共 50 条
  • [21] Visualization Techniques on the Examination Timetabling Pre-processing Data
    Thomas, J. Joshua
    Khader, Ahamad Tajudin
    Belaton, Bahari
    PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION, 2009, : 454 - 458
  • [22] Predicting congenital heart disease using machine learning techniques
    Rani, Samta
    Masood, Sarfaraz
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (01): : 293 - 303
  • [23] Data Pre-processing Techniques for Publication Performance Analysis
    Zulkepli, Fatin Shahirah
    Ibrahin, Roliana
    Saeed, Faisal
    RECENT TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2018, 5 : 59 - 65
  • [24] Survey of Pre-processing Techniques for Mining Big Data
    Hariharakrishnan, Jayaram
    Mohanavalli, S.
    Srividya
    Kumar, Sundhara K. B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 77 - 81
  • [25] Insights into enhanced machine learning techniques for surface water quantity and quality prediction based on data pre-processing algorithms
    Panahi, Javad
    Mastouri, Reza
    Shabanlou, Saeid
    JOURNAL OF HYDROINFORMATICS, 2022, : 875 - 897
  • [26] A data pre-processing method to increase efficiency and accuracy in data mining
    Razavi, AR
    Gill, H
    Åhlfeldt, H
    Shahsavar, N
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2005, 3581 : 434 - 443
  • [27] Exploring Machine Learning Techniques for Coronary Heart Disease Prediction
    Khdair, Hisham
    Dasari, Naga M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (05) : 28 - 36
  • [28] Fetal heart rate data pre-processing and annotation
    Chudacek, Vaclav
    Huptych, Michal
    Koucky, Michal
    Spilka, Jiri
    Bauer, Lukas
    Lhotska, Lenka
    2009 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS IN BIOMEDICINE, 2009, : 385 - +
  • [29] Semantic Data Pre-Processing for Machine Learning Based Bankruptcy Prediction Computational Model
    Yerashenia, Natalia
    Bolotov, Alexander
    Chan, David
    Pierantoni, Gabriele
    2020 IEEE 22ND CONFERENCE ON BUSINESS INFORMATICS (CBI 2020), VOL I - RESEARCH PAPERS, 2020, : 66 - 75
  • [30] A survey on pre-processing and post-processing techniques in data mining
    Tomar, Divya
    Agarwal, Sonali
    International Journal of Database Theory and Application, 2014, 7 (04): : 99 - 128