The Role of Data Pre-processing Techniques in Improving Machine Learning Accuracy for Predicting Coronary Heart Disease

被引:0
|
作者
Sami, Osamah [1 ]
Elsheikh, Yousef [1 ]
Almasalha, Fadi [1 ]
机构
[1] Appl Sci Private Univ, Fac Informat Technol, Amman 11931, Jordan
关键词
Coronary heart disease; heart; machine learning; data preprocessing; classification technique; DIAGNOSIS;
D O I
10.14569/IJACSA.2021.0120695
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
These days, in light of the rapid developments, people work day and night to live at a good level. This often causes them to not pay much attention to a healthy lifestyle, such as what they eat or even what physical activities they do. These people are often the most likely to suffer from coronary heart disease. The heart is a small organ responsible for pumping oxygen-rich blood to the rest of the human body through the coronary arteries. Accordingly, any blockage or narrowing in one of these coronary arteries may cause blood not to be pumped to the heart and from it to the rest of the body, and thus cause what is known as heart attacks. From here, the importance of early prediction of coronary heart disease has emerged, as it can help these people change their lifestyle and eating habits to become healthier and thus prevent coronary heart disease and avoid death. This paper improve the accuracy of machine learning techniques in predicting coronary heart disease using data preprocessing techniques. Data preprocessing is a technique used to improve the efficiency of a machine learning model by improving the quality of the feature. The popular Framingham Heart Study dataset was used for validation purposes. The results of the research paper indicate that the use of data preprocessing techniques had a role in improving the predictive accuracy of poorly efficient classifiers, and shows satisfactory performance in determining the risk of coronary heart disease. For example, the Decision Tree classifier led to a predictive accuracy of coronary heart disease of 91.39% with an increase of 1.39% over the previous work, the Random Forest classifier led to a predictive accuracy of 92.80% with an increase of 2.7% over the previous work, the K-Nearest Neighbor classifier led to a predictive accuracy of 92.68% with an increase of 2.58% over the previous work, the Multilayer Perceptron Neural Network (MLP) classifier led to a predictive accuracy of 92.64% with an increase of 2.64% over the previous work, and the Na<spacing diaeresis>ive Bayes classifier led to a predictive accuracy of 90.56% with an increase of 0.66% over the previous work.
引用
收藏
页码:812 / 820
页数:9
相关论文
共 50 条
  • [1] Impact of applying pre-processing techniques for improving classification accuracy
    Sharmila, T. Sree
    Ramar, K.
    Raja, T. Sree Renga
    SIGNAL IMAGE AND VIDEO PROCESSING, 2014, 8 (01) : 149 - 157
  • [2] Impact of applying pre-processing techniques for improving classification accuracy
    T. Sree Sharmila
    K. Ramar
    T. Sree Renga Raja
    Signal, Image and Video Processing, 2014, 8 : 149 - 157
  • [3] Comparative Study of Machine Learning Techniques for Pre-processing of Network Intrusion Data
    Rahat, Faiza
    Ahsan, Syed Nadeem
    2015 INTERNATIONAL CONFERENCE ON OPEN SOURCE SYSTEMS & TECHNOLOGIES (ICOSST), 2015, : 46 - 51
  • [4] Review of Data Pre-processing Techniques and Machine Learning in PTR-MS
    Sun Y.
    Chen Y.-B.
    Chu M.-J.
    Jiang X.-H.
    Wang Y.
    Guo B.-Q.
    2018, Chinese Society for Mass Spectrometry (39) : 513 - 523
  • [5] An evaluation of various data pre-processing techniques with machine learning models for water level prediction
    Ervin Shan Khai Tiu
    Yuk Feng Huang
    Jing Lin Ng
    Nouar AlDahoul
    Ali Najah Ahmed
    Ahmed Elshafie
    Natural Hazards, 2022, 110 : 121 - 153
  • [6] An evaluation of various data pre-processing techniques with machine learning models for water level prediction
    Tiu, Ervin Shan Khai
    Huang, Yuk Feng
    Ng, Jing Lin
    AlDahoul, Nouar
    Ahmed, Ali Najah
    Elshafie, Ahmed
    NATURAL HAZARDS, 2022, 110 (01) : 121 - 153
  • [7] Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison
    André Pfob
    Sheng-Chieh Lu
    Chris Sidey-Gibbons
    BMC Medical Research Methodology, 22
  • [8] Current breathomics-a review on data pre-processing techniques and machine learning in metabolomics breath analysis
    Smolinska, A.
    Hauschild, A-Ch
    Fijten, R. R. R.
    Dallinga, J. W.
    Baumbach, J.
    van Schooten, F. J.
    JOURNAL OF BREATH RESEARCH, 2014, 8 (02)
  • [9] Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison
    Pfob, Andre
    Lu, Sheng-Chieh
    Sidey-Gibbons, Chris
    BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [10] Optimizing Machine Learning Data Pre-Processing for Financial Fraud Detection
    Bower, Matthew
    Godasu, Rajesh
    Nyakundi, Nicholas
    Reynolds, Shawn
    2024 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY, EIT 2024, 2024, : 28 - 37