The Role of Data Pre-processing Techniques in Improving Machine Learning Accuracy for Predicting Coronary Heart Disease

被引:0
|
作者
Sami, Osamah [1 ]
Elsheikh, Yousef [1 ]
Almasalha, Fadi [1 ]
机构
[1] Appl Sci Private Univ, Fac Informat Technol, Amman 11931, Jordan
关键词
Coronary heart disease; heart; machine learning; data preprocessing; classification technique; DIAGNOSIS;
D O I
10.14569/IJACSA.2021.0120695
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
These days, in light of the rapid developments, people work day and night to live at a good level. This often causes them to not pay much attention to a healthy lifestyle, such as what they eat or even what physical activities they do. These people are often the most likely to suffer from coronary heart disease. The heart is a small organ responsible for pumping oxygen-rich blood to the rest of the human body through the coronary arteries. Accordingly, any blockage or narrowing in one of these coronary arteries may cause blood not to be pumped to the heart and from it to the rest of the body, and thus cause what is known as heart attacks. From here, the importance of early prediction of coronary heart disease has emerged, as it can help these people change their lifestyle and eating habits to become healthier and thus prevent coronary heart disease and avoid death. This paper improve the accuracy of machine learning techniques in predicting coronary heart disease using data preprocessing techniques. Data preprocessing is a technique used to improve the efficiency of a machine learning model by improving the quality of the feature. The popular Framingham Heart Study dataset was used for validation purposes. The results of the research paper indicate that the use of data preprocessing techniques had a role in improving the predictive accuracy of poorly efficient classifiers, and shows satisfactory performance in determining the risk of coronary heart disease. For example, the Decision Tree classifier led to a predictive accuracy of coronary heart disease of 91.39% with an increase of 1.39% over the previous work, the Random Forest classifier led to a predictive accuracy of 92.80% with an increase of 2.7% over the previous work, the K-Nearest Neighbor classifier led to a predictive accuracy of 92.68% with an increase of 2.58% over the previous work, the Multilayer Perceptron Neural Network (MLP) classifier led to a predictive accuracy of 92.64% with an increase of 2.64% over the previous work, and the Na<spacing diaeresis>ive Bayes classifier led to a predictive accuracy of 90.56% with an increase of 0.66% over the previous work.
引用
收藏
页码:812 / 820
页数:9
相关论文
共 50 条
  • [31] Object Pre-processing using Motion Stabilization and Key Frame Extraction with Machine Learning Techniques
    Archana, Kande
    Prasad, V. Kamakshi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 148 - 157
  • [32] Efficient Dengue Spread Prediction Using Machine Learning Models with Various Pre-processing Techniques
    Saraswathi, K.
    Rohini, K.
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [33] Machine learning classifiers with pre-processing techniques for rumour detection on social media: an empirical study
    Al-Sarem M.
    Al-Harby M.
    Saeed F.
    Hezzam E.A.
    International Journal of Cloud Computing, 2022, 11 (04) : 330 - 344
  • [34] Analysis of Different Pre-Processing Techniques to the Development of Machine Learning Predictors with Gene Expression Profiles
    Duran, Ian
    Leandro, Roberto
    Guevara-Coto, Jose
    IV JORNADAS COSTARRICENSES DE INVESTIGACION EN COMPUTACION E INFORMATICA (JOCICI 2019), 2019,
  • [35] Comparative Analysis of Machine Learning Algorithms and Data Mining Techniques for Predicting the Existence of Heart Disease
    Alotaibi, Nourah
    Alzahrani, Mona
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 810 - 818
  • [36] EVALUATION OF THE IMPACT OF THE PRE-PROCESSING OF DATA ON THE EFFECTIVENESS AND ACCURACY OF SVM
    Cisty, Milan
    Bezak, Juraj
    Bajtek, Zbynek
    GEOCONFERENCE ON WATER RESOURCES, FOREST, MARINE AND OCEAN ECOSYSTEMS, 2013, : 141 - 147
  • [37] Improving Coronary Heart Disease Prediction Through Machine Learning and an Innovative Data Augmentation Technique
    Abdulrakeeb M. Al-Ssulami
    Randh S. Alsorori
    Aqil M. Azmi
    Hatim Aboalsamh
    Cognitive Computation, 2023, 15 : 1687 - 1702
  • [38] Improving Coronary Heart Disease Prediction Through Machine Learning and an Innovative Data Augmentation Technique
    Al-Ssulami, Abdulrakeeb M.
    Alsorori, Randh S.
    Azmi, Aqil M.
    Aboalsamh, Hatim
    COGNITIVE COMPUTATION, 2023, 15 (05) : 1687 - 1702
  • [39] Histogram-Based Image Pre-processing for Machine Learning
    Sada, Ayumi
    Kinoshita, Yuma
    Shiota, Sayaka
    Kiya, Hitoshi
    2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 272 - 275
  • [40] PRESISTANT: Learning based assistant for data pre-processing
    Bilalli, Besim
    Abello, Alberto
    Aluja-Banet, Tomas
    Wrembel, Robert
    DATA & KNOWLEDGE ENGINEERING, 2019, 123