The Role of Data Pre-processing Techniques in Improving Machine Learning Accuracy for Predicting Coronary Heart Disease

被引:0
|
作者
Sami, Osamah [1 ]
Elsheikh, Yousef [1 ]
Almasalha, Fadi [1 ]
机构
[1] Appl Sci Private Univ, Fac Informat Technol, Amman 11931, Jordan
关键词
Coronary heart disease; heart; machine learning; data preprocessing; classification technique; DIAGNOSIS;
D O I
10.14569/IJACSA.2021.0120695
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
These days, in light of the rapid developments, people work day and night to live at a good level. This often causes them to not pay much attention to a healthy lifestyle, such as what they eat or even what physical activities they do. These people are often the most likely to suffer from coronary heart disease. The heart is a small organ responsible for pumping oxygen-rich blood to the rest of the human body through the coronary arteries. Accordingly, any blockage or narrowing in one of these coronary arteries may cause blood not to be pumped to the heart and from it to the rest of the body, and thus cause what is known as heart attacks. From here, the importance of early prediction of coronary heart disease has emerged, as it can help these people change their lifestyle and eating habits to become healthier and thus prevent coronary heart disease and avoid death. This paper improve the accuracy of machine learning techniques in predicting coronary heart disease using data preprocessing techniques. Data preprocessing is a technique used to improve the efficiency of a machine learning model by improving the quality of the feature. The popular Framingham Heart Study dataset was used for validation purposes. The results of the research paper indicate that the use of data preprocessing techniques had a role in improving the predictive accuracy of poorly efficient classifiers, and shows satisfactory performance in determining the risk of coronary heart disease. For example, the Decision Tree classifier led to a predictive accuracy of coronary heart disease of 91.39% with an increase of 1.39% over the previous work, the Random Forest classifier led to a predictive accuracy of 92.80% with an increase of 2.7% over the previous work, the K-Nearest Neighbor classifier led to a predictive accuracy of 92.68% with an increase of 2.58% over the previous work, the Multilayer Perceptron Neural Network (MLP) classifier led to a predictive accuracy of 92.64% with an increase of 2.64% over the previous work, and the Na<spacing diaeresis>ive Bayes classifier led to a predictive accuracy of 90.56% with an increase of 0.66% over the previous work.
引用
收藏
页码:812 / 820
页数:9
相关论文
共 50 条
  • [41] Selective pre-processing of imbalanced data for improving classification performance
    Stefanowski, Jerzy
    Wilk, Szymon
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 283 - 292
  • [42] Improving the performance of data-driven techniques through data pre-processing for modelling daily reservoir inflow
    Jothiprakash, V.
    Kote, Alka S.
    HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2011, 56 (01): : 168 - 186
  • [43] Pre-processing SARS-CoV-2 Sequence Data for Application of Machine Learning Techniques for Visualization and Clustering of Virus Characteristics
    Kim, Juhyeon
    Ahn, Insung
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 663 - 669
  • [44] A Multi-purpose Data Pre-processing Framework using Machine Learning for Enterprise Data Models
    Ramana, Venkata B.
    Narsimha, G.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 646 - 656
  • [45] Pre-processing and Indexing Techniques for Constellation Queries in Big Data
    Khatibi, Amir
    Porto, Fabio
    Rittmeyer, Joao Guilherme
    Ogasawara, Eduardo
    Valduriez, Patrick
    Shasha, Dennis
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2017, 2017, 10440 : 164 - 172
  • [46] Predicting coronary heart disease in Chinese diabetics using machine learning
    Ma, Cai-Yi
    Luo, Ya-Mei
    Zhang, Tian-Yu
    Hao, Yu-Duo
    Xie, Xue-Qin
    Liu, Xiao-Wei
    Ren, Xiao-Lei
    He, Xiao-Lin
    Han, Yu-Mei
    Deng, Ke-Jun
    Yan, Dan
    Yang, Hui
    Tang, Hua
    Lin, Hao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [47] Evaluating Effects of Resting-State Electroencephalography Data Pre-Processing on a Machine Learning Task for Parkinson's Disease
    Vlieger, Robin
    Daskalaki, Elena
    Apthorp, Deborah
    Lueck, Christian J.
    Suominen, Hanna
    MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 1480 - 1481
  • [48] The generalizability of pre-processing techniques on the accuracy and fairness of data-driven building models: A case study
    Sun, Ying
    Fung, Benjamin C. M.
    Haghighat, Fariborz
    ENERGY AND BUILDINGS, 2022, 268
  • [49] Analysis of Benchmark Image Pre-Processing Techniques for Coronary Angiogram Images
    Kavipriya, K.
    Hiremath, Manjunatha
    2021 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY (ICITIIT), 2021,
  • [50] Deep Learning Framework With Essential Pre-Processing Techniques for Improving Mixed-Gas Concentration Prediction
    Eo, Moonjung
    Han, Jeongyun
    Rhee, Wonjong
    IEEE ACCESS, 2023, 11 : 25467 - 25479