Performance analysis of supervised classification models on heart disease prediction

被引:4
|
作者
Ogundepo, Ezekiel Adebayo [1 ]
Yahya, Waheed Babatunde [1 ]
机构
[1] Univ Ilorin, Dept Stat, Ilorin, Nigeria
关键词
Classifiers; Model selection; Feature selection; Exploratory data analysis; Evaluation metrics;
D O I
10.1007/s11334-022-00524-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents a predictive analysis of data on heart disease patients to determine the possible risk factors associated with their heart disease status. Two independent (but similar) published heart disease datasets, the Cleveland data (used to build classification models) and the Statlog data (used for results' validation), were considered for analysis. A detailed exploratory analysis using the Chi-square test of independence was performed on the Cleveland data after which ten standard classification models were trained for class prediction. The classification models were built by partitioning the Cleveland data randomly into 208 (70%) training samples and 89 (30%) test samples over 200 replications. Preliminary results showed that some of the bio-clinical categorical variables are strongly associated with the heart disease conditions of the patients (p < 0.001). The classification results from the test samples indicated that the support vector machine yielded the best predictive performances with 85% accuracy, 82% sensitivity, 88% specificity, 87% precision, 91% area under the ROC curve, and 38% log loss value. These results were validated on the Statlog data in tenfold cross-validation which were all consistent with those obtained from the Cleveland dataset.
引用
收藏
页码:129 / 144
页数:16
相关论文
共 50 条
  • [21] Comparison of Supervised Learning Models for the Prediction of Coronary Artery Disease
    Vasquez-Gonzaga, Hillary
    Gutierrez-Cardenas, Juan
    2021 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY, AIVR 2021, 2021, : 98 - 103
  • [22] Early prediction of heart disease with data analysis using supervised learning with stochastic gradient boosting
    Jawalkar A.P.
    Swetcha P.
    Manasvi N.
    Sreekala P.
    Aishwarya S.
    Kanaka Durga Bhavani P.
    Anjani P.
    Journal of Engineering and Applied Science, 2023, 70 (01):
  • [23] On using supervised clustering analysis to improve classification performance
    Gan, Haitao
    Huang, Rui
    Luo, Zhizeng
    Xi, Xugang
    Gao, Yunyuan
    INFORMATION SCIENCES, 2018, 454 : 216 - 228
  • [24] A Study on the Performance of Supervised Algorithms for Classification in Sentiment Analysis
    Sunitha, P. B.
    Joseph, Shelbi
    Akhil, P., V
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 1351 - 1356
  • [25] DISEASE CLASSIFICATION AND PREDICTION VIA SEMI-SUPERVISED DIMENSIONALITY REDUCTION
    Batmanghelich, Kayhan N.
    Ye, Dong H.
    Pohl, Kilian M.
    Taskar, Ben
    Davatzikos, Christos
    2011 8TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2011, : 1086 - 1090
  • [26] A novel method for prediction of skin disease through supervised classification techniques
    Meena, K.
    Veni, N. N. Krishna
    Deepapriya, B. S.
    Vardhini, P. A. Harsha
    Kalyani, B. J. D.
    Sharmila, L.
    SOFT COMPUTING, 2022, 26 (19) : 10527 - 10533
  • [27] Comprehensive evaluation and performance analysis of machine learning in heart disease prediction
    Al-Alshaikh, Halah A.
    Prabu, P.
    Poonia, Ramesh Chandra
    Saudagar, Abdul Khader Jilani
    Yadav, Manoj
    AlSagri, Hatoon S.
    AlSanad, Abeer A.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [28] Clinical Prediction Models for Valvular Heart Disease
    Wessler, Benjamin S.
    Lundquist, Christine M.
    Koethe, Benjamin
    Park, Jinny G.
    Brown, Kristen
    Williamson, Tatum
    Ajlan, Muhammad
    Natto, Zuhair
    Lutz, Jennifer S.
    Paulus, Jessica K.
    Kent, David M.
    JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2019, 8 (20):
  • [29] Learning-based techniques for heart disease prediction: a survey of models and performance metrics
    Bizimana, Pierre Claver
    Zhang, Zuping
    Asim, Muhammad
    El-Latif, Ahmed A. Abd
    Hammad, Mohamed
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) : 39867 - 39921
  • [30] Prediction of Coronary Heart Disease using Supervised Machine Learning Algorithms
    Krishnani, Divya
    Kumari, Anjali
    Dewangan, Akash
    Singh, Aditya
    Naik, Nenavath Srinivas
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 367 - 372