Performance analysis of supervised classification models on heart disease prediction

Cited by: 4
Authors
Ogundepo, Ezekiel Adebayo [1]
Yahya, Waheed Babatunde [1]
Affiliations
[1] Univ Ilorin, Dept Stat, Ilorin, Nigeria
Keywords
Classifiers; Model selection; Feature selection; Exploratory data analysis; Evaluation metrics
DOI
10.1007/s11334-022-00524-9
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
This paper presents a predictive analysis of data on heart disease patients to identify possible risk factors associated with their heart disease status. Two independent but similar published heart disease datasets were analyzed: the Cleveland data, used to build the classification models, and the Statlog data, used to validate the results. A detailed exploratory analysis using the chi-square test of independence was performed on the Cleveland data, after which ten standard classification models were trained for class prediction. The models were built by randomly partitioning the Cleveland data into 208 (70%) training samples and 89 (30%) test samples over 200 replications. Preliminary results showed that some of the bio-clinical categorical variables are strongly associated with the patients' heart disease status (p < 0.001). On the test samples, the support vector machine yielded the best predictive performance, with 85% accuracy, 82% sensitivity, 88% specificity, 87% precision, 91% area under the ROC curve, and a log loss value of 38%. These results were validated on the Statlog data using tenfold cross-validation, and the validation results were all consistent with those obtained from the Cleveland dataset.
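The evaluation protocol described in the abstract (a random 70/30 partition followed by accuracy, sensitivity, specificity, precision, and log loss on the test samples) can be illustrated with a minimal sketch. This is not the authors' code: the function names `split_indices` and `binary_metrics`, the 0.5 decision threshold, and the probability clipping constant are assumptions made for illustration only, and the sketch uses only the Python standard library.

```python
import math
import random


def split_indices(n, train_fraction, rng):
    """Randomly partition range(n) into train and test index lists.

    Hypothetical helper: the paper states the 70/30 proportions but not
    how the partition was implemented.
    """
    indices = list(range(n))
    rng.shuffle(indices)
    n_train = round(n * train_fraction)
    return indices[:n_train], indices[n_train:]


def _ratio(numerator, denominator):
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else float("nan")


def binary_metrics(y_true, y_prob, threshold=0.5):
    """Compute the abstract's threshold-based metrics plus log loss.

    y_true holds 0/1 labels (1 = disease present); y_prob holds predicted
    probabilities of class 1. The 0.5 threshold is an assumption.
    """
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, y_prob):
        predicted = 1 if p >= threshold else 0
        if y == 1 and predicted == 1:
            tp += 1
        elif y == 1:
            fn += 1
        elif predicted == 1:
            fp += 1
        else:
            tn += 1

    eps = 1e-15  # clip probabilities so log(0) never occurs
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)

    return {
        "accuracy": _ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": _ratio(tp, tp + fn),   # true positive rate
        "specificity": _ratio(tn, tn + fp),   # true negative rate
        "precision": _ratio(tp, tp + fp),
        "log_loss": -_ratio(total, len(y_true)),
    }
```

With 297 samples and a training fraction of 0.70, `split_indices` returns 208 training and 89 test indices, matching the sizes reported in the abstract. Repeating the split and the metric computation 200 times and averaging the results would mirror the replication scheme the abstract describes.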
Pages: 129-144
Number of pages: 16
Related papers
50 in total
  • [31] Heart Disease Risk Prediction Expending of Classification Algorithms
    Mary, Nisha
    Khan, Bilal
    Asiri, Abdullah A.
    Muhammad, Fazal
    Khan, Salman
    Alqhtani, Samar
    Mehdar, Khlood M.
    Halwani, Hanan Talal
    Irfan, Muhammad
    Alshamrani, Khalaf A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 6595 - 6616
  • [32] Comparing classification models in the final exam performance prediction
    Gamulin, Jasna
    Gamulin, Ozren
    Kermek, Dragutin
    2014 37TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2014, : 663 - 668
  • [33] An analysis on classification models for customer churn prediction
    Mouli, Kathi Chandra
    Raghavendran, Ch. V.
    Bharadwaj, V. Y.
    Vybhavi, G. Y.
    Sravani, C.
    Vafaeva, Khristina Maksudovna
    Deorari, Rajesh
    Hussein, Laith
    COGENT ENGINEERING, 2024, 11 (01):
  • [34] Performance Analysis of Supervised Image Classification Techniques for the Classification of Multispectral Satellite Imagery
    Nawaz, Adil
    Iqbal, Zahid
    Ullah, Sadiq
    2015 FOURTH INTERNATIONAL CONFERENCE ON AEROSPACE SCIENCE AND ENGINEERING (ICASE), 2016,
  • [35] Heart Disease Classification Using Machine Learning Models
    Folorunso, Sakinat Oluwabukonla
    Awotunde, Joseph Bamidele
    Adeniyi, Emmanuel Abidemi
    Abiodun, Kazeem Moses
    Ayo, Femi Emmanuel
    INFORMATICS AND INTELLIGENT APPLICATIONS, 2022, 1547 : 35 - 49
  • [36] Performance Analysis of Supervised Machine Learning Algorithms for Text Classification
    Mishu, Sadia Zaman
    Rafiuddin, S. M.
    PROCEEDINGS OF THE 2016 19TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2016, : 409 - 413
  • [37] Performance analysis of pretrained convolutional neural network models for ophthalmological disease classification
    Emir, Busra
    Colak, Ertugrul
    ARQUIVOS BRASILEIROS DE OFTALMOLOGIA, 2024, 87 (05)
  • [38] Analysis of the different non-supervised classification techniques of heart beats
    Aguiar, R. O.
    Andreao, R. V.
    Bastos Filho, T. F.
    IV LATIN AMERICAN CONGRESS ON BIOMEDICAL ENGINEERING 2007, BIOENGINEERING SOLUTIONS FOR LATIN AMERICA HEALTH, VOLS 1 AND 2, 2008, 18 (1,2): : 69 - 73
  • [39] A partially supervised classification approach to dominant and recessive human disease gene prediction
    Calvo, Borja
    Lopez-Bigas, Nuria
    Furney, Simon J.
    Larranaga, Pedro
    Lozano, Jose A.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2007, 85 (03) : 229 - 237
  • [40] Statistical models and artificial neural networks: Supervised classification and prediction via soft trees
    Ciampi, Antonio
    Lechevallier, Yves
    ADVANCES IN STATISTICAL METHODS FOR THE HEALTH SCIENCES: APPLICATIONS TO CANCER AND AIDS STUDIES, GENOME SEQUENCE ANALYSIS, AND SURVIVAL ANALYSIS, 2007, : 239 - +