Predicting cardiovascular disease by combining optimal feature selection methods with machine learning

被引:2
|
作者
Rodriguez Segura, Mauricio [1 ]
Nicolis, Orietta [2 ]
Peralta Marquez, Billy [2 ]
Carrillo Azocar, Juan [3 ]
机构
[1] Univ Andres Bello, Dept Ciencias Ingn, Santiago, Chile
[2] Univ Andres Bello, Fac Ingn, Vina Del Mar, Chile
[3] Hosp Dr Carlos Cisternas Calama, Calama, Chile
关键词
Cardiovascular disease; PCA; linear regression; classification models;
D O I
10.1109/sccc51225.2020.9281168
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cardiovascular Disease (CVD) is one of the main causes of death in the world. Early detection could prevent deaths associated to cardiac problems. In this work, we propose a methodology based on data pre-processing and Machine Learning (ML) techniques for predicting cardiovascular disease, by using the Sleep Heart Health Study (SHHS) dataset. First, the principal component analysis and lowest p-value logistic regression are applied to select optimal features which could be related to the CVD. Then, the selected features are used for training four ML algorithms: Naive Bayes (NB), Feed Forward Neural Networks (NN), Support Vector Machine (SVM) and Random Forest (RF). A binary feature was considered as output of the proposed models and the SMOTE sampling has been used for balancing the training set. Among the proposed methods, NN provided the best accuracy (0.81) and AUC (0.76) outperforming the results obtained in other studies.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Feature Selection in Pulmonary Function Test Data with Machine Learning Methods
    Karakis, Rukiye
    Guler, Inan
    Isik, Ali Hakan
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [32] Predicting creep life of CrMo pressure vessel steel using machine learning models with optimal feature subset selection
    Chai, Mengyu
    He, Yuhang
    Wang, Junjie
    Wu, Zichuan
    Lei, Boyu
    [J]. International Journal of Pressure Vessels and Piping, 2024, 212
  • [33] Machine Learning Feature Selection for Predicting High Concentration Therapeutic Antibody Aggregation
    Lai, Pin-Kuang
    Fernando, Amendra
    Cloutier, Theresa K.
    Kingsbury, Jonathan S.
    Gokarn, Yatin
    Halloran, Kevin T.
    Calero-Rubio, Cesar
    Trout, Bernhardt L.
    [J]. JOURNAL OF PHARMACEUTICAL SCIENCES, 2021, 110 (04) : 1583 - 1591
  • [34] Ensemble feature selection and classification methods for machine learning-based coronary artery disease diagnosis
    Kolukisa, Burak
    Bakir-Gungor, Burcu
    [J]. COMPUTER STANDARDS & INTERFACES, 2023, 84
  • [35] Machine Learning and Feature Selection Methods for Disease Classification With Application to Lung Cancer Screening Image Data
    Delzell, Darcie A. P.
    Magnuson, Sara
    Peter, Tabitha
    Smith, Michelle
    Smith, Brian J.
    [J]. FRONTIERS IN ONCOLOGY, 2019, 9
  • [36] Descriptor selection for predicting interfacial thermal resistance by machine learning methods
    Tian, Xiaojuan
    Chen, Mingguang
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [37] Descriptor selection for predicting interfacial thermal resistance by machine learning methods
    Xiaojuan Tian
    Mingguang Chen
    [J]. Scientific Reports, 11
  • [38] Improved Microarray Data Analysis using Feature Selection Methods with Machine Learning Methods
    Sun, Jing
    Passi, Kalpdrum
    Jain, Chakresh Kumar
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1527 - 1534
  • [39] Advanced Cloud-Based Prediction Models for Cardiovascular Disease: Integrating Machine Learning and Feature Selection Techniques
    B. Dhiyanesh
    S. Ganapathi Ammal
    K. Saranya
    K. E. Narayana
    [J]. SN Computer Science, 5 (5)
  • [40] Predicting Upper Body Power of Cross-country Skiers Using Machine Learning Methods Combined with Feature Selection
    Akgol, Derman
    Akay, M. Fatih
    [J]. 2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 148 - 151