Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction

被引:17
|
作者
Dritsas, Elias [1 ]
Trigka, Maria [1 ]
机构
[1] Univ Patras, Dept Comp Engn & Informat, Patras 26504, Greece
关键词
healthcare; cardiovascular diseases; prediction; machine learning; data analysis; BLOOD-PRESSURE; ALGORITHMS; TOOLS;
D O I
10.3390/s23031161
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Cardiovascular diseases (CVDs) are now the leading cause of death, as the quality of life and human habits have changed significantly. CVDs are accompanied by various complications, including all pathological changes involving the heart and/or blood vessels. The list of pathological changes includes hypertension, coronary heart disease, heart failure, angina, myocardial infarction and stroke. Hence, prevention and early diagnosis could limit the onset or progression of the disease. Nowadays, machine learning (ML) techniques have gained a significant role in disease prediction and are an essential tool in medicine. In this study, a supervised ML-based methodology is presented through which we aim to design efficient prediction models for CVD manifestation, highlighting the SMOTE technique's superiority. Detailed analysis and understanding of risk factors are shown to explore their importance and contribution to CVD prediction. These factors are fed as input features to a plethora of ML models, which are trained and tested to identify the most appropriate for our objective under a binary classification problem with a uniform class probability distribution. Various ML models were evaluated after the use or non-use of Synthetic Minority Oversampling Technique (SMOTE), and comparing them in terms of Accuracy, Recall, Precision and an Area Under the Curve (AUC). The experiment results showed that the Stacking ensemble model after SMOTE with 10-fold cross-validation prevailed over the other ones achieving an Accuracy of 87.8%, Recall of 88.3%, Precision of 88% and an AUC equal to 98.2%.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Data-driven quality prediction in injection molding: An autoencoder and machine learning approach
    Ke, Kun-Cheng
    Wang, Jui-Chih
    Nian, Shih-Chih
    POLYMER ENGINEERING AND SCIENCE, 2024, 64 (09): : 4520 - 4538
  • [42] Multimodal data-driven machine learning for the prediction of surface topography in end milling
    L. Hu
    H. Phan
    S. Srinivasan
    C. Cooper
    J. Zhang
    B. Yuan
    R. Gao
    Y. B. Guo
    Production Engineering, 2024, 18 : 507 - 523
  • [43] Data-driven prediction of compressive strength of FRP-confined concrete members: An application of machine learning models
    Berradia, Mohammed
    Azab, Marc
    Ahmad, Zeeshan
    Accouche, Oussama
    Raza, Ali
    Alashker, Yasser
    STRUCTURAL ENGINEERING AND MECHANICS, 2022, 83 (04) : 515 - 535
  • [44] Data-Driven Traffic Accident Analysis and Prediction Using Machine Learning Models: A Case Study of Philadelphia City
    Lyu, Chengxuan
    SEVENTH INTERNATIONAL CONFERENCE ON TRAFFIC ENGINEERING AND TRANSPORTATION SYSTEM, ICTETS 2023, 2024, 13064
  • [45] Multimodal data-driven machine learning for the prediction of surface topography in end milling
    Hu, L.
    Phan, H.
    Srinivasan, S.
    Cooper, C.
    Zhang, J.
    Yuan, B.
    Gao, R.
    Guo, Y. B.
    PRODUCTION ENGINEERING-RESEARCH AND DEVELOPMENT, 2024, 18 (3-4): : 507 - 523
  • [46] Data-driven prediction of ship fuel oil consumption based on machine learning models considering meteorological factors
    Yang, Huirong
    Sun, Zhuo
    Han, Peixiu
    Ma, Mengjie
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART M-JOURNAL OF ENGINEERING FOR THE MARITIME ENVIRONMENT, 2024, 238 (03) : 483 - 502
  • [47] Efficient data-driven models for prediction and optimization of geothermal power plant operations
    Ling, Wei
    Liu, Yingxiang
    Young, Robert
    Cladouhos, Trenton T.
    Jafarpour, Behnam
    GEOTHERMICS, 2024, 119
  • [48] Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile
    Wiens, Jenna
    Campbell, Wayne N.
    Franklin, Ella S.
    Guttag, John V.
    Horvitz, Eric
    OPEN FORUM INFECTIOUS DISEASES, 2014, 1 (02):
  • [49] Estimation of data-driven streamflow predicting models using machine learning methods
    Siddiqi T.A.
    Ashraf S.
    Khan S.A.
    Iqbal M.J.
    Arabian Journal of Geosciences, 2021, 14 (11)
  • [50] Test Data-Driven Machine Learning Models for Reliable Quantum Circuit Output
    Saravanan, Vedika
    Saeed, Samah Mohamed
    2021 IEEE EUROPEAN TEST SYMPOSIUM (ETS 2021), 2021,