Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction

Cited by: 17
Authors
Dritsas, Elias [1]
Trigka, Maria [1]
Affiliations
[1] Univ Patras, Dept Comp Engn & Informat, Patras 26504, Greece
Keywords
healthcare; cardiovascular diseases; prediction; machine learning; data analysis; blood pressure; algorithms; tools
DOI
10.3390/s23031161
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Cardiovascular diseases (CVDs) are now the leading cause of death, as quality of life and human habits have changed significantly. CVDs comprise all pathological changes involving the heart and/or blood vessels and are accompanied by various complications. These pathological changes include hypertension, coronary heart disease, heart failure, angina, myocardial infarction and stroke. Hence, prevention and early diagnosis could limit the onset or progression of the disease. Machine learning (ML) techniques have gained a significant role in disease prediction and are an essential tool in medicine. In this study, a supervised ML-based methodology is presented through which we aim to design efficient prediction models for CVD manifestation, highlighting the superiority of the Synthetic Minority Oversampling Technique (SMOTE). The risk factors are analysed in detail to explore their importance and contribution to CVD prediction. These factors are fed as input features to a range of ML models, which are trained and tested to identify the most appropriate one for our objective under a binary classification problem with a uniform class probability distribution. The ML models were evaluated with and without SMOTE and compared in terms of Accuracy, Recall, Precision and Area Under the Curve (AUC). The experimental results showed that the Stacking ensemble model, after SMOTE and with 10-fold cross-validation, outperformed the others, achieving an Accuracy of 87.8%, Recall of 88.3%, Precision of 88% and an AUC of 98.2%.
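The oversampling step named in the abstract can be illustrated with a minimal pure-Python sketch of SMOTE-style interpolation. This is not the authors' implementation: the function name `smote_sketch`, its parameters and the toy data are assumptions made only for this example, and a real study would more likely rely on an established library.

```python
import random
from math import dist


def smote_sketch(minority, n_new, k=5, seed=0):
    """Hypothetical helper: create n_new synthetic minority samples.

    Each synthetic point lies on the line segment between a randomly chosen
    minority sample and one of its k nearest minority-class neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy two-feature data (assumed values, not taken from the paper).
majority = [(5.0, 5.0), (6.0, 5.5), (5.5, 6.0), (6.5, 6.5), (7.0, 5.0), (6.0, 7.0)]
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]

# Generate enough synthetic points to match the majority class size.
new_points = smote_sketch(minority, n_new=len(majority) - len(minority), k=2)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

In a cross-validated evaluation, oversampling of this kind is usually applied to the training folds only, so that synthetic points do not leak into the test fold.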
Pages: 18
Related papers
50 in total
  • [21] Efficient data-driven machine learning models for scour depth predictions at sloping sea defences
    Habib, M. A.
    Abolfathi, S.
    O'Sullivan, John J.
    Salauddin, M.
    FRONTIERS IN BUILT ENVIRONMENT, 2024, 10
  • [22] A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
    Dinh, An
    Miertschin, Stacey
    Young, Amber
    Mohanty, Somya D.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [23] A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
    An Dinh
    Stacey Miertschin
    Amber Young
    Somya D. Mohanty
    BMC Medical Informatics and Decision Making, 19
  • [24] DATA-DRIVEN CHIMNEY FIRE RISK PREDICTION USING MACHINE LEARNING AND POINT PROCESS TOOLS
    Lu, Changqin
    Van Lieshout, Marie-colette
    De Graaf, Maurits
    Visscher, Paul
    ANNALS OF APPLIED STATISTICS, 2023, 17 (04): 3088-3111
  • [25] A Data-Driven Approach for Building a Cardiovascular Disease Risk Prediction System
    Wang, Hongkuan
    Wong, Raymond K.
    Ong, Kwok Leung
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT IV, PAKDD 2024, 2024, 14648: 271-283
  • [26] Dirty engineering data-driven inverse prediction machine learning model
    Jin-Woong Lee
    Woon Bae Park
    Byung Do Lee
    Seonghwan Kim
    Nam Hoon Goo
    Kee-Sun Sohn
    Scientific Reports, 10
  • [27] Dirty engineering data-driven inverse prediction machine learning model
    Lee, Jin-Woong
    Park, Woon Bae
    Lee, Byung Do
    Kim, Seonghwan
    Goo, Nam Hoon
    Sohn, Kee-Sun
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [28] A Data-Driven Approach for Efficient Prediction of Permeability of Porous Rocks by Combining Multiscale Imaging and Machine Learning
    Iman Nabipour
    Maysam Mohammadzadeh-Shirazi
    Amir Raoof
    Jafar Qajar
    Transport in Porous Media, 2025, 152 (4)
  • [29] The Prediction of Flight Delay: Big Data-driven Machine Learning Approach
    Huo, Jiage
    Keung, K. L.
    Lee, C. K. M.
    Ng, Kam K. H.
    Li, K. C.
    2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020: 190-194
  • [30] Novel Data-Driven Machine Learning Models for Heating Load Prediction: Single and Optimized Naive Bayes
    Li, Fangyuan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08): 657-668