DASMcC: Data Augmented SMOTE Multi-Class Classifier for Prediction of Cardiovascular Diseases Using Time Series Features

被引:1
|
作者
Sinha, Nidhi [1 ]
Kumar, M. A. Ganesh [1 ]
Joshi, Amit M. [1 ]
Cenkeramaddi, Linga Reddy [2 ]
机构
[1] Malaviya Natl Inst Technol, Dept Elect & Commun Engn, Jaipur 302017, India
[2] Univ Agder, Dept Informat & Commun Technol, N-4879 Grimstad, Norway
关键词
Cardiovascular disease (CVD); PTB-XL data; machine learning; smart healthcare; ECG; heart failure; XG boost (XGB); random forest (RF); cat boost; K nearest neighbor (KNN); gradient boost (GB); ARRHYTHMIA DETECTION; LEARNING FRAMEWORK; MACHINE;
D O I
10.1109/ACCESS.2023.3325705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the leading causes of mortality worldwide is cardiovascular disease (CVD). Electrocardiography (ECG) is a noninvasive and cost-effective tool to diagnose the heart's health. This study presents a multi-class classifier for the prediction of four different types of Cardiovascular Diseases, i.e., Myocardial Infarction, Hypertrophy, Conduction Disturbances, and ST-T abnormality using 12-lead ECG. There are four key steps involved in the presented work: data preprocessing, feature extraction, data preparation, and augmentation, and modelling for multi-class CVD classification. The sixteen-time domain augmented features are used to train the classifier. The work is divided into three parts: extracting the features from raw 12-lead ECG signals, data preparation and augmentation, and training, testing, and validating the classifier. A comparative study of the performance of five different classifiers (i.e., Random Forest (RF), K Nearest Neighbors (KNN), Gradient Boost, Adda Boost, and XG Boost has also been presented. Accuracy, precision, recall, and F1 scores are used for performance evaluation. Further, the Receiver Operating Curve (ROC) is traced, and the Area Under the Curve (AUC) is calculated to ensure the unbiased performance of the classifier. The application of the proposed classifier in the Smart Healthcare framework has also been discussed.
引用
收藏
页码:117643 / 117655
页数:13
相关论文
共 47 条
  • [41] Fusion of multivariate time series meteorological and static soil data for multistage crop yield prediction using multi-head self attention network
    Kaur, Arshveer
    Goyal, Poonam
    Rajhans, Rohit
    Agarwal, Lakshya
    Goyal, Navneet
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 226
  • [42] Multi-step-ahead solar output time series prediction with gate recurrent unit neural network using data decomposition and cooperation search algorithm
    Feng, Zhong-kai
    Huang, Qing-qing
    Niu, Wen-jing
    Yang, Tao
    Wang, Jia-yang
    Wen, Shi-ping
    ENERGY, 2022, 261
  • [43] Ambient nitrogen dioxide and cardiovascular diseases in rural regions: a time-series analyses using data from the new rural cooperative medical scheme in Fuyang, East China
    Dong, Teng-Fei
    Zha, Zhen-Qiu
    Sun, Liang
    Liu, Ling-Li
    Li, Xing-Yang
    Wang, Yuan
    Meng, Xiang-Long
    Li, Huai-Biao
    Wang, Hong-Li
    Nie, Huan-Huan
    Yang, Lin-Sheng
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (18) : 51412 - 51421
  • [44] Ambient nitrogen dioxide and cardiovascular diseases in rural regions: a time-series analyses using data from the new rural cooperative medical scheme in Fuyang, East China
    Teng-Fei Dong
    Zhen-Qiu Zha
    Liang Sun
    Ling-Li Liu
    Xing-Yang Li
    Yuan Wang
    Xiang-Long Meng
    Huai-Biao Li
    Hong-Li Wang
    Huan-Huan Nie
    Lin-Sheng Yang
    Environmental Science and Pollution Research, 2023, 30 : 51412 - 51421
  • [45] Ambient nitrogen dioxide and cardiovascular diseases in rural regions: a time-series analyses using data from the new rural cooperative medical scheme in Fuyang, East China
    Dong, Teng-Fei
    Zha, Zhen-Qiu
    Sun, Liang
    Liu, Ling-Li
    Li, Xing-Yang
    Wang, Yuan
    Meng, Xiang-Long
    Li, Huai-Biao
    Wang, Hong-Li
    Nie, Huan-Huan
    Yang, Lin-Sheng
    Environmental Science and Pollution Research, 2023, 30 (18): : 51412 - 51421
  • [46] Mapping cropland extent of Southeast and Northeast Asia using multi-year time-series Landsat 30-m data using a random forest classifier on the Google Earth Engine Cloud
    Oliphant, Adam J.
    Thenkabail, Prasad S.
    Teluguntla, Pardhasaradhi
    Xiong, Jun
    Gumma, Murali Krishna
    Congalton, Russell G.
    Yadav, Kamini
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2019, 81 : 110 - 124
  • [47] Deep convolutional neural networks for multi-scale time-series classification and application to tokamak disruption prediction using raw, high temporal resolution diagnostic data
    Churchill, R. M.
    Tobias, B.
    Zhu, Y.
    PHYSICS OF PLASMAS, 2020, 27 (06)