DASMcC: Data Augmented SMOTE Multi-Class Classifier for Prediction of Cardiovascular Diseases Using Time Series Features

被引:1
|
作者
Sinha, Nidhi [1 ]
Kumar, M. A. Ganesh [1 ]
Joshi, Amit M. [1 ]
Cenkeramaddi, Linga Reddy [2 ]
机构
[1] Malaviya Natl Inst Technol, Dept Elect & Commun Engn, Jaipur 302017, India
[2] Univ Agder, Dept Informat & Commun Technol, N-4879 Grimstad, Norway
关键词
Cardiovascular disease (CVD); PTB-XL data; machine learning; smart healthcare; ECG; heart failure; XG boost (XGB); random forest (RF); cat boost; K nearest neighbor (KNN); gradient boost (GB); ARRHYTHMIA DETECTION; LEARNING FRAMEWORK; MACHINE;
D O I
10.1109/ACCESS.2023.3325705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the leading causes of mortality worldwide is cardiovascular disease (CVD). Electrocardiography (ECG) is a noninvasive and cost-effective tool to diagnose the heart's health. This study presents a multi-class classifier for the prediction of four different types of Cardiovascular Diseases, i.e., Myocardial Infarction, Hypertrophy, Conduction Disturbances, and ST-T abnormality using 12-lead ECG. There are four key steps involved in the presented work: data preprocessing, feature extraction, data preparation, and augmentation, and modelling for multi-class CVD classification. The sixteen-time domain augmented features are used to train the classifier. The work is divided into three parts: extracting the features from raw 12-lead ECG signals, data preparation and augmentation, and training, testing, and validating the classifier. A comparative study of the performance of five different classifiers (i.e., Random Forest (RF), K Nearest Neighbors (KNN), Gradient Boost, Adda Boost, and XG Boost has also been presented. Accuracy, precision, recall, and F1 scores are used for performance evaluation. Further, the Receiver Operating Curve (ROC) is traced, and the Area Under the Curve (AUC) is calculated to ensure the unbiased performance of the classifier. The application of the proposed classifier in the Smart Healthcare framework has also been discussed.
引用
收藏
页码:117643 / 117655
页数:13
相关论文
共 47 条
  • [21] Evaluating the Performance of Multi-Class and Single-Class Classification Approaches for Mountain Agriculture Extraction Using Time-Series NDVI
    Mondal, Saptarshi
    Jeganathan, Chockalingam
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2018, 46 (12) : 2045 - 2055
  • [22] Evaluating the Performance of Multi-Class and Single-Class Classification Approaches for Mountain Agriculture Extraction Using Time-Series NDVI
    Saptarshi Mondal
    Chockalingam Jeganathan
    Journal of the Indian Society of Remote Sensing, 2018, 46 : 2045 - 2055
  • [23] RETRACTED: Classification of Time Series Data by One Class Classifier using DTW-D (Retracted Article)
    Vasimalla, Kumar
    Narasimham, C.
    Sujith, B.
    ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 343 - 352
  • [24] Multi-label Prediction in Time Series Data using Deep Neural Networks
    Zhang, Wenyu
    Jha, Devesh K.
    Laftchiev, Emil
    Nikovski, Daniel
    INTERNATIONAL JOURNAL OF PROGNOSTICS AND HEALTH MANAGEMENT, 2019, 10
  • [25] Incomplete Time Series Prediction Using Max-Margin Classification of Data with Absent Features
    Shang Zhaowei
    Zhang Lingfeng
    Ma Shangjun
    Fang Bin
    Zhang Taiping
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2010, 2010
  • [26] Data-driven rated power prediction of diesel engines using improved multi-class imbalanced learning method
    Guo, Liangxun
    Zhuang, Zilong
    Sun, Yanning
    Qin, Wei
    30TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING (FAIM2021), 2020, 51 : 324 - 329
  • [27] Occupancy prediction: A comparative study of static and MOTIF time series features using WiFi Syslog data
    Abdelghani, Bassam A.
    Al Mohammad, Ahlam
    Dari, Jamal
    Maleki, Mina
    Banitaan, Shadi
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2024, 44
  • [28] Comparison of Two Output-Coding Strategies for Multi-Class Tumor Classification Using Gene Expression Data and Latent Variable Model as Binary Classifier
    Joseph, Sandeep J.
    Robbins, Kelly R.
    Zhang, Wensheng
    Rekaya, Romdhane
    CANCER INFORMATICS, 2010, 9 : 39 - 48
  • [29] Time-Aware Multi-Type Data Fusion Representation Learning Framework for Risk Prediction of Cardiovascular Diseases
    An, Ying
    Tang, Kun
    Wang, Jianxin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3725 - 3734
  • [30] Multi-step ahead prediction of taxi demand using time-series and textual data
    Markou, Ioulia
    Rodrigues, Filipe
    Pereira, Francisco C.
    URBAN MOBILITY - SHAPING THE FUTURE TOGETHER, 2019, 41 : 540 - 544