Towards Effective Classification of Imbalanced Data with Convolutional Neural Networks

Cited by: 33
Authors
Raj, Vidwath [1]
Magg, Sven [1]
Wermter, Stefan [1]
Affiliations
[1] Univ Hamburg, Dept Informat, Knowledge Technol, Hamburg, Germany
DOI
10.1007/978-3-319-46182-3_13
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Class imbalance in machine learning is a problem often found with real-world data, where data from one class clearly dominates the dataset. Most neural network classifiers fail to learn to classify such datasets correctly if class-to-class separability is poor due to a strong bias towards the majority class. In this paper we present an algorithmic solution, integrating different methods into a novel approach using a class-to-class separability score, to increase performance on poorly separable, imbalanced datasets using Cost Sensitive Neural Networks. We compare different cost functions and methods that can be used for training Convolutional Neural Networks on a highly imbalanced dataset of multi-channel time series data. Results show that, despite being imbalanced and poorly separable, performance metrics such as G-Mean as high as 92.8% could be reached by using cost sensitive Convolutional Neural Networks to detect patterns and correctly classify time series from 3 different datasets.
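The abstract reports G-Mean rather than plain accuracy. As a minimal sketch, assuming the common definition of G-Mean as the geometric mean of per-class recall (the abstract does not state the paper's exact formula), the following shows why the metric suits imbalanced data. The function name `g_mean` and the toy labels are illustrative only and are not taken from the paper.

```python
import math
from collections import Counter


def g_mean(y_true, y_pred):
    """Geometric mean of per-class recall (assumed definition of G-Mean)."""
    # Number of true samples per class.
    support = Counter(y_true)
    # Number of correctly predicted samples per class.
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    recalls = [hits[c] / support[c] for c in support]
    return math.prod(recalls) ** (1.0 / len(recalls))


def accuracy(y_true, y_pred):
    """Fraction of samples predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy data: 8 majority-class samples (0), 2 minority-class samples (1).
y_true = [0] * 8 + [1] * 2
always_majority = [0] * 10          # classifier that ignores the minority class
one_miss = [0] * 8 + [0, 1]         # classifier that misses one minority sample

print(accuracy(y_true, always_majority), g_mean(y_true, always_majority))
print(accuracy(y_true, one_miss), g_mean(y_true, one_miss))
```

A classifier that always predicts the majority class still scores 80% accuracy on this toy data, while its G-Mean is zero because the minority-class recall is zero. That is the kind of majority-class bias the abstract describes.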
Pages: 150-162
Page count: 13