HSDLM: A Hybrid Sampling With Deep Learning Method for Imbalanced Data Classification

被引:27
|
作者
Hasib, Khan Md [1 ]
Towhid, Nurul Akter [2 ]
Islam, Md Rafiqul [3 ]
机构
[1] Ahsanullah Univ Sci & Engn, Dhaka, Bangladesh
[2] Jahangirnagar Univ, Dhaka, Bangladesh
[3] Univ Technol Sydney UTS, Sydney, NSW, Australia
关键词
Class Imbalance; Classification; Deep Learning; ENN; LSTM; Sampling; SMOTE; SUPPORT; SMOTE;
D O I
10.4018/IJCAC.2021100101
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Imbalanced data presents many difficulties, as the majority of learners will be prejudice against the majority class, and in severe cases, may fully disregard the minority class. Over the last few decades, class inequality has been extensively researched using traditional machine learning techniques. However, there is relatively little analytical research in the field of deep learning with class inequality. In this article, the authors classify the imbalanced data with the combination of both sampling method and deep learning method. They propose a novel sampling-based deep learning method (HSDLM) to address the class imbalance problem. They preprocess the data with label encoding and remove the noisy data with the under-sampling technique edited nearest neighbor (ENN) algorithm. They also balance the data using the over-sampling technique SMOTE and apply parallelly three types of long short-term memory networks, which is a deep learning classifier. The experimental findings indicate that HSDLM is a promising and fruitful solution to working with strongly imbalanced datasets.
引用
下载
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [31] An Under-sampling Imbalanced Learning of Data Gravitation Based Classification
    Peng, Lizhi
    Yang, Bo
    Chen, Yuehui
    Zhou, Xiaoqing
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 419 - 425
  • [32] Entropy-based hybrid sampling ensemble learning for imbalanced data
    Dongdong, Li
    Ziqiu, Chi
    Bolu, Wang
    Zhe, Wang
    Hai, Yang
    Wenli, Du
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (07) : 3039 - 3067
  • [33] HSDP: A Hybrid Sampling Method for Imbalanced Big Data Based on Data Partition
    Chen, Liping
    Jiang, Jiabao
    Zhang, Yong
    COMPLEXITY, 2021, 2021
  • [34] An Effective Over-sampling Method for Imbalanced Data Sets Classification
    Zhai Yun
    Ma Nan
    Ruan Da
    An Bing
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 489 - 494
  • [35] Imbalanced Data Classification Based on a Hybrid Resampling SVM Method
    Cao, Lu
    Zhai, Yikui
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 1533 - 1536
  • [36] A Hybrid Under-Sampling Method (HUSBoost) to Classify Imbalanced Data
    Popel, Mahmudul Hasan
    Hasib, Khan Md
    Habib, Syed Ahsan
    Shah, Faisal Muhammad
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [37] Learning Deep Representation for Imbalanced Classification
    Huang, Chen
    Li, Yining
    Loy, Chen Change
    Tang, Xiaoou
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5375 - 5384
  • [38] Deep reinforcement learning for imbalanced classification
    Lin, Enlu
    Chen, Qiong
    Qi, Xiaoming
    APPLIED INTELLIGENCE, 2020, 50 (08) : 2488 - 2502
  • [39] Learning Deep Landmarks for Imbalanced Classification
    Bao, Feng
    Deng, Yue
    Kong, Youyong
    Ren, Zhiquan
    Suo, Jinli
    Dai, Qionghai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 2691 - 2704
  • [40] Deep reinforcement learning for imbalanced classification
    Enlu Lin
    Qiong Chen
    Xiaoming Qi
    Applied Intelligence, 2020, 50 : 2488 - 2502