Classification of Imbalanced Data Using Deep Learning with Adding Noise

被引:3
|
作者
Fan, Wan-Wei [1 ]
Lee, Ching-Hung [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu, Taiwan
关键词
SURFACE DEFECT DETECTION;
D O I
10.1155/2021/1735386
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a method to treat the classification of imbalanced data by adding noise to the feature space of convolutional neural network (CNN) without changing a data set (ratio of majority and minority data). Besides, a hybrid loss function of crossentropy and KL divergence is proposed. The proposed approach can improve the accuracy of minority class in the testing data. In addition, a simple design method for selecting structure of CNN is first introduced and then, we add noise in feature space of CNN to obtain proper features by a training process and to improve the classification results. From comparison results, we can find that the proposed method can extract the suitable features to improve the accuracy of minority class. Finally, illustrated examples of multiclass classification problems and the corresponding discussion in balance ratio are presented. Our approach performs well with smaller network structure compared with other deep models. In addition, the performance is improved over 40% in defective accuracy by adding noise approach. Finally, the accuracy is higher than 96%; even the imbalanced ratio (IR) is one hundred.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Imbalanced classification by learning hidden data structure
    Zhao, Yang
    Shrivastava, Abhishek K.
    Tsui, Kwok Leung
    IIE TRANSACTIONS, 2016, 48 (07) : 614 - 628
  • [22] Stroke classification based on deep reinforcement learning over stroke screening imbalanced data
    Zuo, Ting
    Li, Fenglian
    Zhang, Xueying
    Hu, Fengyun
    Huang, Lixia
    Jia, Wenhui
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 114
  • [23] Dynamic Curriculum Learning for Imbalanced Data Classification
    Wang, Yiru
    Gan, Weihao
    Yang, Jie
    Wu, Wei
    Yan, Junjie
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5016 - 5025
  • [24] Kinship recognition from faces using deep learning with imbalanced data
    Alice Othmani
    Duqing Han
    Xin Gao
    Runpeng Ye
    Abdenour Hadid
    Multimedia Tools and Applications, 2023, 82 : 15859 - 15874
  • [25] An Improved Ensemble Learning for Imbalanced Data Classification
    Yuan, Zhengwu
    Zhao, Pu
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 408 - 411
  • [26] Data Classification with Deep Learning using Tensorflow
    Ertam, Fatih
    Aydin, Galip
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 755 - 758
  • [27] Deep Learning with MCA-based Instance Selection and Bootstrapping for Imbalanced Data Classification
    Guan, Sheng
    Chen, Min
    Ha, Hsin-Yu
    Chen, Shu-Ching
    Shyu, Mei-Ling
    Zhang, Chengde
    2015 IEEE CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC), 2015, : 288 - 295
  • [28] Hygeia: A Multilabel Deep Learning-Based Classification Method for Imbalanced Electrocardiogram Data
    Xu, Xiaolong
    Xu, Haoyan
    Wang, Liying
    Zhang, Yuanyuan
    Xaio, Fu
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (04) : 2480 - 2493
  • [29] Deep active learning models for imbalanced image classification
    Jin, Qiuye
    Yuan, Mingzhi
    Wang, Haoran
    Wang, Manning
    Song, Zhijian
    KNOWLEDGE-BASED SYSTEMS, 2022, 257
  • [30] Deep Learning Applied to Imbalanced Malware Datasets Classification
    Salas, Marcelo Palma
    de Geus, Paulo Licio
    JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2024, 15 (01) : 342 - 359