A Cost-Sensitive Deep Belief Network for Imbalanced Classification

被引:185
|
作者
Zhang, Chong [1 ]
Tan, Kay Chen [2 ]
Li, Haizhou [1 ]
Hong, Geok Soon [3 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117583, Singapore
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Natl Univ Singapore, Dept Mech Engn, Singapore 117583, Singapore
关键词
Cost sensitive; deep belief network; evolutionary algorithm (EA); imbalanced classification; NEURAL-NETWORKS; SAMPLING APPROACH; ENSEMBLE; MACHINE; PERFORMANCE; ALGORITHMS; SMOTE; TOOL;
D O I
10.1109/TNNLS.2018.2832648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data with a skewed class distribution are common in many real-world applications. Deep Belief Network (DBN) is a machine learning technique that is effective in classification tasks. However, conventional DBN does not work well for imbalanced data classification because it assumes equal costs for each class. To deal with this problem, cost-sensitive approaches assign different misclassification costs for different classes without disrupting the true data sample distributions. However, due to lack of prior knowledge, the misclassification costs are usually unknown and hard to choose in practice. Moreover, it has not been well studied as to how cost-sensitive learning could improve DBN performance on imbalanced data problems. This paper proposes an evolutionary cost-sensitive deep belief network (ECS-DBN) for imbalanced classification. ECS-DBN uses adaptive differential evolution to optimize the misclassification costs based on the training data that presents an effective approach to incorporating the evaluation measure (i. e., G-mean) into the objective function. We first optimize the misclassification costs, and then apply them to DBN. Adaptive differential evolution optimization is implemented as the optimization algorithm that automatically updates its corresponding parameters without the need of prior domain knowledge. The experiments have shown that the proposed approach consistently outperforms the state of the art on both benchmark data sets and real-world data set for fault diagnosis in tool condition monitoring.
引用
收藏
页码:109 / 122
页数:14
相关论文
共 50 条
  • [41] Cost-sensitive KNN classification
    Zhang, Shichao
    [J]. NEUROCOMPUTING, 2020, 391 : 234 - 242
  • [42] Adversarial Cost-Sensitive Classification
    Asif, Kaiser
    Xing, Wei
    Behpour, Sima
    Ziebart, Brian D.
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 92 - 101
  • [43] Traffic Signal Classification with Cost-Sensitive Deep Learning Models
    Tsoi, Tsz Shing
    Wheelus, Charles
    [J]. 11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 586 - 592
  • [44] Cost-sensitive Texture Classification
    Schaefer, Gerald
    Krawczyk, Bartosz
    Doshi, Niraj P.
    Nakashima, Tomoharu
    [J]. 2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 105 - 108
  • [45] Cost-Sensitive Online Classification
    Wang, Jialei
    Zhao, Peilin
    Hoi, Steven C. H.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (10) : 2425 - 2438
  • [46] Cost-Sensitive Online Classification
    Wang, Jialei
    Zhao, Peilin
    Hoi, Steven C. H.
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 1140 - 1145
  • [47] Improved cost-sensitive representation of data for solving the imbalanced big data classification problem
    Fattahi, Mahboubeh
    Moattar, Mohammad Hossein
    Forghani, Yahya
    [J]. JOURNAL OF BIG DATA, 2022, 9 (01)
  • [48] Using Cost-Sensitive Learning and Feature Selection Algorithms to Improve the Performance of Imbalanced Classification
    Feng, Fang
    Li, Kuan-Ching
    Shen, Jun
    Zhou, Qingguo
    Yang, Xuhui
    [J]. IEEE ACCESS, 2020, 8 : 69979 - 69996
  • [49] Imbalanced classification of manufacturing quality conditions using cost-sensitive decision tree ensembles
    Kim, Aekyung
    Oh, Kyuhyup
    Jung, Jae-Yoon
    Kim, Bohyun
    [J]. INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2018, 31 (08) : 701 - 717
  • [50] MULTICLASS CLASSIFICATION WITH IMBALANCED DATASETS FOR CAR OWNERSHIP DEMAND MODEL - COST-SENSITIVE LEARNING
    Kaewwichian, Patiphan
    [J]. PROMET-TRAFFIC & TRANSPORTATION, 2021, 33 (03): : 361 - 371