Cost-Sensitive Learning based on Performance Metric for Imbalanced Data

Cited by: 0
Authors
Yuri Sousa Aurelio
Gustavo Matheus de Almeida
Cristiano Leite de Castro
Antonio Padua Braga
Affiliations
[1] Federal University of Minas Gerais,
Source
Neural Processing Letters | 2022 / Vol. 54
Keywords
Classification; Imbalanced problem; Cost-sensitive function; Multi-Layer perceptron; Back-propagation; Confusion matrix;
DOI
Not available
Abstract
Performance metrics are usually evaluated only after a neural network has been trained with an error cost function. This procedure can result in suboptimal model selection, particularly for imbalanced classification problems. This work proposes using these metrics, which are often derived from the confusion matrix, directly as cost functions. Commonly used metrics are covered, namely AUC, G-mean, F1-score and AG-mean. The only implementation change for model training occurs in the backpropagation error term. The results were compared to the standard MLP trained with the Rprop learning algorithm, and to SMOTE, SMTTL, WWE and RAMOBoost. Sixteen classical benchmark datasets were used in the experiments. Based on average ranks, the proposed formulation outperformed Rprop and all sampling strategies, namely SMOTE, SMTTL and WWE, for all metrics. These results were statistically confirmed for AUC and G-mean in relation to Rprop. For F1-score and AG-mean, all algorithms were considered statistically equivalent. The proposal was also superior to RAMOBoost for G-mean based on average ranks. However, it was statistically faster than RAMOBoost for all metrics. It was also faster than SMTTL and statistically equivalent to Rprop, SMOTE and WWE. Moreover, the solutions obtained are generally non-dominated compared to all other techniques, for all metrics. The results showed that the direct use of performance metrics as cost functions for neural network training benefits both generalization capacity and computation time in imbalanced classification problems. Its extension to other performance metrics derived directly from the confusion matrix is straightforward.
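As a rough illustration of why confusion-matrix metrics matter for imbalanced data, the sketch below computes accuracy, G-mean and F1-score from binary predictions using their standard textbook definitions. It is not the authors' cost-function formulation, which the abstract does not spell out; the function names and the toy labels are assumptions made for this example only.

```python
import math

def confusion_counts(y_true, y_pred):
    """Return (tp, fn, fp, tn) for binary labels, with 1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fn, fp, tn

def accuracy(tp, fn, fp, tn):
    total = tp + fn + fp + tn
    return (tp + tn) / total if total else 0.0

def g_mean(tp, fn, fp, tn):
    """Geometric mean of the true positive rate and the true negative rate."""
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    tnr = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(tpr * tnr)

def f1_score(tp, fn, fp, tn):
    """Harmonic mean of precision and recall, written in terms of counts."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

# Hypothetical toy data: 2 positives, 8 negatives.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

# A classifier that always predicts the majority class looks good on
# accuracy but scores zero on G-mean and F1.
all_negative = [0] * 10
counts_majority = confusion_counts(y_true, all_negative)

# A classifier that finds one of the two positives at the cost of one
# false alarm has the same accuracy but non-zero G-mean and F1.
mixed = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
counts_mixed = confusion_counts(y_true, mixed)

for name, counts in [("majority", counts_majority), ("mixed", counts_mixed)]:
    print(name, accuracy(*counts), g_mean(*counts), f1_score(*counts))
```

Both toy classifiers reach the same accuracy, yet only the second has non-zero G-mean and F1-score, which is the kind of gap that motivates optimizing such metrics directly.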
Pages: 3097 - 3114
Number of pages: 17
Related papers
50 in total
  • [31] Using Cost-Sensitive Learning and Feature Selection Algorithms to Improve the Performance of Imbalanced Classification
    Feng, Fang
    Li, Kuan-Ching
    Shen, Jun
    Zhou, Qingguo
    Yang, Xuhui
    [J]. IEEE ACCESS, 2020, 8 : 69979 - 69996
  • [32] Cost-Sensitive Variational Autoencoding Classifier for Imbalanced Data Classification
    Liu, Fen
    Qian, Quan
    [J]. ALGORITHMS, 2022, 15 (05)
  • [33] A Statistical Approach to Cost-Sensitive AdaBoost for Imbalanced Data Classification
    Bei, Honghan
    Wang, Yajie
    Ren, Zhaonuo
    Jiang, Shuo
    Li, Keran
    Wang, Wenyang
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [34] Cost-sensitive Hybrid Neural Networks for Heterogeneous and Imbalanced Data
    Jiang, Xinxin
    Pan, Shirui
    Long, Guodong
    Chang, Jiang
    Jiang, Jing
    Zhang, Chengqi
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [35] Cost-sensitive design of quadratic discriminant analysis for imbalanced data
    Bejaoui, Amine
    Elkhalil, Khalil
    Kammoun, Abla
    Alouini, Mohamed-Slim
    Al-Naffouri, Tareq
    [J]. PATTERN RECOGNITION LETTERS, 2021, 149 : 24 - 29
  • [36] Applying Adaptive Over-sampling Technique Based on Data Density and Cost-Sensitive SVM to Imbalanced Learning
    Wang, Senzhang
    Li, Zhoujun
    Chao, Wenhan
    Cao, Qinghua
    [J]. 2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [37] A Differential Evolution-Based Method for Class-Imbalanced Cost-Sensitive Learning
    Qiu, Chen
    Jiang, Liangxiao
    Kong, Ganggang
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [38] Cost-Sensitive Awareness-Based SAR Automatic Target Recognition for Imbalanced Data
    Cao, Changjie
    Cui, Zongyong
    Wang, Liying
    Wang, Jielei
    Cao, Zongjie
    Yang, Jianyu
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [39] An Effective Imbalanced JPEG Steganalysis Scheme Based on Adaptive Cost-Sensitive Feature Learning
    Jia, Ju
    Zhai, Liming
    Ren, Weixiang
    Wang, Lina
    Ren, Yanzhen
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (03) : 1038 - 1052
  • [40] LW-ELM: A Fast and Flexible Cost-Sensitive Learning Framework for Classifying Imbalanced Data
    Yu, Hualong
    Sun, Changyin
    Yang, Xibei
    Zheng, Shang
    Wang, Qi
    Xi, Xiaoyan
    [J]. IEEE ACCESS, 2018, 6 : 28488 - 28500