A Wasserstein Distance-Based Cost-Sensitive Framework for Imbalanced Data Classification

被引:0
|
作者
Feng, Rui [1 ]
Ji, Hongbing [1 ]
Zhu, Zhigang [1 ]
Wang, Lei [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Imbalanced classification; cost-sensitive; structural information; Wasserstein distance; radar emitter signal; SUPPORT VECTOR MACHINE; DECISION TREE; SYSTEMS;
D O I
10.13164/re.2023.0451
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Class imbalance is a prevalent problem in many real-world applications, and imbalanced data distribution can dramatically skew the performance of classifiers. In general, the higher the imbalance ratio of a dataset, the more difficult it is to classify. However, it is found that standard classifiers can still achieve good classification results on some highly imbalanced datasets. Obviously, the class imbalance is only a superficial characteristic of the data, and the underlying structural information is often the key factor affecting the classification performance. As implicit prior knowledge, structural information has been validated to be crucial for designing a good classifier. This paper proposes a Wasserstein-based cost-sensitive support vector machine (CS-WSVM) for class imbalance learning, incorporating prior structural information and a costsensitive strategy. The Wasserstein distance is introduced to model the distribution of majority and minority samples to capture the structural information, which is employed to weight the majority and minority samples. Comprehensive experiments on synthetic and real-world datasets, especially on the radar emitter signal dataset, demonstrated that CS-WSVM can achieve outstanding performance in imbalanced scenarios.
引用
收藏
页码:451 / 466
页数:16
相关论文
共 50 条
  • [41] Cost-sensitive design of quadratic discriminant analysis for imbalanced data
    Bejaoui, Amine
    Elkhalil, Khalil
    Kammoun, Abla
    Alouini, Mohamed-Slim
    Al-Naffouri, Tareq
    [J]. PATTERN RECOGNITION LETTERS, 2021, 149 : 24 - 29
  • [42] Cost-sensitive convolutional neural networks for imbalanced time series classification
    Geng, Yue
    Luo, Xinyu
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (02) : 357 - 370
  • [43] Cost-sensitive classification with inadequate labeled data
    Wang, Tao
    Qin, Zhenxing
    Zhang, Shichao
    Zhang, Chengqi
    [J]. INFORMATION SYSTEMS, 2012, 37 (05) : 508 - 516
  • [44] Cost-Sensitive Latent Space Learning for Imbalanced PolSAR Image Classification
    Wu, Qian
    Hou, Biao
    Wen, Zaidao
    Ren, Zhongle
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 4802 - 4817
  • [45] Cost-Sensitive Awareness-Based SAR Automatic Target Recognition for Imbalanced Data
    Cao, Changjie
    Cui, Zongyong
    Wang, Liying
    Wang, Jielei
    Cao, Zongjie
    Yang, Jianyu
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [47] Classifying imbalanced data in distance-based feature space
    Shin Ando
    [J]. Knowledge and Information Systems, 2016, 46 : 707 - 730
  • [48] Distance-based arranging oversampling technique for imbalanced data
    Qi Dai
    Jian-wei Liu
    Jia-Liang Zhao
    [J]. Neural Computing and Applications, 2023, 35 : 1323 - 1342
  • [49] Distance-based arranging oversampling technique for imbalanced data
    Dai, Qi
    Liu, Jian-wei
    Zhao, Jia-Liang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1323 - 1342
  • [50] Evolutionary Mahalanobis Distance-Based Oversampling for Multi-Class Imbalanced Data Classification
    Yao, Leehter
    Lin, Tung-Bin
    [J]. SENSORS, 2021, 21 (19)