A robust loss function for classification with imbalanced datasets

被引:9
|
作者
Wang, Yidan [1 ]
Yang, Liming [1 ]
机构
[1] China Agr Univ, Coll Sci, Beijing 100083, Haidian, Peoples R China
基金
中国国家自然科学基金;
关键词
Robustness; Imbalanced classification; Bayes optimal classifier; Fisher consistency; Half quadratic optimization; LEARNING-MACHINE; COST;
D O I
10.1016/j.neucom.2018.11.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Based on minimizing misclassification cost, a new robust loss function is designed in this paper to deal with the imbalanced classification problem under noise environment. It is nonconvex but maintains Fisher consistency. Applying the proposed loss function into support vector machine (SVM), a robust SVM framework is presented which results in a Bayes optimal classifier. However, nonconvexity makes the model difficult to optimize. We develop an alternative iterative algorithm to solve the proposed model. What's more, we analyze the robustness of the proposed model theoretically from a re-weighted SVM viewpoint and the obtained optimal solution is consistent with Bayesian optimal decision rule. Further-more, numerical experiments are carried out on databases that are drawn from UCI Machine Learning Repository and a practical application. With two different types of noise environments, one with label noise and one with feature noise, experiment results show that on these two databases the proposed method achieves better generalization results compared to other SVM methods. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:40 / 49
页数:10
相关论文
共 50 条
  • [1] A New Loss Function for Traffic Classification Task on Dramatic Imbalanced Datasets
    Xu, Luyang
    Zhou, Xu
    Lin, Xifeng
    Ren, Yongmao
    Qin, Yifang
    Liu, Jun
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [2] Robust loss function for class imbalanced semantic segmentation and image classification
    Bhat, S. Divakar
    Amit, More
    Soni, Mudit
    Yasui, Yuji
    IFAC PAPERSONLINE, 2023, 56 (02): : 7934 - 7939
  • [3] A JOINT SEMANTIC SEGMENTATION LOSS FUNCTION FOR IMBALANCED DATASETS
    Ismael, Sarmad F.
    Aptoula, Erchan
    Kayabol, Koray
    2022 IEEE MEDITERRANEAN AND MIDDLE-EAST GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (M2GARSS), 2022, : 13 - 16
  • [4] A Joint Semantic Segmentation Loss Function for Imbalanced Datasets
    Ismael, Sarmad F.
    Aptoula, Erchan
    Kayabol, Koray
    2022 IEEE Mediterranean and Middle-East Geoscience and Remote Sensing Symposium, M2GARSS 2022 - Proceedings, 2022, : 13 - 16
  • [5] To improve classification of imbalanced datasets
    Shukla, Pratyusha
    Bhowmick, Kiran
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [6] A Generative Neighborhood-based Deep Autoencoder with an Extended Loss Function for Robust Imbalanced Classification
    Troullinou, E.
    Tsagkatakis, G.
    Losonczy, A.
    Poirazi, P.
    Tsakalides, P.
    FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 1015 - 1019
  • [7] A Nonuniform Weighted Loss Function for Imbalanced Image Classification
    Qiu, Qiang
    Song, Zichen
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS PROCESSING (ICIGP 2018), 2018, : 78 - 82
  • [8] Classification of Antimicrobial Peptides with Imbalanced Datasets
    Camacho, Francy L.
    Torres, Rodrigo
    Ramos Pollan, Raul
    11TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2015, 9681
  • [9] Discrimination Aware Classification for Imbalanced Datasets
    Ristanoski, Goce
    Liu, Wei
    Bailey, James
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1529 - 1532
  • [10] Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets
    Abousaber, Inam
    Abdallah, Haitham F.
    El-Ghaish, Hany
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2025, 7