OWAdapt: An adaptive loss function for deep learning using OWA operators

Cited by: 4
Authors
Maldonado, Sebastian [1, 4]
Vairetti, Carla [2, 4]
Jara, Katherine [2 ]
Carrasco, Miguel [2 ]
Lopez, Julio [3 ]
Affiliations
[1] Univ Chile, Sch Econ & Business, Dept Management Control & Informat Syst, Santiago, Chile
[2] Univ Los Andes, Fac Ingn & Ciencias Aplicadas, Santiago, Chile
[3] Univ Diego Portales, Fac Ingn & Ciencias, Ejercito 441, Santiago, Chile
[4] Inst Sistemas Complejos Ingn ISCI, Santiago, Chile
Keywords
OWA operators; Loss functions; Class-imbalance classification; Deep learning; SUPPORT VECTOR MACHINES; SMOTE;
DOI
10.1016/j.knosys.2023.111022
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In this paper, we propose a novel adaptive loss function for enhancing deep learning performance in classification tasks. Specifically, we redefine the cross-entropy loss to effectively address class-level noise conditions, including the challenging problem of class imbalance. Our approach introduces aggregation operators to improve classification accuracy. The rationale behind our proposed method lies in the iterative up-weighting of class-level components within the loss function, focusing on those with larger errors. To achieve this, we employ the ordered weighted average (OWA) operator and combine it with an adaptive scheme for gradient-based learning. The main finding is that our method outperforms other commonly used loss functions, such as the standard cross-entropy or focal loss, across various binary and multiclass classification tasks. Furthermore, we explore the influence of hyperparameters associated with the OWA operators and propose a default configuration that performs well across different experimental settings.
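The abstract's central idea, aggregating class-level loss components with an OWA operator so that the classes with larger errors receive more weight, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`owa`, `class_level_cross_entropy`, `linear_decreasing_weights`) and the linearly decreasing weight vector are assumptions made for illustration only, and the adaptive scheme for gradient-based learning that the abstract mentions is not reproduced here.

```python
import numpy as np


def owa(values, weights):
    """Ordered weighted average: weights[i] multiplies the i-th LARGEST value."""
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    if values.shape != weights.shape:
        raise ValueError("values and weights must have the same shape")
    if np.any(weights < 0) or not np.isclose(weights.sum(), 1.0):
        raise ValueError("weights must be non-negative and sum to 1")
    ordered = np.sort(values)[::-1]  # descending order
    return float(ordered @ weights)


def class_level_cross_entropy(probs, labels, n_classes, eps=1e-12):
    """Mean cross-entropy of the samples belonging to each class (hypothetical helper)."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=int)
    true_class_prob = probs[np.arange(len(labels)), labels]
    per_sample = -np.log(np.clip(true_class_prob, eps, 1.0))
    losses = np.zeros(n_classes)
    for k in range(n_classes):
        mask = labels == k
        if mask.any():
            losses[k] = per_sample[mask].mean()
    return losses


def linear_decreasing_weights(n):
    """Assumed weight vector: largest weight on the largest class loss."""
    raw = np.arange(n, 0, -1, dtype=float)
    return raw / raw.sum()


# Toy imbalanced batch: two samples of class 0, one poorly fitted sample of class 1.
probs = [[0.9, 0.1], [0.8, 0.2], [0.6, 0.4]]
labels = [0, 0, 1]

class_losses = class_level_cross_entropy(probs, labels, n_classes=2)
owa_loss = owa(class_losses, linear_decreasing_weights(2))
mean_loss = float(class_losses.mean())
```

With uniform weights the OWA reduces to the plain average of the class losses, and with all weight on the first position it reduces to the maximum; the decreasing weights used above sit between these two extremes, so `owa_loss` is larger than `mean_loss` whenever the class losses differ.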
Pages: 9
Related papers
50 in total
  • [21] Apparel sizing using trimmed PAM and OWA operators
    Ibanez, M. V.
    Vinue, G.
    Alemany, S.
    Simo, A.
    Epifanio, I.
    Domingo, J.
    Ayala, G.
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 10512 - 10520
  • [22] An Adaptive Deep Metric Learning Loss Function for Class-Imbalance Learning via Intraclass Diversity and Interclass Distillation
    Du, Jie
    Zhang, Xiaoci
    Liu, Peng
    Vong, Chi-Man
    Wang, Tianfu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15372 - 15386
  • [23] An Adaptive Asymmetric Loss Function for Positive Unlabeled Learning
    Jaskie, Kristen
    Vaughn, Nolan
    Narayanaswamy, Vivek
    Zaare, Sahba
    Marvin, Joseph
    Spanias, Andreas
    AUTOMATIC TARGET RECOGNITION XXXIII, 2023, 12521
  • [24] A Semantic Loss Function for Deep Learning with Symbolic Knowledge
    Xu, Jingyi
    Zhang, Zilu
    Friedman, Tal
    Liang, Yitao
    Van den Broeck, Guy
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [25] Loss Function for Deep Learning to Model Dynamical Systems
    Yoshida, Takahito
    Yaguchi, Takaharu
    Matsubara, Takashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (11) : 1458 - 1462
  • [26] The Negative BER Loss Function for Deep Learning Decoders
    Dong, Rui
    Lu, Fang
    Dong, Yan
    Yan, Haotian
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (08) : 1824 - 1828
  • [27] Generalized Correntropy Induced Loss Function for Deep Learning
    Chen, Liangjun
    Qu, Hua
    Zhao, Jihong
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1428 - 1433
  • [28] A weakly supervised adaptive triplet loss for deep metric learning
    Zhao, Xiaonan
    Qi, Huan
    Luo, Rui
    Davis, Larry
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3177 - 3180
  • [29] CurricularFace: Adaptive Curriculum Learning Loss for Deep Face Recognition
    Huang, Yuge
    Wang, Yuhan
    Tai, Ying
    Liu, Xiaoming
    Shen, Pengcheng
    Li, Shaoxin
    Li, Jilin
    Huang, Feiyue
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5900 - 5909
  • [30] Learning deep features with adaptive triplet loss for person reidentification
    Li, Zhiqiang
    Sang, Nong
    Chen, Kezhou
    Gao, Changxin
    Wang, Ruolin
    MIPPR 2017: PATTERN RECOGNITION AND COMPUTER VISION, 2017, 10609