Leader learning loss function in neural network classification

被引:8
|
作者
Zhang, Siyuan
Xie, Linbo [1 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Wuxi, Jiangsu, Peoples R China
关键词
Leader learning; Loss function learning; Neural network classification; Cost-sensitive learning;
D O I
10.1016/j.neucom.2023.126735
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning, based on Empirical Risk Minimization (ERM), typically aims to fit the ideal outputs of all samples due to its large capacity. However, models trained based on empirical losses like cross entropy (CE) or mean square error (MSE), often learn unnecessary information during classification, leading to premature overfitting. On the other hand, the result-focused loss functions, i.e., zero-one loss or hinge loss, are hard to optimize and thus are rarely applied directly in neural network. This paper proposes a novel leader learning in classification, where CE is gradually trained by classification results using sample-dependent cost-sensitive learning. As complementary, the stepwise-changed CE covers the deficiency on classification error while preserving the advantage of fast convergence. In this way, the deviation between CE and classification error can be corrected. Experimental results demonstrate that the proposed leader learning has a more significant convergence trend than the baseline algorithms. Moreover, the loss function learned from a specific dataset has broad generality that can be transferred to other models as prior knowledge.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Modal Neural Network: Robust Deep Learning with Mode Loss Function
    Zhu, Liangxuan
    Li, Han
    Wen, Wen
    Wu, Lingjuan
    Chen, Hong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [2] Fully Convolutional Neural Network Structure and Its Loss Function for Image Classification
    Zhu, Qiuyu
    Zu, Xuewen
    IEEE ACCESS, 2022, 10 : 35541 - 35549
  • [3] A novel deep learning neural network for fast-food image classification and prediction using modified loss function
    Lohala, Saurav
    Alsadoon, Abeer
    Prasad, P. W. C.
    Ali, Rasha S.
    Altaay, Alaa Jabbar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25453 - 25476
  • [4] A novel deep learning neural network for fast-food image classification and prediction using modified loss function
    Saurav Lohala
    Abeer Alsadoon
    P. W. C. Prasad
    Rasha S. Ali
    Alaa Jabbar Altaay
    Multimedia Tools and Applications, 2021, 80 : 25453 - 25476
  • [5] Deep learning neural network for lung cancer classification: enhanced optimization function
    Bhoj Raj Pandit
    Abeer Alsadoon
    P. W. C. Prasad
    Sarmad Al Aloussi
    Tarik A. Rashid
    Omar Hisham Alsadoon
    Oday D. Jerew
    Multimedia Tools and Applications, 2023, 82 : 6605 - 6624
  • [6] AN IMPROVED DEEP CONVOLUTIONAL NEURAL NETWORK MODEL WITH KERNEL LOSS FUNCTION IN IMAGE CLASSIFICATION
    Xia, Yuantian
    Zhou, Juxiang
    Xu, Tianwei
    Gao, Wei
    MATHEMATICAL FOUNDATIONS OF COMPUTING, 2020, 3 (01): : 51 - 64
  • [7] MPCE: A Maximum Probability Based Cross Entropy Loss Function for Neural Network Classification
    Zhou, Yangfan
    Wang, Xin
    Zhang, Mingchuan
    Zhu, Junlong
    Zheng, Ruijuan
    Wu, Qingtao
    IEEE ACCESS, 2019, 7 : 146331 - 146341
  • [8] Deep learning neural network for lung cancer classification: enhanced optimization function
    Pandit, Bhoj Raj
    Alsadoon, Abeer
    Prasad, P. W. C.
    Al Aloussi, Sarmad
    Rashid, Tarik A.
    Alsadoon, Omar Hisham
    Jerew, Oday D.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) : 6605 - 6624
  • [9] A Fast Multi-Loss Learning Deep Neural Network for Automatic Modulation Classification
    Chang, Shuo
    Yang, Zheng
    He, Jiashuo
    Li, Rong
    Huang, Sai
    Feng, Zhiyong
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2023, 9 (06) : 1503 - 1518
  • [10] Mixture Loss Function-based Classification Network for Few-shot Learning
    Zhang, Yansha
    Pan, Feng
    Wang, Jie
    Wang, Lin
    2022 INTERNATIONAL CONFERENCE ON COMPUTING, ROBOTICS AND SYSTEM SCIENCES, ICRSS, 2022, : 53 - 58