Decoupled Multi-teacher Knowledge Distillation based on Entropy

被引:0
|
作者
Cheng, Xin [1 ]
Tang, Jialiang [2 ]
Zhang, Zhiqiang [3 ]
Yu, Wenxin [3 ]
Jiang, Ning [3 ]
Zhou, Jinjia [1 ]
机构
[1] Hosei Univ, Grad Sch Sci & Engn, Tokyo, Japan
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[3] Southwest Univ Sci & Technol, Sch Comp Sci & Technol, Mianyang, Sichuan, Peoples R China
关键词
Multi-teacher knowledge distillation; image classification; entropy; deep learning;
D O I
10.1109/ISCAS58744.2024.10558141
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multi-teacher knowledge distillation (MKD) aims to leverage the valuable and diverse knowledge presented by multiple teacher networks to improve the performance of the student network. Existing approaches typically rely on simple methods such as averaging the prediction logits or using sub-optimal weighting strategies to combine knowledge from multiple teachers. However, employing these techniques cannot fully reflect the importance of teachers and may even mislead student's learning. To address these issues, we propose a novel Decoupled Multi teacher Knowledge Distillation based on Entropy (DE-MKD). DE-MKD decomposes the vanilla KD loss and assigns weights to each teacher to reflect its importance based on the entropy of their predictions. Furthermore, we extend the proposed approach to distill the intermediate features from teachers to further improve the performance of the student network. Extensive experiments conducted on the publicly available CIFAR-100 image classification dataset demonstrate the effectiveness and flexibility of our proposed approach.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Accurate and efficient protein embedding using multi-teacher distillation learning
    Shang, Jiayu
    Peng, Cheng
    Ji, Yongxin
    Guan, Jiaojiao
    Cai, Dehan
    Tang, Xubo
    Sun, Yanni
    BIOINFORMATICS, 2024, 40 (09)
  • [42] Deep Fuzzy Multi-Teacher Distillation Network for Medical Visual Question Answering
    Liu Y.
    Chen B.
    Wang S.
    Lu G.
    Zhang Z.
    IEEE Transactions on Fuzzy Systems, 2024, 32 (10) : 1 - 15
  • [43] Building and road detection from remote sensing images based on weights adaptive multi-teacher collaborative distillation using a fused knowledge
    Chen, Ziyi
    Deng, Liai
    Gou, Jing
    Wang, Cheng
    Li, Jonathan
    Li, Dilong
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 124
  • [44] BadCleaner: Defending Backdoor Attacks in Federated Learning via Attention-Based Multi-Teacher Distillation
    Zhang, Jiale
    Zhu, Chengcheng
    Ge, Chunpeng
    Ma, Chuan
    Zhao, Yanchao
    Sun, Xiaobing
    Chen, Bing
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (05) : 4559 - 4573
  • [45] Decoupled Knowledge Distillation
    Zhao, Borui
    Cui, Quan
    Song, Renjie
    Qiu, Yiyu
    Liang, Jiajun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11943 - 11952
  • [46] A multi-graph neural group recommendation model with meta-learning and multi-teacher distillation
    Zhou, Weizhen
    Huang, Zhenhua
    Wang, Cheng
    Chen, Yunwen
    KNOWLEDGE-BASED SYSTEMS, 2023, 276
  • [47] MKD-Cooper: Cooperative 3D Object Detection for Autonomous Driving via Multi-Teacher Knowledge Distillation
    Li, Zhiyuan
    Liang, Huawei
    Wang, Hanqi
    Zhao, Mingzhuo
    Wang, Jian
    Zheng, Xiaokun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1490 - 1500
  • [48] SA-MDRAD: sample-adaptive multi-teacher dynamic rectification adversarial distillation
    Li, Shuyi
    Yang, Xiaohan
    Cheng, Guozhen
    Liu, Wenyan
    Hu, Hongchao
    MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [49] ADAPTIVE KNOWLEDGE DISTILLATION BASED ON ENTROPY
    Kwon, Kisoo
    Na, Hwidong
    Lee, Hoshik
    Kim, Nam Soo
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7409 - 7413
  • [50] Learning Accurate, Speedy, Lightweight CNNs via Instance-Specific Multi-Teacher Knowledge Distillation for Distracted Driver Posture Identification
    Li, Wenjing
    Wang, Jing
    Ren, Tingting
    Li, Fang
    Zhang, Jun
    Wu, Zhongcheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17922 - 17935