KNOWLEDGE DISTILLATION WITH CATEGORY-AWARE ATTENTION AND DISCRIMINANT LOGIT LOSSES

Cited by: 4
Authors
Jiang, Lei [1]
Zhou, Wengang [1]
Li, Houqiang [1]
Affiliations
[1] Univ Sci & Technol China, EEIS Dept, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei, Anhui, Peoples R China
Keywords
knowledge distillation; attention transfer; model compression
DOI
10.1109/ICME.2019.00308
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Deep neural networks (DNNs) usually demand large amounts of storage and computation, which limits their deployment on resource-constrained platforms. Knowledge distillation is an effective way to address this limitation by transferring knowledge from a large but accurate teacher model to a small yet fast student model. In this paper, we propose two objective functions to optimize the knowledge transfer process. First, we propose a category-aware attention loss, which works at the convolutional feature level and captures object localization information. Second, we propose a discriminant logit loss at the fully-connected feature level to capture classification information. Combined, the two objective functions integrate features from different levels and guide the training of the student. We demonstrate the effectiveness of our approach on several CNN models across various datasets and show consistent performance gains with the proposed method.
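The abstract does not give the formulas for the category-aware attention loss or the discriminant logit loss, so the following is only a minimal sketch of the two generic building blocks such methods extend: a soft-target loss on logits and an attention-transfer loss on convolutional features. It is not the authors' formulation. All function names, the temperature value, and the weights `alpha` and `beta` are assumptions made for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def soft_target_loss(student_logits, teacher_logits, temperature=4.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Generic logit-level distillation; the temperature is an assumed value.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl

def attention_map(features):
    """Collapse features (channels x flattened positions) into a spatial map.

    Each position gets the sum of squared activations over channels, and the
    resulting map is L2-normalized so that overall scale does not matter.
    """
    n = len(features[0])
    amap = [sum(channel[i] ** 2 for channel in features) for i in range(n)]
    norm = math.sqrt(sum(a * a for a in amap)) or 1.0
    return [a / norm for a in amap]

def attention_loss(student_features, teacher_features):
    """Squared L2 distance between normalized student and teacher maps."""
    s = attention_map(student_features)
    t = attention_map(teacher_features)
    return sum((si - ti) ** 2 for si, ti in zip(s, t))

def distillation_loss(student_logits, teacher_logits,
                      student_features, teacher_features,
                      alpha=1.0, beta=1.0):
    """Weighted sum of the two terms; alpha and beta are hypothetical weights."""
    return (alpha * soft_target_loss(student_logits, teacher_logits)
            + beta * attention_loss(student_features, teacher_features))
```

In this sketch the student and teacher may have different channel counts, because the attention map sums over channels; only the number of spatial positions has to match. A category-aware variant would presumably compute such maps per class, but the abstract does not specify how.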
Pages: 1792-1797
Page count: 6