Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup

被引:17
|
作者
Xu, Guodong [1 ]
Liu, Ziwei [2 ]
Loy, Chen Change [2 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, Singapore, Singapore
关键词
Knowledge distillation; Training cost;
D O I
10.1016/j.patcog.2023.109338
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge distillation (KD) has emerged as an essential technique not only for model compression, but also other learning tasks such as continual learning. Given the richer application spectrum and potential online usage of KD, knowledge distillation efficiency becomes a pivotal component. In this work, we study this little-explored but important topic. Unlike previous works that focus solely on the accuracy of stu-dent network, we attempt to achieve a harder goal - to obtain a performance comparable to conventional KD with a lower computation cost during the transfer. To this end, we present UNcertainty-aware mIXup (UNIX), an effective approach that can reduce transfer cost by 20% to 30% and yet maintain comparable or achieve even better student performance than conventional KD. This is made possible via effective uncertainty sampling and a novel adaptive mixup approach that select informative samples dynamically over ample data and compact knowledge in these samples. We show that our approach inherently per-forms hard sample mining. We demonstrate the applicability of our approach to improve various existing KD approaches by reducing their queries to a teacher network. Extensive experiments are performed on CIFAR100 and ImageNet. Code and model are available at https://github.com/xuguodong03/UNIXKD .(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] An uncertainty-aware framework for reliable disaster damage assessment via crowdsourcing
    Khajwal, Asim B.
    Noshadravan, Arash
    INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2021, 55
  • [22] Safe Learning for Uncertainty-Aware Planning via Interval MDP Abstraction
    Jiang, Jesse
    Zhao, Ye
    Coogan, Samuel
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2641 - 2646
  • [23] Preserving Privacy in GPS Traces via Uncertainty-Aware Path Cloaking
    Hoh, Baik
    Gruteser, Marco
    Xiong, Hui
    Alrabady, Ansaf
    CCS'07: PROCEEDINGS OF THE 14TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2007, : 161 - +
  • [24] Resolution-Aware Knowledge Distillation for Efficient Inference
    Feng, Zhanxiang
    Lai, Jianhuang
    Xie, Xiaohua
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6985 - 6996
  • [25] Uncertainty-aware fuzzy knowledge embedding method for generalized structural performance prediction
    Wang, Xiang-Yu
    Ma, Xin-Rui
    Chen, Shi-Zhi
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2025,
  • [26] Knowledge Distillation via the Target-aware Transformer
    Lin, Sihao
    Xie, Hongwei
    Wang, Bing
    Yu, Kaicheng
    Chang, Xiaojun
    Liang, Xiaodan
    Wang, Gang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10905 - 10914
  • [27] Integrated energy hub optimization in microgrids: Uncertainty-aware modeling and efficient operation
    Yan, Laiqing
    Deng, Xiwei
    Li, Ji
    ENERGY, 2024, 291
  • [28] Domain-Adaptive Object Detection via Uncertainty-Aware Distribution Alignment
    Dang-Khoa Nguyen
    Tseng, Wei-Lun
    Shuai, Hong-Han
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2499 - 2507
  • [29] An Uncertainty-Aware Transformer for MRI Cardiac Semantic Segmentation via Mean Teachers
    Wang, Ziyang
    Zheng, Jian-Qing
    Voiculescu, Irina
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 494 - 507
  • [30] Pixel-Level Anomaly Detection via Uncertainty-Aware Prototypical Transformer
    Huang, Chao
    Liu, Chengliang
    Zhang, Zheng
    Wu, Zhihao
    Wen, Jie
    Jiang, Qiuping
    Xu, Yong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,