Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup

Cited by: 17
Authors
Xu, Guodong [1]
Liu, Ziwei [2]
Loy, Chen Change [2]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, Singapore, Singapore
Keywords
Knowledge distillation; Training cost
DOI
10.1016/j.patcog.2023.109338
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge distillation (KD) has emerged as an essential technique not only for model compression but also for other learning tasks such as continual learning. Given the richer application spectrum and potential online usage of KD, knowledge distillation efficiency becomes a pivotal component. In this work, we study this little-explored but important topic. Unlike previous works that focus solely on the accuracy of the student network, we attempt to achieve a harder goal: to obtain performance comparable to conventional KD at a lower computation cost during the transfer. To this end, we present UNcertainty-aware mIXup (UNIX), an effective approach that can reduce transfer cost by 20% to 30% while maintaining comparable, or even achieving better, student performance than conventional KD. This is made possible via effective uncertainty sampling and a novel adaptive mixup approach, which select informative samples dynamically over ample data and compact the knowledge in these samples. We show that our approach inherently performs hard sample mining. We demonstrate the applicability of our approach to improving various existing KD approaches by reducing their queries to a teacher network. Extensive experiments are performed on CIFAR100 and ImageNet. Code and model are available at https://github.com/xuguodong03/UNIXKD. (c) 2023 Elsevier Ltd. All rights reserved.
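The abstract names two ingredients, uncertainty sampling and adaptive mixup, without giving formulas. The following is only a rough illustrative sketch of how such a pipeline could look, not the authors' implementation (see the linked repository for that). It assumes that uncertainty is measured as the entropy of the student's softmax output, that a fixed fraction `keep_ratio` of the most uncertain samples is kept, and that the mixing weight grows with uncertainty; the function names, `keep_ratio`, and the weighting rule are all hypothetical.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy (natural log); used here as the uncertainty score."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_uncertain(student_logits, keep_ratio):
    """Return (indices of the most uncertain samples, all entropy scores).

    Assumption: uncertainty = entropy of the student's prediction.
    """
    scores = [entropy(softmax(z)) for z in student_logits]
    k = max(1, int(round(keep_ratio * len(scores))))
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k], scores


def mixup(x_a, x_b, lam):
    """Convex combination of two inputs: lam * x_a + (1 - lam) * x_b."""
    return [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]


def build_teacher_batch(inputs, student_logits, keep_ratio=0.75, seed=0):
    """Build a smaller batch of mixed inputs to send to the teacher.

    Hypothetical weighting rule: the more uncertain a kept sample is,
    the larger its share in the mixed input (lam ranges over [0.5, 1.0]).
    """
    rng = random.Random(seed)
    kept, scores = select_uncertain(student_logits, keep_ratio)
    partners = kept[:]
    rng.shuffle(partners)
    max_h = max(scores[i] for i in kept) or 1.0  # avoid division by zero
    batch = []
    for i, j in zip(kept, partners):
        lam = 0.5 + 0.5 * scores[i] / max_h
        batch.append(mixup(inputs[i], inputs[j], lam))
    return kept, batch


if __name__ == "__main__":
    inputs = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [3.0, 1.0]]
    logits = [[5.0, 0.0], [0.1, 0.0], [0.0, 0.0], [2.0, 0.0]]
    kept, batch = build_teacher_batch(inputs, logits)
    print("kept indices:", kept)
    print("teacher queries:", len(batch), "of", len(inputs))
```

With `keep_ratio=0.75`, the teacher is queried on three of every four samples, which is one way a saving in the 20% to 30% range quoted in the abstract could arise; the sample the student is already confident about is the one dropped.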
Pages: 9
Related papers
50 in total
  • [41] An Efficient and Uncertainty-Aware Decision Support System for Disaster Response Using Aerial Imagery
    Bin, Junchi
    Zhang, Ran
    Wang, Rui
    Cao, Yue
    Zheng, Yufeng
    Blasch, Erik
    Liu, Zheng
    SENSORS, 2022, 22 (19)
  • [42] Efficient Uncertainty-aware Decision-making for Automated Driving Using Guided Branching
    Zhang, Lu
    Ding, Wenchao
    Chen, Jing
    Shen, Shaojie
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 3291 - 3297
  • [43] Uncertainty-Aware Health Diagnostics via Class-Balanced Evidential Deep Learning
    Xia, Tong
    Dang, Ting
    Han, Jing
    Qendro, Lorena
    Mascolo, Cecilia
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (11) : 6417 - 6428
  • [44] Incremental Pedestrian Attribute Recognition via Dual Uncertainty-Aware Pseudo-Labeling
    Li, Da
    Zhang, Zhang
    Shan, Caifeng
    Wang, Liang
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 2622 - 2636
  • [45] History-enhanced and Uncertainty-aware Trajectory Recovery via Attentive Neural Network
    Xia, Tong
    Li, Yong
    Qi, Yunhan
    Feng, Jie
    Xu, Fengli
    Sun, Funing
    Guo, Diansheng
    Jin, Depeng
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
  • [46] ORSI Salient Object Detection via Progressive Semantic Flow and Uncertainty-Aware Refinement
    Quan, Yueqian
    Xu, Honghui
    Wang, Renfang
    Guan, Qiu
    Zheng, Jianwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [47] Uncertainty-Aware Multimodal Trajectory Prediction via a Single Inference from a Single Model
    Suk, Ho
    Kim, Shiho
    SENSORS, 2025, 25 (01)
  • [48] Source-Free Image-Text Matching via Uncertainty-Aware Learning
    Tian, Mengxiao
    Yang, Shuo
    Wu, Xinxiao
    Jia, Yunde
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 3059 - 3063
  • [49] Efficient Crowd Counting via Dual Knowledge Distillation
    Wang, Rui
    Hao, Yixue
    Hu, Long
    Li, Xianzhi
    Chen, Min
    Miao, Yiming
    Humar, Iztok
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 569 - 583
  • [50] Efficient Biomedical Instance Segmentation via Knowledge Distillation
    Liu, Xiaoyu
    Hu, Bo
    Huang, Wei
    Zhang, Yueyi
    Xiong, Zhiwei
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 14 - 24