Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup

Cited by: 17

Authors:

Xu, Guodong [1]

Liu, Ziwei [2]

Loy, Chen Change [2]

Affiliations:

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] Nanyang Technol Univ, Singapore, Singapore

Source:

PATTERN RECOGNITION | 2023 / Vol. 138

Keywords:

Knowledge distillation; Training cost;

DOI:

10.1016/j.patcog.2023.109338

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Knowledge distillation (KD) has emerged as an essential technique not only for model compression, but also other learning tasks such as continual learning. Given the richer application spectrum and potential online usage of KD, knowledge distillation efficiency becomes a pivotal component. In this work, we study this little-explored but important topic. Unlike previous works that focus solely on the accuracy of student network, we attempt to achieve a harder goal - to obtain a performance comparable to conventional KD with a lower computation cost during the transfer. To this end, we present UNcertainty-aware mIXup (UNIX), an effective approach that can reduce transfer cost by 20% to 30% and yet maintain comparable or achieve even better student performance than conventional KD. This is made possible via effective uncertainty sampling and a novel adaptive mixup approach that select informative samples dynamically over ample data and compact knowledge in these samples. We show that our approach inherently performs hard sample mining. We demonstrate the applicability of our approach to improve various existing KD approaches by reducing their queries to a teacher network. Extensive experiments are performed on CIFAR100 and ImageNet. Code and model are available at https://github.com/xuguodong03/UNIXKD. © 2023 Elsevier Ltd. All rights reserved.
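
Illustrative note: the abstract names two ingredients, uncertainty sampling and an adaptive mixup, without giving their formulas. The sketch below is only a generic illustration of how such ingredients could be combined so that fewer inputs are sent to a teacher network; it is not the authors' implementation (see the repository linked above for that). The function names, the use of predictive entropy as the uncertainty score, the uncertainty-scaled mixing weight, and the `keep_ratio` of 0.75 are all assumptions made for this example.

```python
import numpy as np

def entropy_uncertainty(probs):
    """Predictive entropy per sample; higher means more uncertain.
    Using entropy as the uncertainty score is an assumption of this sketch."""
    eps = 1e-12
    return -np.sum(probs * np.log(probs + eps), axis=1)

def select_and_mix(images, student_probs, keep_ratio=0.75, rng=None):
    """Keep the most uncertain samples and mix each with a random partner.

    Hypothetical helper: returns fewer inputs than it receives, so a teacher
    would be queried on keep_ratio * N inputs instead of N.
    """
    rng = np.random.default_rng() if rng is None else rng
    n = len(images)
    k = max(1, int(round(keep_ratio * n)))

    u = entropy_uncertainty(student_probs)
    top = np.argsort(-u)[:k]            # indices of the k most uncertain samples
    partners = rng.permutation(n)[:k]   # random mixing partners

    # Assumed weighting: more uncertain samples keep more of their own content.
    lam = u[top] / (u[top].max() + 1e-12)
    lam = lam.reshape(-1, *([1] * (images.ndim - 1)))

    mixed = lam * images[top] + (1.0 - lam) * images[partners]
    return mixed, top, partners, lam

# Usage: a batch of 8 tiny "images" and random student class probabilities.
rng = np.random.default_rng(0)
images = rng.random((8, 3, 4, 4))
logits = rng.normal(size=(8, 10))
student_probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
mixed, top, partners, lam = select_and_mix(images, student_probs, rng=rng)
print(mixed.shape)  # 6 mixed inputs would be sent to the teacher instead of 8
```

With `keep_ratio=0.75` the teacher would see 25% fewer inputs, which falls inside the 20% to 30% range quoted in the abstract, but that value is chosen here purely for illustration.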


Pages: 9
