Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup

Cited by: 17
Authors
Xu, Guodong [1]
Liu, Ziwei [2]
Loy, Chen Change [2]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, Singapore, Singapore
Keywords
Knowledge distillation; Training cost
DOI
10.1016/j.patcog.2023.109338
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge distillation (KD) has emerged as an essential technique not only for model compression, but also for other learning tasks such as continual learning. Given the richer application spectrum and potential online usage of KD, knowledge distillation efficiency becomes a pivotal component. In this work, we study this little-explored but important topic. Unlike previous works that focus solely on the accuracy of the student network, we attempt to achieve a harder goal: to obtain performance comparable to conventional KD with a lower computation cost during the transfer. To this end, we present UNcertainty-aware mIXup (UNIX), an effective approach that can reduce transfer cost by 20% to 30% and yet maintain comparable or achieve even better student performance than conventional KD. This is made possible via effective uncertainty sampling and a novel adaptive mixup approach that dynamically select informative samples from ample data and compact the knowledge in these samples. We show that our approach inherently performs hard sample mining. We demonstrate the applicability of our approach to improve various existing KD approaches by reducing their queries to a teacher network. Extensive experiments are performed on CIFAR100 and ImageNet. Code and model are available at https://github.com/xuguodong03/UNIXKD. © 2023 Elsevier Ltd. All rights reserved.
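The abstract names two ingredients, uncertainty sampling and adaptive mixup, without giving their formulas. The sketch below is only one plausible reading, not the authors' implementation (see the linked repository for that). It assumes uncertainty is the entropy of the student's softmax output, that only the most uncertain samples are kept, and that the mixing weight grows with uncertainty. The function names (`select_uncertain`, `mix_pairs`) and the weight rule are hypothetical and chosen purely for illustration.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy (nats); used here as an ASSUMED uncertainty score."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_uncertain(student_logits, keep):
    """Return indices of the `keep` most uncertain samples, plus all scores."""
    scores = [entropy(softmax(row)) for row in student_logits]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:keep], scores


def mix_pairs(samples, picked, scores, rng):
    """Mix each picked sample with a random picked partner.

    ASSUMPTION: the weight on the sample itself lies in [0.5, 1] and
    increases with its uncertainty, so harder samples are diluted less.
    """
    max_score = max(scores[i] for i in picked) or 1.0
    mixed = []
    for i in picked:
        j = rng.choice(picked)
        lam = 0.5 + 0.5 * scores[i] / max_score
        mixed.append([lam * a + (1.0 - lam) * b
                      for a, b in zip(samples[i], samples[j])])
    return mixed


# Toy batch: 4 samples, 3 classes, 2 input features each.
logits = [[4.0, 0.0, 0.0], [0.1, 0.0, 0.2], [2.0, 1.9, 0.0], [9.0, 0.0, 0.0]]
samples = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [1.0, 1.0]]
picked, scores = select_uncertain(logits, keep=2)
mixed = mix_pairs(samples, picked, scores, random.Random(0))
print(picked, mixed)
```

In this toy setup only the two mixed inputs would be passed to the teacher instead of all four originals, which is the kind of saving in teacher queries the abstract refers to.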
Pages: 9