Knowledge Distillation by On-the-Fly Native Ensemble

Cited by: 0
Authors
Lan, Xu [1]
Zhu, Xiatian [2]
Gong, Shaogang [1]
Affiliations
[1] Queen Mary Univ London, London, England
[2] Vis Semant Ltd, London, England
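Funding
Innovate UK project
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract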
Knowledge distillation is effective for training small, generalisable network models that meet low-memory and fast-execution requirements. Existing offline distillation methods rely on a strong pre-trained teacher, which enables favourable knowledge discovery and transfer but requires a complex two-phase training procedure. Online counterparts address this limitation at the price of lacking a high-capacity teacher. In this work, we present an On-the-fly Native Ensemble (ONE) learning strategy for one-stage online distillation. Specifically, ONE trains only a single multi-branch network while simultaneously establishing a strong teacher on the fly to enhance the learning of the target network. Extensive evaluations show that ONE improves the generalisation performance of a variety of deep neural networks more significantly than alternative methods on four image classification datasets (CIFAR10, CIFAR100, SVHN, and ImageNet), whilst also offering computational efficiency advantages.
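The abstract describes the idea only at a high level, so the following is a minimal sketch of what a one-stage, multi-branch distillation objective could look like, not the authors' implementation. It assumes details the abstract does not state: that the on-the-fly teacher is a gate-weighted sum of branch logits, that each branch is distilled with a temperature-softened KL term, and that all loss terms are weighted equally. The names `one_style_loss`, `gate_scores`, and `temperature` are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Cross-entropy of a single example against an integer class label."""
    return -math.log(softmax(logits)[label])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def one_style_loss(branch_logits, gate_scores, label, temperature=3.0):
    """Illustrative loss for one example (all design choices are assumptions).

    branch_logits: one list of class logits per branch.
    gate_scores:   one unnormalised score per branch (hypothetical gate output).
    """
    gates = softmax(gate_scores)
    num_classes = len(branch_logits[0])

    # Assumed teacher: gate-weighted combination of the branch logits.
    teacher_logits = [
        sum(g * branch[c] for g, branch in zip(gates, branch_logits))
        for c in range(num_classes)
    ]

    # Supervised terms for every branch and for the ensemble teacher.
    ce_branches = sum(cross_entropy(b, label) for b in branch_logits)
    ce_teacher = cross_entropy(teacher_logits, label)

    # Distillation term: pull each branch towards the softened teacher.
    teacher_soft = softmax(teacher_logits, temperature)
    distill = sum(
        kl_divergence(teacher_soft, softmax(b, temperature))
        for b in branch_logits
    )

    return ce_branches + ce_teacher + temperature ** 2 * distill


if __name__ == "__main__":
    logits = [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0]]
    print(one_style_loss(logits, gate_scores=[0.0, 0.0], label=0))
```

In this sketch the teacher exists only as a combination of the branches being trained, which is what allows a single training stage. At deployment one branch could be kept and the others discarded, though the abstract itself does not spell this out.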
Pages: 11