Few-Shot Learning of Compact Models via Task-Specific Meta Distillation

Cited by: 2
Authors
Wu, Yong [1]
Chanda, Shekhor [2]
Hosseinzadeh, Mehrdad [3]
Liu, Zhi [1]
Wang, Yang [4]
Affiliations
[1] Shanghai Univ, Shanghai, Peoples R China
[2] Univ Manitoba, Winnipeg, MB, Canada
[3] Huawei Technol Canada, Markham, ON, Canada
[4] Concordia Univ, Montreal, PQ, Canada
Funding
National Natural Science Foundation of China; Natural Sciences and Engineering Research Council of Canada
DOI
10.1109/WACV56688.2023.00620
Chinese Library Classification number
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We consider a new problem of few-shot learning of compact models. Meta-learning is a popular approach for few-shot learning. Previous work in meta-learning typically assumes that the model architecture during meta-training is the same as the model architecture used for final deployment. In this paper, we challenge this basic assumption. For final deployment, we often need the model to be small. But small models usually do not have enough capacity to effectively adapt to new tasks. Meanwhile, we often have access to a large dataset and extensive computing power during meta-training, since meta-training is typically performed on a server. In this paper, we propose task-specific meta distillation that simultaneously learns two models in meta-learning: a large teacher model and a small student model. These two models are jointly learned during meta-training. Given a new task during meta-testing, the teacher model is first adapted to this task, then the adapted teacher model is used to guide the adaptation of the student model. The adapted student model is used for final deployment. We demonstrate the effectiveness of our approach in few-shot image classification using model-agnostic meta-learning (MAML). Our proposed method outperforms other alternatives on several benchmark datasets.
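
The meta-testing procedure described in the abstract (adapt the teacher to the new task, then let the adapted teacher guide the student's adaptation) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes linear softmax classifiers, zero weights standing in for the meta-learned initializations, a distillation target that mixes the hard labels with the teacher's soft predictions (weight `alpha`), and a hypothetical `shrink` function that gives the student a reduced input. The joint meta-training stage is not shown.

```python
import math
import random

def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict(W, x):
    """Class probabilities of a linear softmax classifier with weights W[c][j]."""
    return softmax([sum(w * v for w, v in zip(row, x)) for row in W])

def cross_entropy(W, xs, targets):
    """Mean cross-entropy between (hard or soft) targets and predictions."""
    total = 0.0
    for x, t in zip(xs, targets):
        p = predict(W, x)
        total -= sum(tc * math.log(pc) for tc, pc in zip(t, p))
    return total / len(xs)

def adapt(W, xs, targets, lr=0.1, steps=10):
    """A few gradient-descent steps on cross-entropy (the inner-loop adaptation)."""
    W = [row[:] for row in W]
    for _ in range(steps):
        grads = [[0.0] * len(W[0]) for _ in W]
        for x, t in zip(xs, targets):
            p = predict(W, x)
            for c in range(len(W)):
                for j, v in enumerate(x):
                    grads[c][j] += (p[c] - t[c]) * v / len(xs)
        W = [[w - lr * g for w, g in zip(wr, gr)] for wr, gr in zip(W, grads)]
    return W

def one_hot(labels, n_classes):
    return [[1.0 if c == y else 0.0 for c in range(n_classes)] for y in labels]

def mix_targets(hard, soft, alpha):
    """Assumed distillation target: (1 - alpha) * labels + alpha * teacher output."""
    return [[(1 - alpha) * h + alpha * s for h, s in zip(hr, sr)]
            for hr, sr in zip(hard, soft)]

def meta_test_adapt(W_teacher, W_student, xs, labels, shrink, alpha=0.5):
    hard = one_hot(labels, len(W_teacher))
    # Step 1: adapt the large teacher to the new task's support set.
    W_t = adapt(W_teacher, xs, hard)
    # Step 2: the adapted teacher's predictions guide the student's adaptation.
    soft = [predict(W_t, x) for x in xs]
    W_s = adapt(W_student, [shrink(x) for x in xs], mix_targets(hard, soft, alpha))
    return W_t, W_s

# Toy 3-way, 2-shot task: the teacher sees 8 features, the student only the first 3.
random.seed(0)
labels = [0, 0, 1, 1, 2, 2]
xs = [[random.gauss(0.0, 1.0) for _ in range(8)] for _ in labels]
shrink = lambda x: x[:3]
W_teacher0 = [[0.0] * 8 for _ in range(3)]
W_student0 = [[0.0] * 3 for _ in range(3)]
W_t, W_s = meta_test_adapt(W_teacher0, W_student0, xs, labels, shrink)
```

Only `W_s`, the adapted student, would be deployed in this sketch; the teacher is used solely to produce soft targets during adaptation.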
Pages: 6254-6263
Page count: 10
Related papers
50 in total
  • [1] Graph Few-shot Learning with Task-specific Structures
    Wang, Song
    Chen, Chen
    Li, Jundong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] A Task-Specific Meta-Learning Framework for Few-Shot Sound Event Detection
    Zhang, Tianyang
    Yang, Liping
    Gu, Xiaohua
    Wang, Yuyang
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [3] Learning Task-Specific Embeddings for Few-Shot Classification via Local Weight Adaptation
    Gong, Nianru
    Duan, Pengfei
    Rong, Yi
    2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 485 - 491
  • [4] Task-specific method-agnostic metric for few-shot learning
    Wang, Heng
    Li, Yong
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (04): : 3115 - 3124
  • [5] Learning task-specific discriminative embeddings for few-shot image classification
    Xing, Lei
    Shao, Shuai
    Liu, Weifeng
    Han, Anxun
    Pan, Xiangshuai
    Liu, Bao-Di
    NEUROCOMPUTING, 2022, 488 : 1 - 13
  • [6] Task-specific method-agnostic metric for few-shot learning
    Heng Wang
    Yong Li
    Neural Computing and Applications, 2023, 35 : 3115 - 3124
  • [7] Cross-domain Few-shot Learning with Task-specific Adapters
    Li, Wei-Hong
    Liu, Xialei
    Bilen, Hakan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 7151 - 7160
  • [8] Improving Task-Specific Generalization in Few-Shot Learning via Adaptive Vicinal Risk Minimization
    Huang, Long-Kai
    Wei, Ying
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [9] Task-specific contrastive learning for few-shot remote sensing image scene classification
    Zeng, Qingjie
    Geng, Jie
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 191 : 143 - 154
  • [10] SNIP-FSL: Finding task-specific lottery jackpots for few-shot learning
    Wang, Ren
    Sun, Haoliang
    Nie, Xiushan
    Yin, Yilong
    KNOWLEDGE-BASED SYSTEMS, 2022, 247