Few-Shot Learning of Compact Models via Task-Specific Meta Distillation

Cited by: 2
Authors
Wu, Yong [1]
Chanda, Shekhor [2]
Hosseinzadeh, Mehrdad [3]
Liu, Zhi [1]
Wang, Yang [4]
Affiliations
[1] Shanghai Univ, Shanghai, Peoples R China
[2] Univ Manitoba, Winnipeg, MB, Canada
[3] Huawei Technol Canada, Markham, ON, Canada
[4] Concordia Univ, Montreal, PQ, Canada
Funding
National Natural Science Foundation of China; Natural Sciences and Engineering Research Council of Canada
DOI
10.1109/WACV56688.2023.00620
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We consider a new problem of few-shot learning of compact models. Meta-learning is a popular approach for few-shot learning. Previous work in meta-learning typically assumes that the model architecture during meta-training is the same as the model architecture used for final deployment. In this paper, we challenge this basic assumption. For final deployment, we often need the model to be small. But small models usually do not have enough capacity to effectively adapt to new tasks. In the meantime, we often have access to a large dataset and extensive computing power during meta-training, since meta-training is typically performed on a server. In this paper, we propose task-specific meta distillation that simultaneously learns two models in meta-learning: a large teacher model and a small student model. These two models are jointly learned during meta-training. Given a new task during meta-testing, the teacher model is first adapted to this task, then the adapted teacher model is used to guide the adaptation of the student model. The adapted student model is used for final deployment. We demonstrate the effectiveness of our approach in few-shot image classification using model-agnostic meta-learning (MAML). Our proposed method outperforms other alternatives on several benchmark datasets.
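The meta-testing procedure described in the abstract (adapt the teacher to the new task, then let the adapted teacher guide the student's adaptation) can be illustrated with a toy sketch. This is not the authors' implementation: the linear softmax classifiers, the zero initializations standing in for meta-learned ones, the mixing weight `alpha`, the learning rate, and the four-example support set are all assumptions made for illustration, and the joint meta-training stage is omitted entirely.

```python
import math

def softmax(logits):
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict(weights, x):
    # Linear classifier: one weight row per class. zip() truncates x to the
    # row length, so a "small" model with shorter rows sees fewer features.
    return softmax([sum(w * v for w, v in zip(row, x)) for row in weights])

def one_hot(label, num_classes):
    return [1.0 if c == label else 0.0 for c in range(num_classes)]

def adapt(weights, support, target_fn, lr=0.5, steps=50):
    # Inner-loop adaptation: gradient steps on cross-entropy against the
    # target distribution returned by target_fn. For softmax outputs the
    # gradient with respect to logit c is (probability_c - target_c).
    weights = [list(row) for row in weights]  # leave the initialization untouched
    for _ in range(steps):
        for x, y in support:
            probs = predict(weights, x)
            target = target_fn(x, y)
            for c, row in enumerate(weights):
                grad_logit = probs[c] - target[c]
                for j in range(len(row)):
                    row[j] -= lr * grad_logit * x[j]
    return weights

def adapt_teacher_then_student(teacher_init, student_init, support, alpha=0.5):
    num_classes = len(teacher_init)
    # Step 1: adapt the teacher to the task using the hard labels only.
    teacher = adapt(teacher_init, support, lambda x, y: one_hot(y, num_classes))

    # Step 2: adapt the student toward a mix of hard labels and the adapted
    # teacher's soft predictions (alpha is a hypothetical mixing weight).
    def distill_target(x, y):
        hard = one_hot(y, num_classes)
        soft = predict(teacher, x)
        return [alpha * h + (1 - alpha) * s for h, s in zip(hard, soft)]

    student = adapt(student_init, support, distill_target)
    return teacher, student

# Hypothetical 2-way support set with 4 features per example.
support = [
    ([1.0, 0.0, 0.5, 0.0], 0),
    ([0.9, 0.1, 0.4, 0.1], 0),
    ([0.0, 1.0, 0.0, 0.5], 1),
    ([0.1, 0.9, 0.1, 0.6], 1),
]
teacher_init = [[0.0] * 4 for _ in range(2)]  # "large" model: all 4 features
student_init = [[0.0] * 2 for _ in range(2)]  # "small" model: first 2 features
teacher, student = adapt_teacher_then_student(teacher_init, student_init, support)
```

Only `student` would be deployed in this sketch; the teacher exists solely to supply soft targets during adaptation, which mirrors the division of roles the abstract describes.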
Pages: 6254-6263
Number of pages: 10
Related papers
50 in total
  • [31] Few-shot Learning with Online Self-Distillation
    Liu, Sihan
    Wang, Yue
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1067 - 1070
  • [32] A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER
    Dong, Guanting
    Wang, Zechen
    Zhao, Jinxu
    Zhao, Gang
    Guo, Daichi
    Fu, Dayuan
    Hui, Tingfeng
    Zeng, Chen
    He, Keqing
    Li, Xuefeng
    Wang, Liwen
    Cui, Xinyue
    Xu, Weiran
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 430 - 440
  • [33] Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning
    Baik, Sungyong
    Choi, Janghoon
    Kim, Heewon
    Cho, Dohee
    Min, Jaesik
    Lee, Kyoung Mu
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9445 - 9454
  • [34] Learning Dynamic Alignment via Meta-filter for Few-shot Learning
    Xu, Chengming
    Fu, Yanwei
    Liu, Chen
    Wang, Chengjie
    Li, Jilin
    Huang, Feiyue
    Zhang, Li
    Xue, Xiangyang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5178 - 5187
  • [35] Few-Shot Learning with Embedded Class Models and Shot-Free Meta Training
    Ravichandran, Avinash
    Bhotika, Rahul
    Soatto, Stefano
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 331 - 339
  • [36] Weakly Supervised Few-Shot Segmentation via Meta-Learning
    Gama, Pedro H. T.
    Oliveira, Hugo
    Marcato Jr, Jose
    dos Santos, Jefersson A.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1784 - 1797
  • [37] Few-Shot Human Motion Prediction via Meta-learning
    Gui, Liang-Yan
    Wang, Yu-Xiong
    Ramanan, Deva
    Moura, Jose M. F.
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 441 - 459
  • [38] Few-Shot Named Entity Recognition via Meta-Learning
    Li, Jing
    Chiu, Billy
    Feng, Shanshan
    Wang, Hao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4245 - 4256
  • [39] Unsupervised meta-learning for few-shot learning
    Xu, Hui
    Wang, Jiaxing
    Li, Hao
    Ouyang, Deqiang
    Shao, Jie
    PATTERN RECOGNITION, 2021, 116
  • [40] Calibrating CNNs for Few-Shot Meta Learning
    Yang, Peng
    Ren, Shaogang
    Zhao, Yang
    Li, Ping
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 408 - 417