META-LEARNING WITH ATTENTION FOR IMPROVED FEW-SHOT LEARNING

被引:2
|
作者
Hou, Zejiang [1 ]
Walid, Anwar [2 ]
Kung, Sun-Yuan [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] Nokia Bell Labs, Murray Hill, NJ USA
关键词
Few-shot learning; meta-learning; attention mechanism;
D O I
10.1109/ICASSP39728.2021.9414936
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider few-shot learning (FSL), where a model learns from very few labeled examples such that it can generalize to unseen examples. Model-agnostic meta-learning (MAML) has been proposed to solve FSL. However, the low performance of MAML suggests its difficulty in tackle diverse tasks, due to the restriction of sharing a single model initialization for fast adaptation. In this paper, we propose meta-learning with attention mechanisms. Our method meta-learns attention modules to instantiate task-specific model initialization for fast adaptation, which can obtain high-quality solution to a new task using few gradient descent steps. To further improve generalization during inference, we propose to incorporate an entropy regularizer into the adaptation objective to penalize the Shannon entropy of prediction probability. Extensive experiments under various FSL scenarios show that our method achieves state-of-the-art performance on the mini-ImageNet and tiered-ImageNet.
引用
收藏
页码:2725 / 2729
页数:5
相关论文
共 50 条
  • [1] Unsupervised meta-learning for few-shot learning
    Xu, Hui
    Wang, Jiaxing
    Li, Hao
    Ouyang, Deqiang
    Shao, Jie
    [J]. PATTERN RECOGNITION, 2021, 116
  • [2] Prior-knowledge and attention based meta-learning for few-shot learning
    Qin, Yunxiao
    Zhang, Weiguo
    Zhao, Chenxu
    Wang, Zezheng
    Zhu, Xiangyu
    Shi, Jingping
    Qi, Guojun
    Lei, Zhen
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [3] Task Agnostic Meta-Learning for Few-Shot Learning
    Jamal, Muhammad Abdullah
    Qi, Guo-Jun
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
  • [4] Meta-Learning for Few-Shot NMT Adaptation
    Sharaf, Amr
    Hassan, Hany
    Daume, Hal, III
    [J]. NEURAL GENERATION AND TRANSLATION, 2020, : 43 - 53
  • [5] Fair Meta-Learning For Few-Shot Classification
    Zhao, Chen
    Li, Changbin
    Li, Jincheng
    Chen, Feng
    [J]. 11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 275 - 282
  • [6] Stress Testing of Meta-learning Approaches for Few-shot Learning
    Aimen, Aroof
    Sidheekh, Sahil
    Madan, Vineet
    Krishnan, Narayanan C.
    [J]. AAAI WORKSHOP ON META-LEARNING AND METADL CHALLENGE, VOL 140, 2021, 140 : 38 - 44
  • [7] MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning
    Zhang, Baoquan
    Luo, Chuyao
    Yu, Demin
    Li, Xutao
    Lin, Huiwei
    Ye, Yunming
    Zhang, Bowen
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16687 - 16695
  • [8] Adversarially Robust Few-Shot Learning: A Meta-Learning Approach
    Goldblum, Micah
    Fowl, Liam
    Goldstein, Tom
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [9] Meta-learning for few-shot time series forecasting
    Xiao, Feng
    Liu, Lu
    Han, Jiayu
    Guo, Degui
    Wang, Shang
    Cui, Hai
    Peng, Tao
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (01) : 325 - 341
  • [10] Few-Shot Directed Meta-Learning for Image Classification
    Ouyang, Jihong
    Duan, Ganghai
    Liu, Siguang
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (01)