Contextual Gradient Scaling for Few-Shot Learning

Authors
Lee, Sanghyuk [1]
Lee, Seunghyun [1]
Song, Byung Cheol [1]
Affiliations
[1] Inha Univ, Dept Elect & Comp Engn, Incheon, South Korea
DOI
10.1109/WACV51458.2022.00356
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Model-agnostic meta-learning (MAML) is a well-known optimization-based meta-learning algorithm that works well in various computer vision tasks, e.g., few-shot classification. MAML learns an initialization from which a model can adapt to a new task in a few steps. However, since the gradient norm of the classifier (head) is much larger than those of the backbone layers, the model focuses on learning the decision boundary of the classifier with similar representations. Furthermore, the gradient norms of high-level layers are smaller than those of the other layers. As a result, the backbone of MAML usually learns task-generic features, which degrades adaptation performance in the inner loop. To resolve or mitigate this problem, we propose contextual gradient scaling (CxGrad), which scales the gradient norms of the backbone to facilitate learning task-specific knowledge in the inner loop. Since the scaling factors are generated from task-conditioned parameters, the gradient norms of the backbone can be scaled in a task-wise fashion. Experimental results show that CxGrad effectively encourages the backbone to learn task-specific knowledge in the inner loop and improves the performance of MAML by a significant margin in both same- and cross-domain few-shot classification.
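The idea of scaling backbone gradients per task in the inner loop can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the scaling factors are computed, so the generator `task_scales`, its softplus form, the `task_embedding` and `layer_weights` inputs, and the layer names are all hypothetical choices made only for illustration. The sketch shows one inner-loop SGD step in which each backbone layer's gradient is multiplied by a task-dependent positive factor while the head is left unscaled.

```python
import math


def l2_norm(vec):
    """Euclidean norm of a flat list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def task_scales(task_embedding, layer_weights):
    """Hypothetical generator of per-layer scaling factors.

    Assumption: each backbone layer owns a weight vector, and its factor is
    softplus(weights . task_embedding), which is always positive.
    """
    scales = {}
    for name, weights in layer_weights.items():
        z = sum(w * e for w, e in zip(weights, task_embedding))
        scales[name] = math.log1p(math.exp(z))  # softplus
    return scales


def scaled_inner_step(params, grads, scales, lr=0.1):
    """One inner-loop SGD step with per-layer gradient scaling.

    Layers without an entry in `scales` (here, the head) use a factor of 1.0.
    Multiplying a gradient by gamma multiplies its norm by gamma.
    """
    updated = {}
    for name, weights in params.items():
        gamma = scales.get(name, 1.0)
        updated[name] = [w - lr * gamma * g
                         for w, g in zip(weights, grads[name])]
    return updated


# Toy values (made up): the backbone gradient is much smaller than the head's.
params = {"backbone.layer1": [1.0, -2.0], "head": [0.5]}
grads = {"backbone.layer1": [0.03, -0.04], "head": [1.0]}

task_embedding = [1.0, 2.0]                      # hypothetical task context
layer_weights = {"backbone.layer1": [0.5, 0.25]}  # hypothetical parameters

scales = task_scales(task_embedding, layer_weights)
new_params = scaled_inner_step(params, grads, scales, lr=0.1)
```

Because the factors depend on `task_embedding`, two tasks with different embeddings would update the same backbone layer with different effective step sizes, which is the task-wise behaviour the abstract describes.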
Pages: 3503-3512
Number of pages: 10