Contextual Gradient Scaling for Few-Shot Learning

Cited by: 0
Authors
Lee, Sanghyuk [1]
Lee, Seunghyun [1]
Song, Byung Cheol [1]
Affiliations
[1] Inha Univ, Dept Elect & Comp Engn, Incheon, South Korea
DOI
10.1109/WACV51458.2022.00356
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Model-agnostic meta-learning (MAML) is a well-known optimization-based meta-learning algorithm that works well in various computer vision tasks, e.g., few-shot classification. MAML learns an initialization from which a model can adapt to a new task in a few gradient steps. However, since the gradient norm of the classifier (head) is much larger than those of the backbone layers, the model focuses on learning the decision boundary of the classifier while the representations remain similar. Furthermore, the gradient norms of high-level layers are smaller than those of the other layers. As a result, the backbone of MAML usually learns task-generic features, which deteriorates adaptation performance in the inner loop. To mitigate this problem, we propose contextual gradient scaling (CxGrad), which scales the gradient norms of the backbone to facilitate learning task-specific knowledge in the inner loop. Since the scaling factors are generated from task-conditioned parameters, the gradient norms of the backbone can be scaled in a task-wise fashion. Experimental results show that CxGrad effectively encourages the backbone to learn task-specific knowledge in the inner loop and improves the performance of MAML by a significant margin in both same-domain and cross-domain few-shot classification.
Pages: 3503-3512
Page count: 10
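
For illustration, below is a minimal Python/PyTorch sketch of the idea summarized in the abstract: an inner-loop update in which the backbone's gradients are multiplied by task-wise scaling factors produced from a task-conditioned embedding, while the head is updated as in plain MAML. The network architecture, the use of the mean support-set feature as the task embedding, and all names and hyperparameters (Learner, ContextScaler, inner_lr, etc.) are assumptions made for this sketch, not the paper's actual implementation.

# Minimal sketch of an inner-loop update with contextual gradient scaling.
# Assumptions (not from the paper): the backbone/head split, the tiny context
# network, the mean-feature task embedding, and all hyperparameters below.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Learner(nn.Module):
    def __init__(self, in_dim=32, hid=64, n_classes=5):
        super().__init__()
        self.backbone = nn.Sequential(nn.Linear(in_dim, hid), nn.ReLU(),
                                      nn.Linear(hid, hid), nn.ReLU())
        self.head = nn.Linear(hid, n_classes)

    def forward(self, x):
        return self.head(self.backbone(x))

class ContextScaler(nn.Module):
    """Maps a task embedding to one positive scale per backbone parameter tensor."""
    def __init__(self, emb_dim, n_backbone_tensors):
        super().__init__()
        self.fc = nn.Linear(emb_dim, n_backbone_tensors)

    def forward(self, task_emb):
        # Softplus keeps the scales positive.
        return F.softplus(self.fc(task_emb)) + 1e-6

def inner_loop_step(model, scaler, support_x, support_y, inner_lr=0.01):
    """One adaptation step where backbone gradients are scaled task-wise."""
    loss = F.cross_entropy(model(support_x), support_y)

    backbone_params = list(model.backbone.parameters())
    head_params = list(model.head.parameters())
    grads = torch.autograd.grad(loss, backbone_params + head_params, create_graph=True)
    b_grads, h_grads = grads[:len(backbone_params)], grads[len(backbone_params):]

    # Task-conditioned embedding: here simply the mean support feature (an assumption).
    task_emb = model.backbone(support_x).mean(dim=0)
    scales = scaler(task_emb)  # one scale per backbone parameter tensor

    adapted = {}
    for i, (name, p) in enumerate(model.backbone.named_parameters()):
        adapted['backbone.' + name] = p - inner_lr * scales[i] * b_grads[i]
    for (name, p), g in zip(model.head.named_parameters(), h_grads):
        adapted['head.' + name] = p - inner_lr * g  # head gradients left unscaled
    return adapted, loss

if __name__ == "__main__":
    torch.manual_seed(0)
    model = Learner()
    scaler = ContextScaler(emb_dim=64,
                           n_backbone_tensors=len(list(model.backbone.parameters())))
    x = torch.randn(25, 32)            # synthetic 5-way 5-shot support set
    y = torch.randint(0, 5, (25,))
    adapted, loss = inner_loop_step(model, scaler, x, y)
    print(f"support loss: {loss.item():.4f}, adapted tensors: {len(adapted)}")

Because create_graph=True is used, the adapted parameters stay differentiable with respect to both the initialization and the scaler, so an outer-loop (query-set) loss could update them end to end as in standard MAML training.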