Contextual Gradient Scaling for Few-Shot Learning

Cited by: 0
Authors
Lee, Sanghyuk [1 ]
Lee, Seunghyun [1 ]
Song, Byung Cheol [1 ]
Affiliations
[1] Inha Univ, Dept Elect & Comp Engn, Incheon, South Korea
Keywords
DOI
10.1109/WACV51458.2022.00356
CLC number
TP18 [Theory of artificial intelligence];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Model-agnostic meta-learning (MAML) is a well-known optimization-based meta-learning algorithm that works well in various computer vision tasks, e.g., few-shot classification. MAML learns an initialization from which a model can adapt to a new task in a few gradient steps. However, since the gradient norm of the classifier (head) is much larger than those of the backbone layers, the model focuses on learning the decision boundary of the classifier while the representations remain largely unchanged. Furthermore, the gradient norms of high-level layers are smaller than those of the other layers. As a result, the backbone of MAML usually learns task-generic features, which degrades adaptation performance in the inner loop. To mitigate this problem, we propose contextual gradient scaling (CxGrad), which scales the gradient norms of the backbone to facilitate learning task-specific knowledge in the inner loop. Since the scaling factors are generated from task-conditioned parameters, the gradient norms of the backbone can be scaled in a task-wise fashion. Experimental results show that CxGrad effectively encourages the backbone to learn task-specific knowledge in the inner loop and improves the performance of MAML by a significant margin in both same- and cross-domain few-shot classification.
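To make the idea in the abstract concrete, the sketch below shows one MAML-style inner-loop step in which the backbone gradients are multiplied by task-conditioned scaling factors before the update, while the head is updated as usual. This is a minimal PyTorch sketch, not the authors' implementation; names such as scale_generator and model_forward, and the use of pooled support features as the task embedding, are illustrative assumptions.

```python
# Minimal sketch (assumed, not the authors' code) of a MAML inner-loop step
# with task-wise gradient scaling applied to the backbone.
import torch
import torch.nn.functional as F

def inner_loop_step(backbone_params, head_params, scale_generator,
                    support_x, support_y, model_forward, inner_lr=0.01):
    """One adaptation step: compute the support-set loss, then rescale the
    backbone gradients layer-wise with task-conditioned factors before the
    SGD-style update; the head gradients are left unscaled."""
    # model_forward is assumed to return (logits, backbone features).
    logits, features = model_forward(backbone_params, head_params, support_x)
    loss = F.cross_entropy(logits, support_y)

    all_params = list(backbone_params) + list(head_params)
    grads = torch.autograd.grad(loss, all_params, create_graph=True)
    backbone_grads = grads[:len(backbone_params)]
    head_grads = grads[len(backbone_params):]

    # Task-conditioned scaling factors: here generated from pooled support
    # features, one positive scalar per backbone parameter tensor (assumption).
    task_embedding = features.mean(dim=0)
    scales = scale_generator(task_embedding)  # shape: [len(backbone_params)]

    updated_backbone = [p - inner_lr * s * g
                        for p, g, s in zip(backbone_params, backbone_grads, scales)]
    updated_head = [p - inner_lr * g
                    for p, g in zip(head_params, head_grads)]
    return updated_backbone, updated_head
```

In the outer loop, the query-set loss of the adapted parameters would then update both the initialization and the parameters of the scaling-factor generator, as in standard MAML training.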
Pages: 3503 - 3512
Number of pages: 10