FEW-SHOT LEARNING BY DIMENSIONALITY REDUCTION IN GRADIENT SPACE

Cited by: 0
Authors
Gauch, Martin [1,2,6]
Beck, Maximilian [1,2]
Adler, Thomas [1,2]
Kotsur, Dmytro [3]
Fiel, Stefan [3]
Eghbal-Zadeh, Hamid [1,2]
Brandstetter, Johannes [1,2]
Kofler, Johannes [1,2]
Holzleitner, Markus [1,2]
Zellinger, Werner [4]
Klotz, Daniel [1,2]
Hochreiter, Sepp [1,2,5]
Lehner, Sebastian [1,2]
Affiliations
[1] Johannes Kepler Univ Linz, Inst Machine Learning, ELLIS Unit Linz, Linz, Austria
[2] Johannes Kepler Univ Linz, Inst Machine Learning, LIT AI Lab, Linz, Austria
[3] Anyline GmbH, Vienna, Austria
[4] Austrian Acad Sci, Johann Radon Inst Computat & Appl Math, Linz, Austria
[5] Inst Adv Res Artificial Intelligence IARAI, Vienna, Austria
[6] Microsoft Res, Redmond, WA 98052 USA
Funding
European Union Horizon 2020;
Keywords
CLIMATE-CHANGE; MODEL;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Theory of Artificial Intelligence];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We introduce SubGD, a novel few-shot learning method based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace. In experimental and theoretical analyses, we show that models confined to a suitable predefined subspace generalize well for few-shot learning. A suitable subspace fulfills three criteria across the given tasks: it (a) allows the training error to be reduced by gradient flow, (b) leads to models that generalize well, and (c) can be identified by stochastic gradient descent. SubGD identifies these subspaces from an eigendecomposition of the auto-correlation matrix of update directions across different tasks. Demonstrably, we can identify suitable low-dimensional subspaces for few-shot learning of dynamical systems, whose varying properties are described by one or a few parameters of the analytical system description. Such systems are ubiquitous in real-world applications in science and engineering. We experimentally corroborate the advantages of SubGD on three distinct dynamical-systems problem settings, significantly outperforming popular few-shot learning methods in terms of both sample efficiency and performance.
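To make the subspace construction described in the abstract concrete, the following is a minimal sketch, not the authors' implementation: it assumes that one flattened update direction per training task has already been collected, eigendecomposes their uncentered auto-correlation matrix, and projects gradients onto the dominant eigenvectors during few-shot fine-tuning. The function names, the hard truncation to the top k eigenvectors, and the toy data are illustrative assumptions; for realistic parameter counts one would avoid forming the full parameter-by-parameter matrix (e.g., by taking an SVD of the stacked update vectors instead).

```python
import numpy as np

def dominant_subspace(task_updates, k):
    """Eigendecompose the auto-correlation matrix of update directions.

    task_updates: array of shape (n_tasks, n_params); each row is a
    flattened parameter-update direction collected on one training task
    (hypothetical preprocessing, not shown here).
    Returns an (n_params, k) orthonormal basis of the k dominant directions.
    """
    # Uncentered auto-correlation matrix of the update directions.
    corr = task_updates.T @ task_updates / task_updates.shape[0]
    eigvals, eigvecs = np.linalg.eigh(corr)   # eigenvalues in ascending order
    return eigvecs[:, -k:]                    # keep the top-k eigenvectors

def restrict_gradient(grad, basis):
    """Project a flattened gradient onto the learned subspace before an update."""
    return basis @ (basis.T @ grad)

# Toy usage: 8 tasks, a 20-parameter model, a 3-dimensional subspace.
rng = np.random.default_rng(0)
updates = rng.normal(size=(8, 20))
basis = dominant_subspace(updates, k=3)
projected = restrict_gradient(rng.normal(size=20), basis)
```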
Pages: 22