FEW-SHOT LEARNING BY DIMENSIONALITY REDUCTION IN GRADIENT SPACE

被引：0

作者：

Gauch, Martin ^{[1
,2
,6
]}

Beck, Maximilian ^{[1
,2
]}

Adler, Thomas ^{[1
,2
]}

Kotsur, Dmytro ^{[3
]}

Fiel, Stefan ^{[3
]}

Eghbal-Zadeh, Hamid ^{[1
,2
]}

Brandstetter, Johannes ^{[1
,2
]}

Kofler, Johannes ^{[1
,2
]}

Holzleitner, Markus ^{[1
,2
]}

Zellinger, Werner ^{[4
]}

Klotz, Daniel ^{[1
,2
]}

Hochreiter, Sepp ^{[1
,2
,5
]}

Lehner, Sebastian ^{[1
,2
]}

机构：

[1] Johannes Kepler Univ Linz, Inst Machine Learning, ELLIS Unit Linz, Linz, Austria

[2] Johannes Kepler Univ Linz, Inst Machine Learning, LIT AI Lab, Linz, Austria

[3] Anyline GmbH, Vienna, Austria

[4] Austrian Acad Sci, Johann Radon Inst Computat & Appl Math, Linz, Austria

[5] Inst Adv Res Artificial Intelligence IARAI, Vienna, Austria

[6] Microsoft Res, Redmond, WA 98052 USA

来源：

CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199 | 2022年 / 199卷

基金：

欧盟地平线“2020”;

关键词：

CLIMATE-CHANGE; MODEL;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce SubGD, a novel few-shot learning method which is based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace. In experimental and theoretical analyses, we show that models confined to a suitable predefined subspace generalize well for few-shot learning. A suitable subspace fulfills three criteria across the given tasks: it (a) allows to reduce the training error by gradient flow, (b) leads to models that generalize well, and (c) can be identified by stochastic gradient descent. SubGD identifies these subspaces from an eigendecomposition of the auto-correlation matrix of update directions across different tasks. Demonstrably, we can identify low-dimensional suitable subspaces for few-shot learning of dynamical systems, which have varying properties described by one or few parameters of the analytical system description. Such systems are ubiquitous among real-world applications in science and engineering. We experimentally corroborate the advantages of SubGD on three distinct dynamical systems problem settings, significantly outperforming popular few-shot learning methods both in terms of sample efficiency and performance.

引用

页数：22

共 50 条

[21] Few-Shot Learning With Class Imbalance
Ochal M.
Patacchiola M.
Vazquez J.
Storkey A.
Wang S.
[J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1348 - 1358
[22] Local Propagation for Few-Shot Learning
Lifchitz, Yann
Avrithis, Yannis
Picard, Sylvaine
[J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10457 - 10464
[23] Few-shot Learning with Prompting Methods
[J]. 2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
[24] Active Few-Shot Learning with FASL
Muller, Thomas
Perez-Torro, Guillermo
Basile, Angelo
Franco-Salvador, Marc
[J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2022), 2022, 13286 : 98 - 110
[25] Learning a Latent Space with Triplet Network for Few-Shot Image Classification
Wu, Jiaying
Hu, Jinglu
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 5038 - 5044
[26] Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space
Li, Shuo
Liu, Fang
Hao, Zehua
Zhao, Kaibo
Jiao, Licheng
[J]. COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 420 - 436
[27] Few-Shot Classification with Contrastive Learning
Yang, Zhanyuan
Wang, Jinghua
Zhu, Yingying
[J]. COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 293 - 309
[28] Personalized Federated Few-Shot Learning
Zhao, Yunfeng
Yu, Guoxian
Wang, Jun
Domeniconi, Carlotta
Guo, Maozu
Zhang, Xiangliang
Cui, Lizhen
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2534 - 2544
[29] Few-shot learning for ear recognition
Zhang, Jie
Yu, Wen
Yang, Xudong
Deng, Fang
[J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 50 - 54
[30] A Feature Generator for Few-Shot Learning
Kanagalingam, Heethanjan
Pathmanathan, Thenukan
Ketheeswaran, Navaneethan
Vathanakumar, Mokeeshan
Afham, Mohamed
Rodrigo, Ranga
[J]. arXiv,

← 1 2 3 4 5 →