FEW-SHOT LEARNING BY DIMENSIONALITY REDUCTION IN GRADIENT SPACE

Cited by: 0
Authors
Gauch, Martin [1, 2, 6]
Beck, Maximilian [1, 2]
Adler, Thomas [1, 2]
Kotsur, Dmytro [3]
Fiel, Stefan [3]
Eghbal-Zadeh, Hamid [1, 2]
Brandstetter, Johannes [1, 2]
Kofler, Johannes [1, 2]
Holzleitner, Markus [1, 2]
Zellinger, Werner [4]
Klotz, Daniel [1, 2]
Hochreiter, Sepp [1, 2, 5]
Lehner, Sebastian [1, 2]
Affiliations
[1] Johannes Kepler Univ Linz, Inst Machine Learning, ELLIS Unit Linz, Linz, Austria
[2] Johannes Kepler Univ Linz, Inst Machine Learning, LIT AI Lab, Linz, Austria
[3] Anyline GmbH, Vienna, Austria
[4] Austrian Acad Sci, Johann Radon Inst Computat & Appl Math, Linz, Austria
[5] Inst Adv Res Artificial Intelligence IARAI, Vienna, Austria
[6] Microsoft Res, Redmond, WA 98052 USA
Funding
EU Horizon 2020;
Keywords
CLIMATE-CHANGE; MODEL;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We introduce SubGD, a novel few-shot learning method based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace. In experimental and theoretical analyses, we show that models confined to a suitable predefined subspace generalize well for few-shot learning. A suitable subspace fulfills three criteria across the given tasks: it (a) allows the training error to be reduced by gradient flow, (b) leads to models that generalize well, and (c) can be identified by stochastic gradient descent. SubGD identifies these subspaces from an eigendecomposition of the auto-correlation matrix of update directions across different tasks. Demonstrably, we can identify low-dimensional suitable subspaces for few-shot learning of dynamical systems, which have varying properties described by one or a few parameters of the analytical system description. Such systems are ubiquitous among real-world applications in science and engineering. We experimentally corroborate the advantages of SubGD on three distinct dynamical systems problem settings, significantly outperforming popular few-shot learning methods in terms of both sample efficiency and performance.
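The subspace identification described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes that each task contributes one flattened update direction (for example, fine-tuned parameters minus initial parameters), that the auto-correlation matrix is the uncentered average of outer products of these directions, and that a model is confined to the subspace by projecting each gradient onto the leading eigenvectors. The names `fit_subspace`, `subspace_step`, `k`, and `lr` are hypothetical, and the data are synthetic.

```python
import numpy as np

def fit_subspace(update_directions, k):
    """Top-k eigenvectors/eigenvalues of the auto-correlation matrix of updates."""
    D = np.asarray(update_directions, dtype=float)  # shape (num_tasks, num_params)
    C = D.T @ D / D.shape[0]                        # uncentered auto-correlation matrix
    eigvals, eigvecs = np.linalg.eigh(C)            # symmetric matrix, ascending eigenvalues
    top = np.argsort(eigvals)[::-1][:k]             # indices of the k largest eigenvalues
    return eigvecs[:, top], eigvals[top]

def subspace_step(params, grad, basis, lr=0.1):
    """One gradient step restricted to the span of the basis columns (assumed rule)."""
    return params - lr * basis @ (basis.T @ grad)

# Synthetic data: 50 task-wise update directions that lie in a 2-D subspace of R^10.
rng = np.random.default_rng(0)
true_basis, _ = np.linalg.qr(rng.normal(size=(10, 2)))  # orthonormal columns
updates = rng.normal(size=(50, 2)) @ true_basis.T

basis, eigvals = fit_subspace(updates, k=2)

# A step from the origin with an arbitrary gradient stays inside the subspace.
params = np.zeros(10)
new_params = subspace_step(params, rng.normal(size=10), basis)
residual = new_params - true_basis @ (true_basis.T @ new_params)
print("norm of component outside the true subspace:", np.linalg.norm(residual))
```

In this toy setting the recovered basis spans the same plane as `true_basis`, so the printed norm is zero up to floating-point error. For real networks the parameter count makes forming the full matrix `C` impractical, so a truncated decomposition of `D` would be the more realistic choice.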
Pages: 22