Inversed Pyramid Network with Spatial-adapted and Task-oriented Tuning for few-shot learning

被引:0
|
作者
Zhao, Xiaowei [1 ]
Wang, Duorui [1 ]
Bai, Shihao [4 ]
Wang, Shuo [5 ]
Gao, Yajun [1 ]
Liang, Yu [6 ]
Ma, Yuqing [1 ,2 ]
Liu, Xianglong [1 ,3 ]
机构
[1] Beihang Univ, State Key Lab Complex & Crit Software Environm, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[3] Zhongguancun Lab, Beijing 100194, Peoples R China
[4] SenseTime, Beijing 100080, Peoples R China
[5] Meituan, Beijing 100102, Peoples R China
[6] Beijing Univ Technol, Beijing 100124, Peoples R China
关键词
Few-shot learning; Inverted Pyramid Network; Spatial-adapted Layer; Task-oriented Tuning;
D O I
10.1016/j.patcog.2025.111415
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of artificial intelligence, deep neural networks have achieved great performance in many tasks. However, traditional deep learning methods require a large amount of training data, which may not be available in certain practical scenarios. In contrast, few-shot learning aims to learn a model that can be readily adapted to new unseen classes from only one or a few labeled examples. Despite this success, most existing methods rely on pre-trained feature extractor networks trained with global features, ignoring the discrimination of local features, and weak generalization capabilities limit their performance. To address the problem, according to the human's coarse-to-fine cognition paradigm, we propose an Inverted Pyramid Network with Spatial-adapted and Task-oriented Tuning (TIPN) for few-shot learning. Specifically, the proposed framework represents local features for categories that are difficult to distinguish by global features and recognizes objects from both global and local perspectives. Moreover, to ensure the calibration validity of the proposed model at the local stage, we introduce the Spatial-adapted Layer to preserve the discriminative global representation ability of the pre-trained backbone network. Meanwhile, as the representations extracted from the past categories are not applicable to the current new tasks, we further propose the Task-oriented Tuning strategy to adjust the parameters of the Batch Normalization layer in the pre-trained feature extractor network, to explicitly transfer knowledge from base classes to novel classes according to the support samples of each task. Extensive experiments conducted on multiple benchmark datasets demonstrate that our method can significantly outperform many state-of-the-art few-shot learning methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Sparse spatial transformers for few-shot learning
    Haoxing Chen
    Huaxiong Li
    Yaohui Li
    Chunlin Chen
    Science China Information Sciences, 2023, 66
  • [22] Sparse spatial transformers for few-shot learning
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (11)
  • [23] Spatial Contrastive Learning for Few-Shot Classification
    Ouali, Yassine
    Hudelot, Celine
    Tami, Myriam
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 671 - 686
  • [24] Few-Shot Learning with Siamese Networks and Label Tuning
    Mueller, Thomas
    Perez-Torro, Guillermo
    Franco-Salvador, Marc
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8532 - 8545
  • [25] Multi-Scale Adaptive Task Attention Network for Few-Shot Learning
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4765 - 4771
  • [26] Task Agnostic Meta-Learning for Few-Shot Learning
    Jamal, Muhammad Abdullah
    Qi, Guo-Jun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
  • [27] Task-Equivariant Graph Few-shot Learning
    Kim, Sungwon
    Lee, Junseok
    Lee, Namkyeong
    Kim, Wonjoong
    Choi, Seungyoon
    Park, Chanyoung
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1120 - 1131
  • [28] DETA: Denoised Task Adaptation for Few-Shot Learning
    Zhang, Ji
    Gao, Lianli
    Luo, Xu
    Shen, Hengtao
    Song, Jingkuan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11507 - 11517
  • [29] One-Shot Learning for Task-Oriented Grasping
    Holomjova, Valerija
    Starkey, Andrew J.
    Yun, Bruno
    Meisner, Pascal
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8232 - 8238
  • [30] Feature Transformation Network for Few-Shot Learning
    Wang, Xiaoyan
    Wang, Hongmei
    Zhou, Daming
    IEEE ACCESS, 2021, 9 : 41913 - 41924