Inversed Pyramid Network with Spatial-adapted and Task-oriented Tuning for few-shot learning

Cited by: 0
Authors
Zhao, Xiaowei [1]
Wang, Duorui [1]
Bai, Shihao [4]
Wang, Shuo [5]
Gao, Yajun [1]
Liang, Yu [6]
Ma, Yuqing [1,2]
Liu, Xianglong [1,3]
Affiliations
[1] Beihang Univ, State Key Lab Complex & Crit Software Environm, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[3] Zhongguancun Lab, Beijing 100194, Peoples R China
[4] SenseTime, Beijing 100080, Peoples R China
[5] Meituan, Beijing 100102, Peoples R China
[6] Beijing Univ Technol, Beijing 100124, Peoples R China
Keywords
Few-shot learning; Inverted Pyramid Network; Spatial-adapted Layer; Task-oriented Tuning
DOI
10.1016/j.patcog.2025.111415
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the rapid development of artificial intelligence, deep neural networks have achieved strong performance on many tasks. However, traditional deep learning methods require large amounts of training data, which may not be available in some practical scenarios. In contrast, few-shot learning aims to learn a model that can be readily adapted to new, unseen classes from only one or a few labeled examples. Despite the progress in few-shot learning, most existing methods rely on feature extractors pre-trained with global features and ignore the discriminative power of local features, and their weak generalization limits performance. To address these problems, and following the coarse-to-fine paradigm of human cognition, we propose an Inverted Pyramid Network with Spatial-adapted and Task-oriented Tuning (TIPN) for few-shot learning. Specifically, the proposed framework represents local features for categories that are difficult to distinguish by global features alone, and recognizes objects from both global and local perspectives. Moreover, to ensure that calibration at the local stage remains valid, we introduce the Spatial-adapted Layer, which preserves the discriminative global representation ability of the pre-trained backbone network. Meanwhile, because representations extracted from previously seen categories do not directly apply to new tasks, we further propose the Task-oriented Tuning strategy, which adjusts the parameters of the Batch Normalization layer in the pre-trained feature extractor network to explicitly transfer knowledge from base classes to novel classes according to the support samples of each task. Extensive experiments on multiple benchmark datasets demonstrate that our method significantly outperforms many state-of-the-art few-shot learning methods.
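The idea of tuning only normalization parameters on a task's support samples can be illustrated with a toy sketch. This is not the authors' implementation: the abstract states only that the Batch Normalization parameters are adjusted according to the support samples. Everything else here is an assumption made for illustration, including the two-channel features, the frozen running statistics, the ReLU, the prototype-based cross-entropy loss, the finite-difference gradient, and all function names.

```python
import math

# Hypothetical frozen statistics inherited from base-class pre-training.
RUNNING_MEAN = [0.0, 0.0]
RUNNING_VAR = [1.0, 1.0]
EPS = 1e-5


def bn_relu(x, gamma, beta):
    """Normalize each channel with frozen statistics, apply scale/shift, then ReLU."""
    out = []
    for c, value in enumerate(x):
        normed = (value - RUNNING_MEAN[c]) / math.sqrt(RUNNING_VAR[c] + EPS)
        out.append(max(0.0, gamma[c] * normed + beta[c]))
    return out


def support_loss(params, support):
    """Mean cross-entropy of support samples against class prototypes (assumed loss)."""
    n = len(params) // 2
    gamma, beta = params[:n], params[n:]
    feats = [(bn_relu(x, gamma, beta), y) for x, y in support]
    classes = sorted({y for _, y in feats})
    protos = {}
    for k in classes:
        members = [f for f, y in feats if y == k]
        protos[k] = [sum(col) / len(members) for col in zip(*members)]
    total = 0.0
    for f, y in feats:
        # Logit = negative squared distance to each class prototype.
        logits = {k: -sum((a - b) ** 2 for a, b in zip(f, protos[k])) for k in classes}
        m = max(logits.values())
        log_z = m + math.log(sum(math.exp(v - m) for v in logits.values()))
        total += log_z - logits[y]
    return total / len(feats)


def numeric_grad(f, params, h=1e-5):
    """Central finite-difference gradient; stands in for backpropagation."""
    grad = []
    for i in range(len(params)):
        up = params[:i] + [params[i] + h] + params[i + 1:]
        down = params[:i] + [params[i] - h] + params[i + 1:]
        grad.append((f(up) - f(down)) / (2 * h))
    return grad


def tune(params, support, steps=50, lr=0.1):
    """Gradient descent on scale/shift only; a step is kept only if the loss drops."""
    loss = support_loss(params, support)
    for _ in range(steps):
        grad = numeric_grad(lambda p: support_loss(p, support), params)
        step = lr
        while step > 1e-8:
            cand = [p - step * g for p, g in zip(params, grad)]
            cand_loss = support_loss(cand, support)
            if cand_loss < loss:
                params, loss = cand, cand_loss
                break
            step /= 2
        else:
            break  # no improving step found; stop early
    return params, loss


# A made-up 2-way 2-shot task: (feature vector, class label).
support_set = [([1.0, 0.2], 0), ([0.8, 0.4], 0), ([0.3, 0.9], 1), ([0.1, 1.1], 1)]
init_params = [1.0, 1.0, 0.0, 0.0]  # gamma per channel, then beta per channel

initial_loss = support_loss(init_params, support_set)
tuned_params, tuned_loss = tune(init_params, support_set)
```

Only the four scale and shift values change during tuning; the running statistics stay fixed, which mirrors the general motivation of adapting a small set of parameters per task while keeping the pre-trained representation intact.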
Pages: 11