Task-Adaptive Multi-Source Representations for Few-Shot Image Recognition

被引:0
|
作者
Liu, Ge [1 ]
Zhang, Zhongqiang [1 ]
Fang, Xiangzhong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
关键词
few-shot learning; image recognition; transfer learning; domain adaptation;
D O I
10.3390/info15060293
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional few-shot learning (FSL) mainly focuses on knowledge transfer from a single source dataset to a recognition scenario with only a few training samples available but still similar to the source domain. In this paper, we consider a more practical FSL setting where multiple semantically different datasets are available to address a wide range of FSL tasks, especially for some recognition scenarios beyond natural images, such as remote sensing and medical imagery. It can be referred to as multi-source cross-domain FSL. To tackle the problem, we propose a two-stage learning scheme, termed learning and adapting multi-source representations (LAMR). In the first stage, we propose a multi-head network to obtain efficient multi-domain representations, where all source domains share the same backbone except for the last parallel projection layers for domain specialization. We train the representations in a multi-task setting where each in-domain classification task is taken by a cosine classifier. In the second stage, considering that instance discrimination and class discrimination are crucial for robust recognition, we propose two contrastive objectives for adapting the pre-trained representations to be task-specialized on the few-shot data. Careful ablation studies verify that LAMR significantly improves representation transferability, showing consistent performance boosts. We also extend LAMR to single-source FSL by introducing a dataset-splitting strategy that equally splits one source dataset into sub-domains. The empirical results show that LAMR can achieve SOTA performance on the BSCD-FSL benchmark and competitive performance on mini-ImageNet, highlighting its versatility and effectiveness for FSL of both natural and specific imaging.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] Shaping Visual Representations With Attributes for Few-Shot Recognition
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1397 - 1401
  • [32] Multi-level Metric Learning for Few-Shot Image Recognition
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 243 - 254
  • [33] Multi-domain few-shot image recognition with knowledge transfer
    Li, Mingxi
    Wang, Ronggui
    Yang, Juan
    Xue, Lixia
    Hu, Min
    NEUROCOMPUTING, 2021, 442 : 64 - 72
  • [34] Hierarchical compositional representations for few-shot action recognition
    Li, Changzhen
    Zhang, Jie
    Wu, Shuzhe
    Jin, Xin
    Shan, Shiguang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [35] Semantic Prompt for Few-Shot Image Recognition
    Chen, Wentao
    Si, Chenyang
    Zhang, Zhang
    Wang, Liang
    Wang, Zilei
    Tan, Tieniu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23581 - 23591
  • [36] Few-Shot Image Recognition with Knowledge Transfer
    Peng, Zhimao
    Li, Zechao
    Zhang, Junge
    Li, Yan
    Qi, Guo-Jun
    Tang, Jinhui
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 441 - 449
  • [37] Dataset Bias in Few-Shot Image Recognition
    Jiang, Shuqiang
    Zhu, Yaohui
    Liu, Chenlong
    Song, Xinhang
    Li, Xiangyang
    Min, Weiqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 229 - 246
  • [38] Task-adaptive unbiased regularization meta-learning for few-shot cross-domain fault diagnosis
    Wang, Huaqing
    Lv, Dongrui
    Lin, Tianjiao
    Han, Changkun
    Song, Liuyang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 144
  • [39] Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network
    Hon, Yutai
    Che, Wanxiang
    Lai, Yongkui
    Zhou, Zhihan
    Liu, Yijia
    Liu, Han
    Liu, Ting
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1381 - 1393
  • [40] Multi-Scale Adaptive Task Attention Network for Few-Shot Learning
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4765 - 4771