Task-Adaptive Multi-Source Representations for Few-Shot Image Recognition

Cited by: 0
Authors
Liu, Ge [1]
Zhang, Zhongqiang [1]
Fang, Xiangzhong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
Keywords
few-shot learning; image recognition; transfer learning; domain adaptation
DOI
10.3390/info15060293
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Conventional few-shot learning (FSL) mainly focuses on transferring knowledge from a single source dataset to a recognition scenario that has only a few training samples but remains similar to the source domain. In this paper, we consider a more practical FSL setting in which multiple semantically different datasets are available to address a wide range of FSL tasks, especially recognition scenarios beyond natural images, such as remote sensing and medical imagery. We refer to this setting as multi-source cross-domain FSL. To tackle the problem, we propose a two-stage learning scheme, termed learning and adapting multi-source representations (LAMR). In the first stage, we propose a multi-head network to obtain efficient multi-domain representations, in which all source domains share the same backbone except for the final parallel projection layers, which provide domain specialization. We train the representations in a multi-task setting where each in-domain classification task is handled by a cosine classifier. In the second stage, considering that instance discrimination and class discrimination are crucial for robust recognition, we propose two contrastive objectives that adapt the pre-trained representations to the task using the few-shot data. Careful ablation studies verify that LAMR significantly improves representation transferability, showing consistent performance boosts. We also extend LAMR to single-source FSL by introducing a dataset-splitting strategy that equally splits one source dataset into sub-domains. The empirical results show that LAMR achieves state-of-the-art (SOTA) performance on the BSCD-FSL benchmark and competitive performance on mini-ImageNet, highlighting its versatility and effectiveness for FSL on both natural and specialized imagery.
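The first-stage architecture described in the abstract (a shared backbone, one projection head per source domain, and a cosine classifier per in-domain task) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the class name `MultiHeadCosineModel`, the single linear layer standing in for the backbone, the random weights, and the scale factor of 10.0 are all assumptions made for illustration.

```python
import math
import random


def matvec(weights, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def l2_normalize(v, eps=1e-12):
    """Scale a vector to unit length (eps avoids division by zero)."""
    norm = math.sqrt(sum(c * c for c in v))
    return [c / (norm + eps) for c in v]


def cosine_logits(z, class_weights, scale=10.0):
    """Scaled cosine similarity between an embedding and each class vector.

    The scale value is an assumed hyperparameter, not taken from the paper.
    """
    z = l2_normalize(z)
    return [
        scale * sum(a * b for a, b in zip(z, l2_normalize(w)))
        for w in class_weights
    ]


class MultiHeadCosineModel:
    """Hypothetical sketch: shared backbone plus one head per source domain."""

    def __init__(self, in_dim, feat_dim, proj_dim, classes_per_domain, seed=0):
        rng = random.Random(seed)

        def rand(rows, cols):
            return [[rng.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]

        # One linear layer stands in for the shared backbone (assumption).
        self.backbone = rand(feat_dim, in_dim)
        # Parallel projection layers, one per source domain.
        self.heads = [rand(proj_dim, feat_dim) for _ in classes_per_domain]
        # One cosine classifier (class weight vectors) per source domain.
        self.classifiers = [rand(n, proj_dim) for n in classes_per_domain]

    def forward(self, x, domain):
        h = [max(0.0, v) for v in matvec(self.backbone, x)]  # shared features, ReLU
        z = matvec(self.heads[domain], h)                    # domain-specific projection
        return cosine_logits(z, self.classifiers[domain])    # in-domain class scores


model = MultiHeadCosineModel(in_dim=4, feat_dim=8, proj_dim=3, classes_per_domain=[5, 2])
logits_domain0 = model.forward([1.0, 0.5, -0.2, 0.3], domain=0)
logits_domain1 = model.forward([1.0, 0.5, -0.2, 0.3], domain=1)
```

Because the logits are scaled cosine similarities, each one is bounded by the scale factor in absolute value, regardless of the magnitude of the embedding.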
Pages: 28
Related papers
50 items in total
  • [41] Few-shot classification in Named Entity Recognition Task
    Fritzler, Alexander
    Logacheva, Varvara
    Kretov, Maksim
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 993 - 1000
  • [42] Gaussian Prototype Rectification For Few-shot Image Recognition
    Lin, Jinfu
    Shen, Junmin
    He, Xiaojian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [43] Few-shot Image Recognition for UAV Sports Cinematography
    Patsiouras, Emmanouil
    Tefas, Anastasios
    Pitas, Ioannis
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 965 - 969
  • [44] Named Entity Recognition for Few-Shot Power Dispatch Based on Multi-Task
    Tan, Zhixiang
    Chen, Yan
    Liang, Zengfu
    Meng, Qi
    Lin, Dezhao
    ELECTRONICS, 2023, 12 (16)
  • [45] LoFGAN: Fusing Local Representations for Few-shot Image Generation
    Gu, Zheng
    Li, Wenbin
    Huo, Jing
    Wang, Lei
    Gao, Yang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8443 - 8451
  • [46] Multi-task few-shot learning with composed data augmentation for image classification
    Zhang, Rui
    Yang, Yixin
    Li, Yang
    Wang, Jiabao
    Li, Hang
    Miao, Zhuang
    IET COMPUTER VISION, 2023, 17 (02) : 211 - 221
  • [47] SIM: an improved few-shot image classification model with multi-task learning
    Guo, Jin
    Li, Wengen
    Guan, Jihong
    Gao, Hang
    Liu, Baobo
    Gong, Lili
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)
  • [48] Learning Task-aware Local Representations for Few-shot Learning
    Dong, Chuanqi
    Li, Wenbin
    Huo, Jing
    Gu, Zheng
    Gao, Yang
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 716 - 722
  • [49] TACDFSL: Task Adaptive Cross Domain Few-Shot Learning
    Zhang, Qi
    Jiang, Yingluo
    Wen, Zhijie
    SYMMETRY-BASEL, 2022, 14 (06)
  • [50] Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
    Li, Alexander Hanbo
    Shang, Mingyue
    Spiliopoulou, Evangelia
    Ma, Jie
    Ng, Patrick
    Wang, Zhiguo
    Min, Bonan
    Wang, William
    McKeown, Kathleen
    Castelli, Vittorio
    Roth, Dan
    Xiang, Bing
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 16171 - 16189