Task-Adaptive Multi-Source Representations for Few-Shot Image Recognition

被引：0

作者：

Liu, Ge ^{[1
]}

Zhang, Zhongqiang ^{[1
]}

Fang, Xiangzhong ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

来源：

INFORMATION | 2024年 / 15卷 / 06期

关键词：

few-shot learning; image recognition; transfer learning; domain adaptation;

D O I：

10.3390/info15060293

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Conventional few-shot learning (FSL) mainly focuses on knowledge transfer from a single source dataset to a recognition scenario with only a few training samples available but still similar to the source domain. In this paper, we consider a more practical FSL setting where multiple semantically different datasets are available to address a wide range of FSL tasks, especially for some recognition scenarios beyond natural images, such as remote sensing and medical imagery. It can be referred to as multi-source cross-domain FSL. To tackle the problem, we propose a two-stage learning scheme, termed learning and adapting multi-source representations (LAMR). In the first stage, we propose a multi-head network to obtain efficient multi-domain representations, where all source domains share the same backbone except for the last parallel projection layers for domain specialization. We train the representations in a multi-task setting where each in-domain classification task is taken by a cosine classifier. In the second stage, considering that instance discrimination and class discrimination are crucial for robust recognition, we propose two contrastive objectives for adapting the pre-trained representations to be task-specialized on the few-shot data. Careful ablation studies verify that LAMR significantly improves representation transferability, showing consistent performance boosts. We also extend LAMR to single-source FSL by introducing a dataset-splitting strategy that equally splits one source dataset into sub-domains. The empirical results show that LAMR can achieve SOTA performance on the BSCD-FSL benchmark and competitive performance on mini-ImageNet, highlighting its versatility and effectiveness for FSL of both natural and specific imaging.

引用

页数：28

共 50 条

[41] Few-shot classification in Named Entity Recognition Task
Fritzler, Alexander
Logacheva, Varvara
Kretov, Maksim
SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 993 - 1000
[42] Gaussian Prototype Rectification For Few-shot Image Recognition
Lin, Jinfu
Shen, Junmin
He, Xiaojian
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[43] Few-shot Image Recognition for UAV Sports Cinematography
Patsiouras, Emmanouil
Tefas, Anastasios
Pitas, Ioannis
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 965 - 969
[44] Named Entity Recognition for Few-Shot Power Dispatch Based on Multi-Task
Tan, Zhixiang
Chen, Yan
Liang, Zengfu
Meng, Qi
Lin, Dezhao
ELECTRONICS, 2023, 12 (16)
[45] LoFGAN: Fusing Local Representations for Few-shot Image Generation
Gu, Zheng
Li, Wenbin
Huo, Jing
Wang, Lei
Gao, Yang
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8443 - 8451
[46] Multi-task few-shot learning with composed data augmentation for image classification
Zhang, Rui
Yang, Yixin
Li, Yang
Wang, Jiabao
Li, Hang
Miao, Zhuang
IET COMPUTER VISION, 2023, 17 (02) : 211 - 221
[47] SIM: an improved few-shot image classification model with multi-task learning
Guo, Jin
Li, Wengen
Guan, Jihong
Gao, Hang
Liu, Baobo
Gong, Lili
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)
[48] Learning Task-aware Local Representations for Few-shot Learning
Dong, Chuanqi
Li, Wenbin
Huo, Jing
Gu, Zheng
Gao, Yang
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 716 - 722
[49] TACDFSL: Task Adaptive Cross Domain Few-Shot Learning
Zhang, Qi
Jiang, Yingluo
Wen, Zhijie
SYMMETRY-BASEL, 2022, 14 (06):
[50] Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Li, Alexander Hanbo
Shang, Mingyue
Spiliopoulou, Evangelia
Ma, Jie
Ng, Patrick
Wang, Zhiguo
Min, Bonan
Wang, William
McKeown, Kathleen
Castelli, Vittorio
Roth, Dan
Xiang, Bing
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 16171 - 16189

← 1 2 3 4 5 →