ProD: Prompting-to-disentangle Domain Knowledge for Cross-domain Few-shot Image Classification

被引：5

作者：

Ma, Tianyi ^{[1
,2
]}

Sun, Yifan ^{[2
]}

Yang, Zongxin ^{[3
]}

Yang, Yi ^{[3
]}

机构：

[1] Univ Technol Sydney, Ultimo, Australia

[2] Baidu Inc, Beijing, Peoples R China

[3] Zhejiang Univ, Hangzhou, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.01892

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper considers few-shot image classification under the cross-domain scenario, where the train-to-test domain gap compromises classification accuracy. To mitigate the domain gap, we propose a prompting-to-disentangle (ProD) method through a novel exploration with the prompting mechanism. ProD adopts the popular multi-domain training scheme and extracts the backbone feature with a standard Convolutional Neural Network. Based on these two common practices, the key point of ProD is using the prompting mechanism in the transformer to disentangle the domain-general (DG) and domain-specific (DS) knowledge from the backbone feature. Specifically, ProD concatenates a DG and a DS prompt to the backbone feature and feeds them into a lightweight transformer. The DG prompt is learnable and shared by all the training domains, while the DS prompt is generated from the domain-of-interest on the fly. As a result, the transformer outputs DG and DS features in parallel with the two prompts, yielding the disentangling effect. We show that: 1) Simply sharing a single DG prompt for all the training domains already improves generalization towards the novel test domain. 2) The cross-domain generalization can be further reinforced by making the DG prompt neutral towards the training domains. 3) When inference, the DS prompt is generated from the support samples and can capture test domain knowledge through the prompting mechanism. Combining all three benefits, ProD significantly improves cross-domain few-shot classification. For instance, on CUB, ProD improves the 5-way 5-shot accuracy from 73.56% (baseline) to 79.19%, setting a new state of the art.

引用

下载

页码：19754 / 19763

页数：10

共 50 条

[11] Few-Shot Learning With Prototype Rectification for Cross-Domain Hyperspectral Image Classification
Qin, Anyong
Yuan, Chaoqi
Li, Qiang
Luo, Xiaoliu
Yang, Feng
Song, Tiecheng
Gao, Chenqiang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[12] Domain Mapping Network for Remote Sensing Cross-Domain Few-Shot Classification
Lu, Xiaoqiang
Gong, Tengfei
Zheng, Xiangtao
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 11
[13] Adversarial Feature Augmentation for Cross-domain Few-Shot Classification
Hu, Yanxu
Ma, Andy J.
COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 20 - 37
[14] Few-shot Image Generation via Cross-domain Correspondence
Ojha, Utkarsh
Li, Yijun
Lu, Jingwan
Efros, Alexei A.
Lee, Yong Jae
Shechtman, Eli
Zhang, Richard
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10738 - 10747
[15] Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty
Oh, Jaehoon
Kim, Sungnyun
Ho, Namgyu
Kim, Jin-Hwa
Song, Hwanjun
Yun, Se-Young
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[16] Cross-Domain Few-Shot Hyperspectral Image Classification With Class-Wise Attention
Wang, Wenzhen
Liu, Fang
Liu, Jia
Xiao, Liang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[17] Semantic Guided prototype learning for Cross-Domain Few-Shot hyperspectral image classification
Li, Yuhang
He, Jinrong
Liu, Hanchi
Zhang, Yurong
Li, Zhaokui
Expert Systems with Applications, 2025, 260
[18] SCFormer: Spectral Coordinate Transformer for Cross-Domain Few-Shot Hyperspectral Image Classification
Li, Jiaojiao
Zhang, Zhiyuan
Song, Rui
Li, Yunsong
Du, Qian
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 840 - 855
[19] Graph Information Aggregation Cross-Domain Few-Shot Learning for Hyperspectral Image Classification
Zhang, Yuxiang
Li, Wei
Zhang, Mengmeng
Wang, Shuai
Tao, Ran
Du, Qian
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 1912 - 1925
[20] Cross-Domain Few-Shot Learning Based on Feature Disentanglement for Hyperspectral Image Classification
Qin, Boao
Feng, Shou
Zhao, Chunhui
Li, Wei
Tao, Ran
Xiang, Wei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15

← 1 2 3 4 5 →