ProD: Prompting-to-disentangle Domain Knowledge for Cross-domain Few-shot Image Classification

被引:5
|
作者
Ma, Tianyi [1 ,2 ]
Sun, Yifan [2 ]
Yang, Zongxin [3 ]
Yang, Yi [3 ]
机构
[1] Univ Technol Sydney, Ultimo, Australia
[2] Baidu Inc, Beijing, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers few-shot image classification under the cross-domain scenario, where the train-to-test domain gap compromises classification accuracy. To mitigate the domain gap, we propose a prompting-to-disentangle (ProD) method through a novel exploration with the prompting mechanism. ProD adopts the popular multi-domain training scheme and extracts the backbone feature with a standard Convolutional Neural Network. Based on these two common practices, the key point of ProD is using the prompting mechanism in the transformer to disentangle the domain-general (DG) and domain-specific (DS) knowledge from the backbone feature. Specifically, ProD concatenates a DG and a DS prompt to the backbone feature and feeds them into a lightweight transformer. The DG prompt is learnable and shared by all the training domains, while the DS prompt is generated from the domain-of-interest on the fly. As a result, the transformer outputs DG and DS features in parallel with the two prompts, yielding the disentangling effect. We show that: 1) Simply sharing a single DG prompt for all the training domains already improves generalization towards the novel test domain. 2) The cross-domain generalization can be further reinforced by making the DG prompt neutral towards the training domains. 3) When inference, the DS prompt is generated from the support samples and can capture test domain knowledge through the prompting mechanism. Combining all three benefits, ProD significantly improves cross-domain few-shot classification. For instance, on CUB, ProD improves the 5-way 5-shot accuracy from 73.56% (baseline) to 79.19%, setting a new state of the art.
引用
收藏
页码:19754 / 19763
页数:10
相关论文
共 50 条
  • [1] Experiments in cross-domain few-shot learning for image classification
    Wang, Hongyu
    Gouk, Henry
    Fraser, Huon
    Frank, Eibe
    Pfahringer, Bernhard
    Mayo, Michael
    Holmes, Geoffrey
    [J]. JOURNAL OF THE ROYAL SOCIETY OF NEW ZEALAND, 2023, 53 (01) : 169 - 191
  • [2] Cross-Domain Few-Shot Graph Classification
    Hassani, Kaveh
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6856 - 6864
  • [3] HybridPrompt: Domain-Aware Prompting for Cross-Domain Few-Shot Learning
    Wu, Jiamin
    Zhang, Tianzhu
    Zhang, Yongdong
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
  • [4] Deep Cross-Domain Few-Shot Learning for Hyperspectral Image Classification
    Li, Zhaokui
    Liu, Ming
    Chen, Yushi
    Xu, Yimin
    Li, Wei
    Du, Qian
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] Knowledge transduction for cross-domain few-shot learning
    Li, Pengfang
    Liu, Fang
    Jiao, Licheng
    Li, Shuo
    Li, Lingling
    Liu, Xu
    Huang, Xinyan
    [J]. PATTERN RECOGNITION, 2023, 141
  • [6] Adaptive Domain-Adversarial Few-Shot Learning for Cross-Domain Hyperspectral Image Classification
    Ye, Zhen
    Wang, Jie
    Liu, Huan
    Zhang, Yu
    Li, Wei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [7] DUAL GRAPH CROSS-DOMAIN FEW-SHOT LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Zhang, Yuxiang
    Li, Wei
    Zhang, Mengmeng
    Tao, Ran
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3573 - 3577
  • [8] Experiments in Cross-domain Few-shot Learning for Image Classification: Extended Abstract
    Wang, Hongyu
    Fraser, Huon
    Gouk, Henry
    Frank, Eibe
    Pfahringer, Bernhard
    Mayo, Michael
    Holmes, Geoff
    [J]. ECMLPKDD WORKSHOP ON META-KNOWLEDGE TRANSFER, VOL 191, 2022, 191 : 81 - 83
  • [9] SAR Image Classification Using Few-shot Cross-domain Transfer Learning
    Rostami, Mohammad
    Kolouri, Soheil
    Eaton, Eric
    Kim, Kyungnam
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 907 - 915
  • [10] Few-Shot Learning With Prototype Rectification for Cross-Domain Hyperspectral Image Classification
    Qin, Anyong
    Yuan, Chaoqi
    Li, Qiang
    Luo, Xiaoliu
    Yang, Feng
    Song, Tiecheng
    Gao, Chenqiang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62