ProD: Prompting-to-disentangle Domain Knowledge for Cross-domain Few-shot Image Classification

被引:5
|
作者
Ma, Tianyi [1 ,2 ]
Sun, Yifan [2 ]
Yang, Zongxin [3 ]
Yang, Yi [3 ]
机构
[1] Univ Technol Sydney, Ultimo, Australia
[2] Baidu Inc, Beijing, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers few-shot image classification under the cross-domain scenario, where the train-to-test domain gap compromises classification accuracy. To mitigate the domain gap, we propose a prompting-to-disentangle (ProD) method through a novel exploration with the prompting mechanism. ProD adopts the popular multi-domain training scheme and extracts the backbone feature with a standard Convolutional Neural Network. Based on these two common practices, the key point of ProD is using the prompting mechanism in the transformer to disentangle the domain-general (DG) and domain-specific (DS) knowledge from the backbone feature. Specifically, ProD concatenates a DG and a DS prompt to the backbone feature and feeds them into a lightweight transformer. The DG prompt is learnable and shared by all the training domains, while the DS prompt is generated from the domain-of-interest on the fly. As a result, the transformer outputs DG and DS features in parallel with the two prompts, yielding the disentangling effect. We show that: 1) Simply sharing a single DG prompt for all the training domains already improves generalization towards the novel test domain. 2) The cross-domain generalization can be further reinforced by making the DG prompt neutral towards the training domains. 3) When inference, the DS prompt is generated from the support samples and can capture test domain knowledge through the prompting mechanism. Combining all three benefits, ProD significantly improves cross-domain few-shot classification. For instance, on CUB, ProD improves the 5-way 5-shot accuracy from 73.56% (baseline) to 79.19%, setting a new state of the art.
引用
收藏
页码:19754 / 19763
页数:10
相关论文
共 50 条
  • [21] Cross-Domain Few-Shot Semantic Segmentation
    Lei, Shuo
    Zhang, Xuchao
    He, Jianfeng
    Chen, Fanglan
    Du, Bowen
    Lu, Chang-Tien
    [J]. COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 73 - 90
  • [22] DOMAIN-AGNOSTIC META-LEARNING FOR CROSS-DOMAIN FEW-SHOT CLASSIFICATION
    Lee, Wei-Yu
    Wang, Jheng-Yu
    Wang, Yu-Chiang Frank
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1715 - 1719
  • [23] Beyond Spectral Shift Mitigation: Knowledge Swap Net for Cross-Domain Few-Shot Hyperspectral Image Classification
    Wu, Hao
    Xue, Zhaohui
    Zhou, Shaoguang
    Su, Hongjun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [24] Cross-Domain Few-Shot Classification via Adversarial Task Augmentation
    Wang, Haoqing
    Deng, Zhi-Hong
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1075 - 1081
  • [25] Cross-Domain Few-Shot Contrastive Learning for Hyperspectral Images Classification
    Zhang, Suhua
    Chen, Zhikui
    Wang, Dan
    Wang, Z. Jane
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [26] Explanation-Guided Training for Cross-Domain Few-Shot Classification
    Sun, Jiamei
    Lapuschkin, Sebastian
    Samek, Wojciech
    Zhao, Yunqing
    Cheung, Ngai-Man
    Binder, Alexander
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7609 - 7616
  • [27] CDFM: A cross-domain few-shot model for marine plankton classification
    Guo, Jin
    Li, Wengen
    Guan, Jihong
    Gao, Hang
    Liu, Baobo
    Gong, Lili
    [J]. IET COMPUTER VISION, 2023, 17 (01) : 111 - 121
  • [28] Self-Challenging Mask for Cross-Domain Few-Shot Classification
    Ma, Yixiao
    Li, Fanzhang
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4456 - 4463
  • [29] Cross-Domain Few-Shot Graph Classification with a Reinforced Task Coordinator
    Zhang, Qiannan
    Pei, Shichao
    Yang, Qiang
    Zhang, Chuxu
    Chawla, Nitesh
    Zhang, Xiangliang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4893 - 4901
  • [30] Adaptive Parametric Prototype Learning for Cross-Domain Few-Shot Classification
    Heidari, Marzi
    Alchihabi, Abdullah
    En, Qing
    Guo, Yuhong
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238