ProD: Prompting-to-disentangle Domain Knowledge for Cross-domain Few-shot Image Classification

被引:5
|
作者
Ma, Tianyi [1 ,2 ]
Sun, Yifan [2 ]
Yang, Zongxin [3 ]
Yang, Yi [3 ]
机构
[1] Univ Technol Sydney, Ultimo, Australia
[2] Baidu Inc, Beijing, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers few-shot image classification under the cross-domain scenario, where the train-to-test domain gap compromises classification accuracy. To mitigate the domain gap, we propose a prompting-to-disentangle (ProD) method through a novel exploration with the prompting mechanism. ProD adopts the popular multi-domain training scheme and extracts the backbone feature with a standard Convolutional Neural Network. Based on these two common practices, the key point of ProD is using the prompting mechanism in the transformer to disentangle the domain-general (DG) and domain-specific (DS) knowledge from the backbone feature. Specifically, ProD concatenates a DG and a DS prompt to the backbone feature and feeds them into a lightweight transformer. The DG prompt is learnable and shared by all the training domains, while the DS prompt is generated from the domain-of-interest on the fly. As a result, the transformer outputs DG and DS features in parallel with the two prompts, yielding the disentangling effect. We show that: 1) Simply sharing a single DG prompt for all the training domains already improves generalization towards the novel test domain. 2) The cross-domain generalization can be further reinforced by making the DG prompt neutral towards the training domains. 3) When inference, the DS prompt is generated from the support samples and can capture test domain knowledge through the prompting mechanism. Combining all three benefits, ProD significantly improves cross-domain few-shot classification. For instance, on CUB, ProD improves the 5-way 5-shot accuracy from 73.56% (baseline) to 79.19%, setting a new state of the art.
引用
收藏
页码:19754 / 19763
页数:10
相关论文
共 50 条
  • [31] Rethinking cross-domain semantic relation for few-shot image generation
    Gou, Yao
    Li, Min
    Lv, Yilong
    Zhang, Yusen
    Xing, Yuhang
    He, Yujie
    [J]. APPLIED INTELLIGENCE, 2023, 53 (19) : 22391 - 22404
  • [32] Rethinking cross-domain semantic relation for few-shot image generation
    Yao Gou
    Min Li
    Yilong Lv
    Yusen Zhang
    Yuhang Xing
    Yujie He
    [J]. Applied Intelligence, 2023, 53 : 22391 - 22404
  • [33] Research on a Cross-Domain Few-Shot Adaptive Classification Algorithm Based on Knowledge Distillation Technology
    Gao, Jiuyang
    Li, Siyu
    Xia, Wenfeng
    Yu, Jiuyang
    Dai, Yaonan
    [J]. SENSORS, 2024, 24 (06)
  • [34] Spatial-Spectral-Semantic Cross-Domain Few-Shot Learning for Hyperspectral Image Classification
    Cao, Mengxin
    Zhang, Xu
    Cheng, Jinyong
    Zhao, Guixin
    Li, Wei
    Dong, Xiangjun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [35] FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION BASED ON CROSS-DOMAIN SPECTRAL SEMANTIC RELATION TRANSFORMER
    Cao, Mengxin
    Zhao, Guixin
    Dong, Aimei
    Lv, Guohua
    Guo, Ying
    Dong, Xiangjun
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1375 - 1379
  • [36] Causal Meta-Transfer Learning for Cross-Domain Few-Shot Hyperspectral Image Classification
    Cheng, Yuhu
    Zhang, Wei
    Wang, Haoyu
    Wang, Xuesong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [37] Convolutional Transformer-Based Few-Shot Learning for Cross-Domain Hyperspectral Image Classification
    Peng, Yishu
    Liu, Yaru
    Tu, Bing
    Zhang, Yuwen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 1335 - 1349
  • [38] Hyperspectral Image Classification via Cross-Domain Few-Shot Learning With Kernel Triplet Loss
    Huang, Ke-Kun
    Yuan, Hao-Tian
    Ren, Chuan-Xian
    Hou, Yue-En
    Duan, Jie-Li
    Yang, Zhou
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 18
  • [39] Cross-Domain Few-Shot Learning Based on Graph Convolution Contrast for Hyperspectral Image Classification
    Ye, Zhen
    Wang, Jie
    Sun, Tao
    Zhang, Jinxin
    Li, Wei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [40] Multi-level relation learning for cross-domain few-shot hyperspectral image classification
    Liu, Chun
    Yang, Longwei
    Li, Zheng
    Yang, Wei
    Han, Zhigang
    Guo, Jianzhong
    Yu, Junyong
    [J]. APPLIED INTELLIGENCE, 2024, 54 (05) : 4392 - 4410