Adversarial domain adaptation with CLIP for few-shot image classification

Cited by: 0
Authors
Sun, Tongfeng [1, 2]
Yang, Hongjian [1]
Li, Zhongnian [1, 2]
Xu, Xinzheng [1, 2]
Wang, Xiurui [1]
Affiliations
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou, Jiangsu, Peoples R China
[2] Minist Educ Peoples Republ China, Mine Digitizat Engn Res Ctr, Xuzhou, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Few-shot learning; Adversarial domain adaptation; Multi-modal features; Knowledge transfer;
DOI
10.1007/s10489-024-06088-4
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Few-shot learning focuses on training efficient models with limited amounts of training data. Its mainstream approaches have evolved from single-modal to multi-modal methods. The Contrastive Vision-Language Pre-training model, known as CLIP, achieves image classification by aligning the embedding spaces of images and text. To better transfer knowledge between the image domain and the text domain, we propose a fine-tuning framework for vision-language models based on CLIP. It introduces a novel adversarial domain adaptation approach, which trains a symmetrical text-and-image classifier to identify the differences between the two domains. To align text and images more effectively in the same space, we adapt two types of confusion loss to construct the aligned semantic space by fine-tuning the multi-modal feature extractor. Experiments on 11 public datasets show that the proposed method outperforms state-of-the-art CLIP-driven learning methods.
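The abstract does not specify the two confusion losses, so the following is only a minimal sketch of one common formulation from the adversarial domain adaptation literature, not the authors' method. It assumes a hypothetical two-way domain classifier (0 = image, 1 = text) whose logits are given; the function names and example values are illustrative. The classifier is trained with ordinary cross-entropy to tell the domains apart, while the feature extractor is trained against a uniform target so that the classifier can no longer distinguish them.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def domain_classification_loss(logits, domain_label):
    """Cross-entropy for the domain classifier (assumed labels: 0 = image, 1 = text)."""
    probs = softmax(logits)
    return -math.log(probs[domain_label])


def domain_confusion_loss(logits):
    """Cross-entropy between a uniform target and the predicted domain distribution.

    It reaches its minimum, log(number of domains), when the classifier
    cannot tell the domains apart.
    """
    probs = softmax(logits)
    return -sum(math.log(p) for p in probs) / len(probs)


# Hypothetical domain-classifier logits for one image feature and one text feature.
image_logits = [2.0, -1.0]   # confidently classified as "image"
text_logits = [0.0, 0.0]     # classifier is fully confused

print(domain_classification_loss(image_logits, 0))
print(domain_confusion_loss(image_logits))
print(domain_confusion_loss(text_logits))
```

In this sketch the confusion loss is lowest for `text_logits`, where the two domains are indistinguishable, and higher for `image_logits`, where the classifier is confident. In an adversarial setup the two losses would be minimized alternately, one by the classifier and the other by the feature extractor.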
Pages: 12