Semantic Consistent Embedding for Domain Adaptive Zero-Shot Learning

Cited by: 5
Authors
Zhang, Jianyang [1 ]
Yang, Guowu [1 ]
Hu, Ping [2 ]
Lin, Guosheng [3 ]
Lv, Fengmao [4 ]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
[2] Boston Univ, Comp Sci Dept, Boston, MA 02215 USA
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[4] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Zero-shot learning; unsupervised domain adaptation; transfer learning
DOI
10.1109/TIP.2023.3293769
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Unsupervised domain adaptation has limitations when there is a label discrepancy between the source and target domains. Open-set domain adaptation approaches can handle a target domain that contains additional categories, but they can only detect those categories, not classify them further. In this paper, we focus on a more challenging setting dubbed Domain Adaptive Zero-Shot Learning (DAZSL), which uses semantic embeddings of class tags as the bridge between seen and unseen classes to learn a classifier that recognizes all categories in the target domain when supervision is available only for the seen categories in the source domain. The main challenge of DAZSL is to perform knowledge transfer across categories and domain styles simultaneously. To this end, we propose a novel end-to-end learning mechanism dubbed Three-way Semantic Consistent Embedding (TSCE) to embed the source domain, target domain, and semantic space into a shared space. Specifically, TSCE learns domain-irrelevant categorical prototypes from the semantic embeddings of class tags and uses them as the pivots of the shared space. The source domain features are aligned with the prototypes using their label supervision. In addition, a mutual information maximization mechanism is introduced to push the target domain features and the prototypes towards each other. In this way, our approach can reduce the domain gap between source and target images and promote knowledge transfer towards unseen classes. Moreover, as there is no supervision in the target domain, the shared space may suffer from catastrophic forgetting. Hence, we further propose a ranking-based embedding alignment mechanism to maintain consistency between the semantic space and the shared space. Experimental results on both I2AwA and I2WebV validate the effectiveness of our method. Code is available at https://github.com/tiggers23/TSCE-Domain-Adaptive-Zero-Shot-Learning.
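The two alignment ideas in the abstract (supervised alignment of source features to class prototypes, and mutual information maximization on unlabeled target features) can be illustrated with a minimal sketch. This is not the authors' implementation: the cosine-similarity classifier, the temperature value, the hand-set 2-D prototypes, and all function names below are assumptions made for illustration, and the common "marginal entropy minus mean conditional entropy" estimate is used as a stand-in for whatever mutual information objective TSCE actually optimizes. The ranking-based alignment term is not covered.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def entropy(p):
    return -sum(x * math.log(x) for x in p if x > 0.0)

def class_probs(feature, prototypes, temperature=0.1):
    # Class probabilities from similarity to each prototype (assumed classifier form).
    return softmax([cosine(feature, p) / temperature for p in prototypes])

def source_loss(feature, label, prototypes):
    # Cross-entropy that pulls a labeled source feature towards its class prototype.
    return -math.log(class_probs(feature, prototypes)[label])

def mutual_information(features, prototypes):
    # Estimate I(x; y) = H(mean prediction) - mean H(prediction) on unlabeled features.
    probs = [class_probs(f, prototypes) for f in features]
    n = len(probs)
    marginal = [sum(p[k] for p in probs) / n for k in range(len(prototypes))]
    return entropy(marginal) - sum(entropy(p) for p in probs) / n

# Hypothetical prototypes; in the paper they are learned from class-tag embeddings.
prototypes = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
# Target features that each sit close to a different prototype.
confident = [[1.0, 0.05], [0.05, 1.0], [-1.0, 0.05]]
# Target features that all collapse onto the same point.
collapsed = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]

mi_confident = mutual_information(confident, prototypes)
mi_collapsed = mutual_information(collapsed, prototypes)
loss_correct = source_loss([1.0, 0.05], 0, prototypes)
loss_wrong = source_loss([1.0, 0.05], 1, prototypes)
```

Under these assumptions, the estimate is high when target features are confidently assigned and spread across the prototypes, and zero when they all receive the same prediction, which is why maximizing it pushes target features towards distinct prototypes.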
Pages: 4024-4035
Number of pages: 12