Semantic Consistent Embedding for Domain Adaptive Zero-Shot Learning

被引：5

作者：

Zhang, Jianyang ^{[1
]}

Yang, Guowu ^{[1
]}

Hu, Ping ^{[2
]}

Lin, Guosheng ^{[3
]}

Lv, Fengmao ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China

[2] Boston Univ, Comp Sci Dept, Boston, MA 02215 USA

[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

[4] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Sichuan, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Zero-shot learning; unsupervised domain adaptation; transfer learning;

D O I：

10.1109/TIP.2023.3293769

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Unsupervised domain adaptation has limitations when encountering label discrepancy between the source and target domains. While open-set domain adaptation approaches can address situations when the target domain has additional categories, these methods can only detect them but not further classify them. In this paper, we focus on a more challenging setting dubbed Domain Adaptive Zero-Shot Learning (DAZSL), which uses semantic embeddings of class tags as the bridge between seen and unseen classes to learn the classifier for recognizing all categories in the target domain when only the supervision of seen categories in the source domain is available. The main challenge of DAZSL is to perform knowledge transfer across categories and domain styles simultaneously. To this end, we propose a novel end-to-end learning mechanism dubbed Three-way Semantic Consistent Embedding (TSCE) to embed the source domain, target domain, and semantic space into a shared space. Specifically, TSCE learns domain-irrelevant categorical prototypes from the semantic embedding of class tags and uses them as the pivots of the shared space. The source domain features are aligned with the prototypes via their supervised information. On the other hand, the mutual information maximization mechanism is introduced to push the target domain features and prototypes towards each other. By this way, our approach can align domain differences between source and target images, as well as promote knowledge transfer towards unseen classes. Moreover, as there is no supervision in the target domain, the shared space may suffer from the catastrophic forgetting problem. Hence, we further propose a ranking-based embedding alignment mechanism to maintain the consistency between the semantic space and the shared space. Experimental results on both I2AwA and I2WebV clearly validate the effectiveness of our method. Code is available at https://github.com/tiggers23/TSCE-Domain-Adaptive-Zero-Shot-Learning.

引用

页码：4024 / 4035

页数：12

共 50 条

[31] Semantic softmax loss for zero-shot learning
Ji, Zhong
Sun, Yuxin
Yu, Yunlong
Guo, Jichang
Pang, Yanwei
NEUROCOMPUTING, 2018, 316 : 369 - 375
[32] Generalized Zero-Shot Recognition based on Visually Semantic Embedding
Zhu, Pengkai
Wang, Hanxiao
Saligrama, Venkatesh
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2990 - 2998
[33] Generalised Zero-Shot Learning with Domain Classification in a Joint Semantic and Visual Space
Felix, Rafael
Harwood, Ben
Sasdelli, Michele
Carneiro, Gustavo
2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 17 - 24
[34] ADAPTIVE MULTI-SCALE SEMANTIC FUSION NETWORK FOR ZERO-SHOT LEARNING
Song, Jing
Peng, Peixi
Zhai, Yunpeng
Zhang, Chong
Tian, Yonghong
2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
[35] Region interaction and attribute embedding for zero-shot learning
Hu, Zhengwei
Zhao, Haitao
Peng, Jingchao
Gu, Xiaojing
INFORMATION SCIENCES, 2022, 609 : 984 - 995
[36] Incremental Embedding Learning via Zero-Shot Translation
Wei, Kun
Deng, Cheng
Yang, Xu
Li, Maosen
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10254 - 10262
[37] LEARNING VISUALLY CONSISTENT LABEL EMBEDDINGS FOR ZERO-SHOT LEARNING
Demirel, Berkan
Cinbis, Ramazan Gokberk
Ikizler-Cinbis, Nazli
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3656 - 3660
[38] Attentive Region Embedding Network for Zero-shot Learning
Xie, Guo-Sen
Liu, Li
Jin, Xiaobo
Zhu, Fan
Zhang, Zheng
Qin, Jie
Yao, Yazhou
Shao, Ling
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9376 - 9385
[39] Deep Unbiased Embedding Transfer for Zero-Shot Learning
Jia, Zhen
Zhang, Zhang
Wang, Liang
Shan, Caifeng
Tan, Tieniu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1958 - 1971
[40] Towards Effective Deep Embedding for Zero-Shot Learning
Zhang, Lei
Wang, Peng
Liu, Lingqiao
Shen, Chunhua
Wei, Wei
Zhang, Yanning
van den Hengel, Anton
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) : 2843 - 2852

← 1 2 3 4 5 →