Contrastive semantic disentanglement in latent space for generalized zero-shot learning

Cited by: 0
Authors
Fan, Wentao [1 ,2 ]
Liang, Chen [2 ]
Wang, Tian [3 ,4 ]
Affiliations
[1] Hong Kong Baptist Univ, Dept Comp Sci, Beijing Normal Univ, United Int Coll, Zhuhai, Guangdong, Peoples R China
[2] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen, Peoples R China
[3] Beijing Normal Univ, UIC Inst Artificial Intelligence & Future Network, Beijing, Peoples R China
[4] BNU, Guangdong Key Lab & Multi Modal Data Proc, United Int Coll, HKBU, Zhuhai, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Generalized zero-shot learning; Feature disentanglement; Contrastive learning; Generative model; Wasserstein GAN; ATTENTION;
DOI
10.1016/j.knosys.2022.109949
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The goal of generalized zero-shot learning (GZSL) is to train a model that can classify data samples from both seen and unseen categories when only labeled samples from seen categories are available. In this paper, we propose a GZSL approach based on conditional generative models that adopts a contrastive disentanglement learning framework to disentangle visual information in the latent space. Specifically, our model encodes original and generated visual features into a latent space in which these visual features are disentangled into semantic-related and semantic-unrelated representations. The proposed contrastive learning framework leverages class-level and instance-level supervision: it not only formulates a contrastive loss based on semantic-related information at the instance level, but also exploits semantic-unrelated representations and the corresponding semantic information to form negative sample pairs at the class level to further facilitate disentanglement. Then, GZSL classification is performed by training a supervised model (e.g., a softmax classifier) based only on semantic-related representations. The experimental results show that our model achieves state-of-the-art performance on several benchmark datasets, especially for unseen categories. The source code of the proposed model is available at: https://github.com/fwt-team/GZSL. © 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
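As a rough illustration of the instance-level part of the idea described in the abstract, the sketch below splits a latent vector into a semantic-related half and a semantic-unrelated half and applies a generic supervised InfoNCE-style contrastive loss to the semantic-related half. This is not the authors' implementation (see the repository linked above for that): the even split, the function names `split_latent` and `supervised_contrastive_loss`, and the temperature value are all assumptions made for the example, and the class-level negative pairs mentioned in the abstract are not modeled here.

```python
import numpy as np


def split_latent(latent):
    """Hypothetical split: first half = semantic-related, second half = semantic-unrelated."""
    half = latent.shape[1] // 2
    return latent[:, :half], latent[:, half:]


def supervised_contrastive_loss(z, labels, temperature=0.1):
    """Generic supervised InfoNCE-style loss (an assumption, not the paper's exact loss).

    z:      (n, d) array of semantic-related representations
    labels: (n,) array of class labels; same-label samples are treated as positives
    """
    z = z / np.linalg.norm(z, axis=1, keepdims=True)   # cosine similarity via unit vectors
    n = z.shape[0]
    not_self = ~np.eye(n, dtype=bool)

    # Pairwise similarities, with self-similarity excluded from the softmax.
    sim = np.where(not_self, z @ z.T / temperature, -np.inf)

    # Numerically stable log-softmax over all other samples in the batch.
    sim_max = sim.max(axis=1, keepdims=True)
    log_denom = sim_max + np.log(np.exp(sim - sim_max).sum(axis=1, keepdims=True))
    log_prob = sim - log_denom

    # Average the log-probability over each anchor's positives.
    positives = (labels[:, None] == labels[None, :]) & not_self
    pos_count = positives.sum(axis=1)
    valid = pos_count > 0                               # skip anchors without a positive
    pos_log_prob = np.where(positives, log_prob, 0.0).sum(axis=1)
    return float(-(pos_log_prob[valid] / pos_count[valid]).mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    latent = rng.normal(size=(8, 16))                   # stand-in for encoder outputs
    labels = np.array([0, 0, 1, 1, 2, 2, 3, 3])
    related, unrelated = split_latent(latent)
    print(supervised_contrastive_loss(related, labels))
```

In a setup like this, only the semantic-related half would be passed on to the final classifier, mirroring the abstract's statement that classification uses semantic-related representations only.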
Pages: 12
Related papers
50 in total
  • [1] Contrastive semantic disentanglement in latent space for generalized zero-shot learning
    Fan, Wentao
    Liang, Chen
    Wang, Tian
    [J]. Knowledge-Based Systems, 2022, 257
  • [2] Semantic Contrastive Embedding for Generalized Zero-Shot Learning
    Zongyan Han
    Zhenyong Fu
    Shuo Chen
    Jian Yang
    [J]. International Journal of Computer Vision, 2022, 130 : 2606 - 2622
  • [3] Semantic Contrastive Embedding for Generalized Zero-Shot Learning
    Han, Zongyan
    Fu, Zhenyong
    Chen, Shuo
    Yang, Jian
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (11) : 2606 - 2622
  • [4] Contrastive Embedding for Generalized Zero-Shot Learning
    Han, Zongyan
    Fu, Zhenyong
    Chen, Shuo
    Yang, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2371 - 2381
  • [5] Hierarchical Disentanglement of Discriminative Latent Features for Zero-shot Learning
    Tong, Bin
    Wang, Chao
    Klinkigt, Martin
    Kobayashi, Yoshiyuki
    Nonaka, Yuuichi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11459 - 11468
  • [6] Transferable Contrastive Network for Generalized Zero-Shot Learning
    Jiang, Huajie
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9764 - 9773
  • [7] Content-Attribute Disentanglement for Generalized Zero-Shot Learning
    An, Yoojin
    Kim, Sangyeon
    Liang, Yuxuan
    Zimmermann, Roger
    Kim, Dongho
    Kim, Jihie
    [J]. IEEE Access, 2022, 10 : 58320 - 58331
  • [8] Content-Attribute Disentanglement for Generalized Zero-Shot Learning
    An, Yoojin
    Kim, Sangyeon
    Liang, Yuxuan
    Zimmermann, Roger
    Kim, Dongho
    Kim, Jihie
    [J]. IEEE ACCESS, 2022, 10 : 58320 - 58331
  • [9] Generation-based contrastive model with semantic alignment for generalized zero-shot learning
    Yang, Jingqi
    Shen, Qi
    Xie, Cheng
    [J]. IMAGE AND VISION COMPUTING, 2023, 137
  • [10] Marginalized Latent Semantic Encoder for Zero-Shot Learning
    Ding, Zhengming
    Liu, Hongfu
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6184 - 6192