Dual Prototype Contrastive Network for Generalized Zero-Shot Learning

被引:1
|
作者
Jiang, Huajie [1 ]
Li, Zhengxian [1 ]
Hu, Yongli [1 ]
Yin, Baocai [1 ]
Yang, Jian [2 ]
van den Hengel, Anton [3 ]
Yang, Ming-Hsuan [4 ]
Qi, Yuankai
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] Macquarie Univ, Sch Comp, Sydney, NSW 2109, Australia
[3] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5000, Australia
[4] Univ Calif Merced, Dept Elect Engn & Comp Sci, Merced, CA 95343 USA
基金
中国国家自然科学基金;
关键词
Visualization; Semantics; Prototypes; Contrastive learning; Zero shot learning; Generative adversarial networks; Object recognition; Feature extraction; Training; Face recognition; Generalized zero-shot learning; prototype learning; contrastive learning;
D O I
10.1109/TCSVT.2024.3474910
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Generalized zero-shot learning (GZSL) requires that models are able to recognize classes they were trained on, and new classes they haven't seen before. Feature-generation approaches are popular due to their effectiveness in mitigating overfitting to the training classes. Existing generative approaches usually adopt simple discriminators for distribution or classification supervision, however, thus limiting their ability to generate visual features that are discriminative of and transferable to novel categories. To overcome this limitation and improve the quality of generated features, we propose a dual prototype contrastive augmented discriminator for the generative adversarial network. Specifically, we design a Dual Prototype Contrastive Network (DPCN), which leverages complementary information between visual space and semantic space through multi-task prototype contrastive learning. Contrastive learning of the visual prototypes enhances the ability of the generated features to distinguish between classes, while the contrastive learning of the semantic prototypes improves their transferability. Furthermore, we introduce margins into the contrastive learning process to ensure both intra-class compactness and inter-class separation. To demonstrate the effectiveness of the proposed approach, we conduct experiments on three widely-used zero-shot learning benchmark datasets, where DPCN achieves state-of-the-art performance for GZSL.
引用
收藏
页码:1111 / 1122
页数:12
相关论文
共 50 条
  • [21] 'Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning
    Feinglass, Joshua
    Thiagarajan, Jayaraman J.
    Anirudh, Rushil
    Jayram, T. S.
    Yang, Yezhou
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 7791 - 7798
  • [22] Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
    Li, Xiangyu
    Yang, Xu
    Wei, Kun
    Deng, Cheng
    Yang, Muli
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9316 - 9325
  • [23] A Dual Discriminator Method for Generalized Zero-Shot Learning
    Wei, Tianshu
    Huang, Jinjie
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (01): : 1599 - 1612
  • [24] Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning
    Ni, Jian
    Zhang, Shanghang
    Xie, Haiyong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] Contrastive semantic disentanglement in latent space for generalized zero-shot learning
    Fan, Wentao
    Liang, Chen
    Wang, Tian
    KNOWLEDGE-BASED SYSTEMS, 2022, 257
  • [26] Contrastive semantic disentanglement in latent space for generalized zero-shot learning
    Fan, Wentao
    Liang, Chen
    Wang, Tian
    Knowledge-Based Systems, 2022, 257
  • [27] Dual-Stream Contrastive Learning for Compositional Zero-Shot Recognition
    Yang, Yanhua
    Pan, Rui
    Li, Xiangyu
    Yang, Xu
    Deng, Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1909 - 1919
  • [28] Prototype rectification for zero-shot learning
    Yi, Yuanyuan
    Zeng, Guolei
    Ren, Bocheng
    Yang, Laurence T.
    Chai, Bin
    Li, Yuxin
    PATTERN RECOGNITION, 2024, 156
  • [29] Generalized Zero-Shot Learning with Deep Calibration Network
    Liu, Shichen
    Long, Mingsheng
    Wang, Jianmin
    Jordan, Michael I.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [30] Triple Verification Network for Generalized Zero-Shot Learning
    Zhang, Haofeng
    Long, Yang
    Guan, Yu
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 506 - 517