Towards Discriminative Feature Generation for Generalized Zero-Shot Learning

Cited by: 1
Authors
Ge, Jiannan [1]
Xie, Hongtao [1]
Li, Pandeng [1]
Xie, Lingxi [2]
Min, Shaobo [3]
Zhang, Yongdong [1]
Affiliations
[1] Univ Sci & Technol China, Natl Engn Lab Brain inspired Intelligence Technol, Hefei 230026, Peoples R China
[2] Huawei Cloud, Shenzhen 518100, Peoples R China
[3] Tencent, Shenzhen 518000, Peoples R China
Keywords
Semantics; Training; Visualization; Feature extraction; Zero-shot learning; Noise; Generators; recognition; multi-modality embedding; LOCALIZATION
DOI
10.1109/TMM.2024.3408048
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen categories by establishing visual and semantic relations. Recently, generation-based methods that synthesize fictitious visual features from the corresponding attributes have gained significant attention. However, the generated features often lack discriminative power because the generative model is inadequately trained. To address this issue, we propose a novel Discriminative Enhanced Network (DENet) that harnesses the potential of the generative model by adapting the training features and imposing constraints on the generated features. Our approach incorporates three pivotal modules. 1) Before generative network training, a Pre-Tuning Module (PTM) eliminates irrelevant background noise from the raw features extracted by a fixed CNN backbone, so that the generative model is trained on tuned features without redundant noise. 2) During generative network training, we propose an Asymmetry Cross-authenticity Contrastive (AC2) loss that groups visual features of the same category while repelling features from different categories by optimizing a large number of sample pairs. Additionally, we incorporate intra-class and relation-specific inter-class boundaries within the AC2 loss to enrich sample diversity and preserve valid semantic information. 3) Also during generative network training, a Dual-semantic Alignment Module (DAM) aligns visual features with both attributes and label embeddings, enabling the model to learn attribute-related information and discriminative extended semantics. Experiments on four standard benchmarks demonstrate that our approach learns more discriminative features and surpasses existing methods.
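The abstract does not give the form of the AC2 loss, so the following is only a rough illustration of the general idea it builds on: a contrastive objective that pulls same-category features together and pushes different-category features apart over many sample pairs. It is a generic supervised contrastive loss, not the paper's AC2 loss; it leaves out the asymmetric, cross-authenticity pairing and the intra-class and inter-class boundaries entirely. The function names, the cosine-similarity choice, and the `temperature` parameter are all assumptions made for this sketch.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (an all-zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def supervised_contrastive_loss(features, labels, temperature=0.1):
    """Generic supervised contrastive loss over one batch (hypothetical helper).

    For each anchor, samples with the same label are positives and all
    remaining samples are negatives. Returns the mean loss over anchors
    that have at least one positive, or 0.0 if no anchor has one.
    """
    feats = [l2_normalize(f) for f in features]
    n = len(feats)
    total, counted = 0.0, 0
    for i in range(n):
        # Cosine similarity to every other sample, scaled by the temperature.
        sims = {
            j: sum(a * b for a, b in zip(feats[i], feats[j])) / temperature
            for j in range(n)
            if j != i
        }
        positives = [j for j in sims if labels[j] == labels[i]]
        if not positives:
            continue
        # Log-sum-exp over all non-anchor samples, stabilized by the maximum.
        m = max(sims.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in sims.values()))
        # Average negative log-probability assigned to the positives.
        total += -sum(sims[j] - log_denom for j in positives) / len(positives)
        counted += 1
    return total / counted if counted else 0.0


if __name__ == "__main__":
    labels = [0, 0, 1, 1]
    clustered = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
    mixed = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.1, 0.9]]
    # Features that cluster by class should give the lower loss.
    print(supervised_contrastive_loss(clustered, labels))
    print(supervised_contrastive_loss(mixed, labels))
```

In this toy batch the loss is lower when features of the same class lie close together, which is the sense in which such a term can make generated features more discriminative.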
Pages: 10514-10529
Page count: 16
Related papers
50 in total
  • [31] A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning
    Rahman, Shafin
    Khan, Salman
    Porikli, Fatih
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5652 - 5667
  • [32] Learning Discriminative Instance Attribute for Zero-Shot Classification
    Wang, Lu
    Wu, Songsong
    Yu, Jun
    Jing, Xiao-Yuan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 210 - 213
  • [33] Zero-Shot Classification with Discriminative Semantic Representation Learning
    Ye, Meng
    Guo, Yuhong
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5103 - 5111
  • [34] Learning Discriminative Latent Attributes for Zero-Shot Classification
    Jiang, Huajie
    Wang, Ruiping
    Shan, Shiguang
    Yang, Yi
    Chen, Xilin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4233 - 4242
  • [35] Hierarchical Coupled Discriminative Dictionary Learning for Zero-Shot Learning
    Li, Shuang
    Wang, Lichun
    Wang, Shaofan
    Kong, Dehui
    Yin, Baocai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4973 - 4984
  • [36] Learning exclusive discriminative semantic information for zero-shot learning
    Jian-Xun Mi
    Zhonghao Zhang
    Debao Tai
    Li-Fang Zhou
    Wei Jia
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 761 - 772
  • [37] Improving Discriminative Learning for Zero-Shot Relation Extraction
    Tran, Van-Hien
    Ouchi, Hiroki
    Watanabe, Taro
    Matsumoto, Yuji
    PROCEEDINGS OF THE 1ST WORKSHOP ON SEMIPARAMETRIC METHODS IN NLP: DECOUPLING LOGIC FROM KNOWLEDGE (SPA-NLP 2022), 2022, : 1 - 6
  • [38] Fine-Grained Feature Generation for Generalized Zero-Shot Video Classification
    Hong, Mingyao
    Zhang, Xinfeng
    Li, Guorong
    Huang, Qingming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1599 - 1612
  • [39] A Robust Generalized Zero-Shot Learning Method with Attribute Prototype and Discriminative Attention Mechanism
    Liu, Xiaodong
    Luo, Weixing
    Du, Jiale
    Wang, Xinshuo
    Dang, Yuhao
    Liu, Yang
    ELECTRONICS, 2024, 13 (18)
  • [40] Contrastive Prototype-Guided Generation for Generalized Zero-Shot Learning
    Wang, Yunyun
    Mao, Jian
    Guo, Chenguang
    Chen, Songcan
    NEURAL NETWORKS, 2024, 176