Towards Discriminative Feature Generation for Generalized Zero-Shot Learning

被引:1
|
作者
Ge, Jiannan [1 ]
Xie, Hongtao [1 ]
Li, Pandeng [1 ]
Xie, Lingxi [2 ]
Min, Shaobo [3 ]
Zhang, Yongdong [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Brain inspired Intelligence Technol, Hefei 230026, Peoples R China
[2] Huawei Cloud, Shenzhen 518100, Peoples R China
[3] Tencent, Shenzhen 518000, Peoples R China
关键词
Semantics; Training; Visualization; Feature extraction; Zero-shot learning; Noise; Generators; recognition; multi-modality embedding; LOCALIZATION;
D O I
10.1109/TMM.2024.3408048
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen categories by establishing visual and semantic relations. Recently, generation-based methods that focus on synthesizing fictitious visual features from corresponding attributes have gained significant attention. However, these generated features often lack discriminative capabilities due to inadequate training of the generative model. To address this issue, we propose a novel Discriminative Enhanced Network (DENet) to harness the potential of the generative model by adapting the training features and imposing constraints on the generated features. Our approach incorporates three pivotal modules. 1) Before the generative network training, we implement a Pre-Tuning Module (PTM) to eliminate irrelevant background noise in the raw features extracted from a fixed CNN backbone. Therefore, PTM can provide tuned training features without redundant noise for generative model. 2) During the generative network training, we propose an Asymmetry Cross-authenticity Contrastive (AC2) loss to group visual features of the same category while repel features from different categories by optimizing a large number of sample pairs. Additionally, we incorporate intra-class and relation-specific inter-class boundaries within the AC2 loss to enrich sample diversity and preserve valid semantic information. 3) Also within the generative network training, a Dual-semantic Alignment Module (DAM) is designed to align visual features with both attributes and label embeddings, enabling the model to learn attribute-related information and discriminative extended semantics. Experiments on four standard benchmarks demonstrate that our approach learns more discriminative features and surpasses the existing methods.
引用
收藏
页码:10514 / 10529
页数:16
相关论文
共 50 条
  • [1] Learning discriminative and representative feature with cascade GAN for generalized zero-shot learning
    Liu, Jingren
    Fu, Liyong
    Zhang, Haofeng
    Ye, Qiaolin
    Yang, Wankou
    Liu, Li
    KNOWLEDGE-BASED SYSTEMS, 2022, 236
  • [2] Learning discriminative and representative feature with cascade GAN for generalized zero-shot learning
    Liu, Jingren
    Fu, Liyong
    Zhang, Haofeng
    Ye, Qiaolin
    Yang, Wankou
    Liu, Li
    Knowledge-Based Systems, 2022, 236
  • [3] GAN-MVAE: A discriminative latent feature generation framework for generalized zero-shot learning
    Ma, Peirong
    Lu, Hong
    Yang, Bohong
    Ran, Wu
    PATTERN RECOGNITION LETTERS, 2022, 155 : 77 - 83
  • [4] Inference guided feature generation for generalized zero-shot learning
    Han, Zongyan
    Fu, Zhenyong
    Li, Guangyu
    Yang, Jian
    NEUROCOMPUTING, 2021, 430 : 150 - 158
  • [5] Discriminative deep attributes for generalized zero-shot learning
    Kim, Hoseong
    Lee, Jewook
    Byun, Hyeran
    PATTERN RECOGNITION, 2022, 124
  • [6] Discriminative comparison classifier for generalized zero-shot learning
    Hou, Mingzhen
    Xia, Wei
    Zhang, Xiangdong
    Gao, Quanxue
    NEUROCOMPUTING, 2020, 414 (414) : 10 - 17
  • [7] Contrastive embedding-based feature generation for generalized zero-shot learning
    Wang, Han
    Zhang, Tingting
    Zhang, Xiaoxuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1669 - 1681
  • [8] Contrastive embedding-based feature generation for generalized zero-shot learning
    Han Wang
    Tingting Zhang
    Xiaoxuan Zhang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 1669 - 1681
  • [9] Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning
    Yang, Yanhua
    Zhang, Xiaozhe
    Yang, Muli
    Deng, Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 280 - 290
  • [10] Semantic Feature Extraction for Generalized Zero-Shot Learning
    Kim, Junhan
    Shim, Kyuhong
    Shim, Byonghyo
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1166 - 1173