Towards Discriminative Feature Generation for Generalized Zero-Shot Learning

被引:1
|
作者
Ge, Jiannan [1 ]
Xie, Hongtao [1 ]
Li, Pandeng [1 ]
Xie, Lingxi [2 ]
Min, Shaobo [3 ]
Zhang, Yongdong [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Brain inspired Intelligence Technol, Hefei 230026, Peoples R China
[2] Huawei Cloud, Shenzhen 518100, Peoples R China
[3] Tencent, Shenzhen 518000, Peoples R China
关键词
Semantics; Training; Visualization; Feature extraction; Zero-shot learning; Noise; Generators; recognition; multi-modality embedding; LOCALIZATION;
D O I
10.1109/TMM.2024.3408048
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen categories by establishing visual and semantic relations. Recently, generation-based methods that focus on synthesizing fictitious visual features from corresponding attributes have gained significant attention. However, these generated features often lack discriminative capabilities due to inadequate training of the generative model. To address this issue, we propose a novel Discriminative Enhanced Network (DENet) to harness the potential of the generative model by adapting the training features and imposing constraints on the generated features. Our approach incorporates three pivotal modules. 1) Before the generative network training, we implement a Pre-Tuning Module (PTM) to eliminate irrelevant background noise in the raw features extracted from a fixed CNN backbone. Therefore, PTM can provide tuned training features without redundant noise for generative model. 2) During the generative network training, we propose an Asymmetry Cross-authenticity Contrastive (AC2) loss to group visual features of the same category while repel features from different categories by optimizing a large number of sample pairs. Additionally, we incorporate intra-class and relation-specific inter-class boundaries within the AC2 loss to enrich sample diversity and preserve valid semantic information. 3) Also within the generative network training, a Dual-semantic Alignment Module (DAM) is designed to align visual features with both attributes and label embeddings, enabling the model to learn attribute-related information and discriminative extended semantics. Experiments on four standard benchmarks demonstrate that our approach learns more discriminative features and surpasses the existing methods.
引用
收藏
页码:10514 / 10529
页数:16
相关论文
共 50 条
  • [11] FREE: Feature Refinement for Generalized Zero-Shot Learning
    Chen, Shiming
    Wang, Wenjie
    Xia, Beihao
    Peng, Qinmu
    You, Xinge
    Zheng, Feng
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 122 - 131
  • [12] Unbiased feature generating for generalized zero-shot learning
    Niu, Chang
    Shang, Junyuan
    Huang, Junchu
    Yang, Junmei
    Song, Yuting
    Zhou, Zhiheng
    Zhou, Guoxu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89
  • [13] Transductive Zero-Shot Learning by Decoupled Feature Generation
    Marmoreo, Federico
    Cavazza, Jacopo
    Murino, Vittorio
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3108 - 3117
  • [14] Co-consistent Regularization with Discriminative Feature for Zero-Shot Learning
    Tian, Yanling
    Zhang, Weitong
    Zhang, Qieshi
    Cheng, Jun
    Hao, Pengyi
    Lu, Gang
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I, 2018, 11301 : 33 - 45
  • [15] Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
    Xu, Tingting
    Zhao, Ye
    Liu, Xueliang
    COMPLEXITY, 2021, 2021
  • [16] Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning
    Ye, Yalan
    He, Yukun
    Pan, Tongjie
    Li, Jingjing
    Shen, Heng Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1325 - 1337
  • [17] Class-Prototype Discriminative Network for Generalized Zero-Shot Learning
    Huang, Sheng
    Lin, Jingkai
    Huangfu, Luwen
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 301 - 305
  • [18] Generalized zero-shot learning via discriminative and transferable disentangled representations
    Zhang, Chunyu
    Li, Zhanshan
    NEURAL NETWORKS, 2025, 183
  • [19] Contrastive visual feature filtering for generalized zero-shot learning
    Meng, Shixuan
    Jiang, Rongxin
    Tian, Xiang
    Zhou, Fan
    Chen, Yaowu
    Liu, Junjie
    Shen, Chen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [20] Aligning enhanced feature representation for generalized zero-shot learning
    Fang, Zhiyu
    Zhu, Xiaobin
    Yang, Chun
    Zhou, Hongyang
    Qin, Jingyan
    Yin, Xu-Cheng
    SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (02)