Towards Discriminative Feature Generation for Generalized Zero-Shot Learning

被引:1
|
作者
Ge, Jiannan [1 ]
Xie, Hongtao [1 ]
Li, Pandeng [1 ]
Xie, Lingxi [2 ]
Min, Shaobo [3 ]
Zhang, Yongdong [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Brain inspired Intelligence Technol, Hefei 230026, Peoples R China
[2] Huawei Cloud, Shenzhen 518100, Peoples R China
[3] Tencent, Shenzhen 518000, Peoples R China
关键词
Semantics; Training; Visualization; Feature extraction; Zero-shot learning; Noise; Generators; recognition; multi-modality embedding; LOCALIZATION;
D O I
10.1109/TMM.2024.3408048
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen categories by establishing visual and semantic relations. Recently, generation-based methods that focus on synthesizing fictitious visual features from corresponding attributes have gained significant attention. However, these generated features often lack discriminative capabilities due to inadequate training of the generative model. To address this issue, we propose a novel Discriminative Enhanced Network (DENet) to harness the potential of the generative model by adapting the training features and imposing constraints on the generated features. Our approach incorporates three pivotal modules. 1) Before the generative network training, we implement a Pre-Tuning Module (PTM) to eliminate irrelevant background noise in the raw features extracted from a fixed CNN backbone. Therefore, PTM can provide tuned training features without redundant noise for generative model. 2) During the generative network training, we propose an Asymmetry Cross-authenticity Contrastive (AC2) loss to group visual features of the same category while repel features from different categories by optimizing a large number of sample pairs. Additionally, we incorporate intra-class and relation-specific inter-class boundaries within the AC2 loss to enrich sample diversity and preserve valid semantic information. 3) Also within the generative network training, a Dual-semantic Alignment Module (DAM) is designed to align visual features with both attributes and label embeddings, enabling the model to learn attribute-related information and discriminative extended semantics. Experiments on four standard benchmarks demonstrate that our approach learns more discriminative features and surpasses the existing methods.
引用
收藏
页码:10514 / 10529
页数:16
相关论文
共 50 条
  • [21] Aligning enhanced feature representation for generalized zero-shot learning
    Zhiyu FANG
    Xiaobin ZHU
    Chun YANG
    Hongyang ZHOU
    Jingyan QIN
    Xu-Cheng YIN
    Science China(Information Sciences), 2025, 68 (02) : 74 - 88
  • [22] Mitigating Generation Shi!s for Generalized Zero-Shot Learning
    Chen, Zhi
    Luo, Yadan
    Wang, Sen
    Qiu, Ruihong
    Li, Jingjing
    Huang, Zi
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 844 - 852
  • [23] Beyond Normal Distribution: More Factual Feature Generation Network for Generalized Zero-Shot Learning
    Liu, Jingren
    Bai, Haoyue
    Zhang, Haofeng
    Liu, Li
    IEEE MULTIMEDIA, 2022, 29 (03) : 69 - 79
  • [24] DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot Learning
    Li, Bonan
    Hu, Yinhan
    Han, Congying
    Guo, Tiande
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1851 - 1857
  • [25] Towards Open Zero-Shot Learning
    Marmoreo, Federico
    Carrazco, Julio Ivan Davila
    Cavazza, Jacopo
    Murino, Vittorio
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 564 - 575
  • [26] Learning exclusive discriminative semantic information for zero-shot learning
    Mi, Jian-Xun
    Zhang, Zhonghao
    Tai, Debao
    Zhou, Li-Fang
    Jia, Wei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 761 - 772
  • [27] Discriminative and Robust Attribute Alignment for Zero-Shot Learning
    Cheng, De
    Wang, Gerong
    Wang, Nannan
    Zhang, Dingwen
    Zhang, Qiang
    Gao, Xinbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4244 - 4256
  • [28] Discriminative Learning of Latent Features for Zero-Shot Recognition
    Li, Yan
    Zhang, Junge
    Zhang, Jianguo
    Huang, Kaiqi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7463 - 7471
  • [29] Zero-shot learning via discriminative representation extraction
    Long, Teng
    Xu, Xing
    Shen, Fumin
    Liu, Li
    Xie, Ning
    Yang, Yang
    PATTERN RECOGNITION LETTERS, 2018, 109 : 27 - 34
  • [30] Discriminative Latent Attribute Autoencoder for Zero-Shot Learning
    Chen, Runqing
    Wu, Songsong
    Sun, Guangcheng
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 873 - 877