Towards Discriminative Feature Generation for Generalized Zero-Shot Learning

Cited by: 1
Authors
Ge, Jiannan [1]
Xie, Hongtao [1]
Li, Pandeng [1]
Xie, Lingxi [2]
Min, Shaobo [3]
Zhang, Yongdong [1]
Affiliations
[1] Univ Sci & Technol China, Natl Engn Lab Brain inspired Intelligence Technol, Hefei 230026, Peoples R China
[2] Huawei Cloud, Shenzhen 518100, Peoples R China
[3] Tencent, Shenzhen 518000, Peoples R China
Keywords
Semantics; Training; Visualization; Feature extraction; Zero-shot learning; Noise; Generators; Recognition; Multi-modality embedding; Localization
DOI
10.1109/TMM.2024.3408048
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen categories by establishing visual and semantic relations. Recently, generation-based methods that synthesize fictitious visual features from the corresponding attributes have gained significant attention. However, these generated features often lack discriminative capability because the generative model is inadequately trained. To address this issue, we propose a novel Discriminative Enhanced Network (DENet) that harnesses the potential of the generative model by adapting the training features and imposing constraints on the generated features. Our approach incorporates three pivotal modules. 1) Before the generative network is trained, a Pre-Tuning Module (PTM) eliminates irrelevant background noise from the raw features extracted by a fixed CNN backbone. PTM thus provides the generative model with tuned training features free of redundant noise. 2) During generative network training, we propose an Asymmetry Cross-authenticity Contrastive (AC2) loss that groups visual features of the same category while repelling features from different categories by optimizing a large number of sample pairs. Additionally, we incorporate intra-class and relation-specific inter-class boundaries within the AC2 loss to enrich sample diversity and preserve valid semantic information. 3) Also during generative network training, a Dual-semantic Alignment Module (DAM) aligns visual features with both attributes and label embeddings, enabling the model to learn attribute-related information and discriminative extended semantics. Experiments on four standard benchmarks demonstrate that our approach learns more discriminative features and surpasses existing methods.
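The contrastive idea in item 2) of the abstract, pulling same-category features together and pushing different categories apart, can be illustrated with a minimal sketch. This is a generic supervised contrastive loss applied to a mix of real and generated features, not the paper's AC2 loss: it omits the asymmetry and the intra-class and inter-class boundaries, whose exact form the abstract does not give. The function names, the temperature value, and the toy vectors are all assumptions made for illustration.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (zero vectors are left unchanged)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def supervised_contrastive_loss(features, labels, temperature=0.1):
    """Generic supervised contrastive loss (illustrative, NOT the AC2 loss).

    features: list of feature vectors (real and generated, mixed together)
    labels:   list of class ids, one per feature
    """
    z = [l2_normalize(f) for f in features]
    n = len(z)
    total, anchors = 0.0, 0
    for i in range(n):
        # Positives: every other sample that shares the anchor's class.
        positives = [j for j in range(n) if j != i and labels[j] == labels[i]]
        if not positives:
            continue
        # Temperature-scaled cosine similarity to every other sample.
        sims = {
            j: sum(a * b for a, b in zip(z[i], z[j])) / temperature
            for j in range(n) if j != i
        }
        # Log-sum-exp over all non-anchor samples, shifted for stability.
        m = max(sims.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in sims.values()))
        # Average negative log-probability of the positives.
        total += -sum(sims[j] - log_denom for j in positives) / len(positives)
        anchors += 1
    return total / anchors if anchors else 0.0


# Toy data: two "real" features and two "generated" features (hypothetical).
real = [[1.0, 0.0], [0.0, 1.0]]
generated = [[0.9, 0.1], [0.1, 0.9]]
features = real + generated

# Generated features labelled with the class they resemble.
loss_matched = supervised_contrastive_loss(features, [0, 1, 0, 1])
# Generated features labelled with the opposite class.
loss_mismatched = supervised_contrastive_loss(features, [0, 1, 1, 0])
print(loss_matched, loss_mismatched)
```

In this toy setting the loss is lower when each generated feature lies close to the real feature of its own class, which is the kind of pressure a contrastive constraint places on a feature generator.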
Pages: 10514-10529
Page count: 16