Superclass-aware visual feature disentangling for generalized zero-shot learning

被引:0
|
作者
Niu, Chang [1 ,2 ]
Shang, Junyuan [1 ,3 ]
Zhou, Zhiheng [1 ,4 ]
Yang, Junmei [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Peoples R China
[2] Foshan Univ, Sch Comp Sci & Artificial Intelligence, Foshan 528000, Peoples R China
[3] GRG Banking Equipment Co Ltd, Guangzhou 510663, Peoples R China
[4] South China Univ Technol, Key Lab Big Data & Intelligent Robot, Minist Educ, Guangzhou, Peoples R China
关键词
Zero-shot learning; Transfer learning; Image classification; Action recognition;
D O I
10.1016/j.eswa.2024.125150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) aims to learn a model trained on seen samples with the ability to recognize samples from unseen classes, while generalized ZSL (GZSL) takes a step closer to realistic scenarios by recognizing both of seen and unseen samples. The existing methods rely on the semantic descriptions as the side-information and conduct tight alignment between the visual and semantic spaces. However, the tight modality alignment may result in incomplete representations, leading to the loss of originally detailed and discriminative information. In this paper, we propose a simple yet effective superclass-aware visual feature disentangling method termed as SupVFD for GZSL. We use the neighbor relations of the semantic descriptions to define superclass and with the guide of superclass, our method disentangles visual features into discriminative and transferable factors. To this end, the semantic descriptions are used as implicit supervision, which preserves the valuable detailed and discriminative information in the visual features. The extensive experiments in both ZSL and GZSL settings prove our method outperforms the state-of-the-art methods for image object classification as well as video action recognition. Code is available at our github: https://github.com/changniu54/SupVFD-Master.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Learning discriminative and representative feature with cascade GAN for generalized zero-shot learning
    Liu, Jingren
    Fu, Liyong
    Zhang, Haofeng
    Ye, Qiaolin
    Yang, Wankou
    Liu, Li
    Knowledge-Based Systems, 2022, 236
  • [22] DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot Learning
    Li, Bonan
    Hu, Yinhan
    Han, Congying
    Guo, Tiande
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1851 - 1857
  • [23] SEMANTIC MANIFOLD ALIGNMENT IN VISUAL FEATURE SPACE FOR ZERO-SHOT LEARNING
    Liao, Changsu
    Su, Li
    Zhang, Wegang
    Huang, Qingming
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [24] Audio-Visual Generalized Zero-Shot Learning the Easy Way
    Mo, Shentong
    Morgado, Pedro
    COMPUTER VISION - ECCV 2024, PT LXXI, 2025, 15129 : 377 - 395
  • [25] A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning
    Rahman, Shafin
    Khan, Salman
    Porikli, Fatih
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5652 - 5667
  • [26] Contrastive embedding-based feature generation for generalized zero-shot learning
    Wang, Han
    Zhang, Tingting
    Zhang, Xiaoxuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1669 - 1681
  • [27] Co-GZSL: Feature Contrastive Optimization for Generalized Zero-Shot Learning
    Qun Li
    Zhuxi Zhan
    Yaying Shen
    Bir Bhanu
    Neural Processing Letters, 56
  • [28] Augmented semantic feature based generative network for generalized zero-shot learning
    Li, Zhiqun
    Chen, Qiong
    Liu, Qingfa
    NEURAL NETWORKS, 2021, 143 : 1 - 11
  • [29] Contrastive embedding-based feature generation for generalized zero-shot learning
    Han Wang
    Tingting Zhang
    Xiaoxuan Zhang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 1669 - 1681
  • [30] Dual-Aligned Feature Confusion Alleviation for Generalized Zero-Shot Learning
    Su, Hongzu
    Li, Jingjing
    Lu, Ke
    Zhu, Lei
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3774 - 3785