Discriminative and Robust Attribute Alignment for Zero-Shot Learning

被引:20
|
作者
Cheng, De [1 ]
Wang, Gerong [2 ]
Wang, Nannan [1 ]
Zhang, Dingwen [3 ,4 ]
Zhang, Qiang [5 ]
Gao, Xinbo [6 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
[2] Beijing Inst Remote Sensing Equipment, Beijing 100854, Peoples R China
[3] Inst Artificial Intelligence, Hefei Comprehens Natl Sci Ctr, Hefei 230088, Peoples R China
[4] Northwestern Polytech Univ, Sch Automat, Xian 710060, Shaanxi, Peoples R China
[5] Xidian Univ, Sch Mechanoelect Engn, Xian 710071, Shaanxi, Peoples R China
[6] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; attribute alignment; contrastive learning; consistency regularization;
D O I
10.1109/TCSVT.2023.3243205
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Zero-shot learning (ZSL) aims to learn models that can recognize images of semantically related unseen categories, through transferring attribute-based knowledge learned from training data of seen classes to unseen testing data. As visual attributes play a vital role in ZSL, recent embedding-based methods usually focus on learning a compatibility function between the visual representation and the class semantic attributes. While in this work, in addition to simply learning the region embedding of different semantic attributes to maintain the generalization capability of the learned model, we further consider to improve the discrimination power of the learned visual features themselves by contrastive embedding. It exploits both the class-wise and instance-wise supervision for GZSL, under the attribute guided weakly supervised representation learning framework. To further improve the robustness of the ZSL model, we also propose to train the model under the consistency regularization constraint, through taking full advantages of self-supervised signals of the image under various perturbed augmentation situations, which could make the model robust to some occluded or un-related attribute regions. Extensive experimental results demonstrate the effectiveness of the proposed ZSL method, achieving superior performances to state-of-the-art methods on three widely-used benchmark datasets, namely CUB, SUN, and AWA2. Our source code is released at https://github.com/KORIYN/CC-ZSL.
引用
收藏
页码:4244 / 4256
页数:13
相关论文
共 50 条
  • [31] Denoised and Dynamic Alignment Enhancement for Zero-Shot Learning
    Ge, Jiannan
    Liu, Zhihang
    Li, Pandeng
    Xie, Lingxi
    Zhang, Yongdong
    Tian, Qi
    Xie, Hongtao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1501 - 1515
  • [32] Towards Discriminative Feature Generation for Generalized Zero-Shot Learning
    Ge, Jiannan
    Xie, Hongtao
    Li, Pandeng
    Xie, Lingxi
    Min, Shaobo
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10514 - 10529
  • [33] An Inverse Mapping with Manifold Alignment for Zero-Shot Learning
    Wu, Xixun
    Song, Binheng
    Wang, Zhixiang
    Yuan, Chun
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 400 - 411
  • [34] Learning discriminative visual semantic embedding for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Yuan, Jianying
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 115
  • [35] Hierarchical Disentanglement of Discriminative Latent Features for Zero-shot Learning
    Tong, Bin
    Wang, Chao
    Klinkigt, Martin
    Kobayashi, Yoshiyuki
    Nonaka, Yuuichi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11459 - 11468
  • [36] Discriminative Embedding Autoencoder With a Regressor Feedback for Zero-Shot Learning
    Shi, Ying
    Wei, Wei
    IEEE ACCESS, 2020, 8 : 11019 - 11030
  • [37] Attribute-Modulated Generative Meta Learning for Zero-Shot Learning
    Li, Yun
    Liu, Zhe
    Yao, Lina
    Chang, Xiaojun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1600 - 1610
  • [38] Zero-Shot Learning Based on Deep Weighted Attribute Prediction
    Wang, Xuesong
    Chen, Chen
    Cheng, Yuhu
    Chen, Xun
    Liu, Yu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (08): : 2948 - 2957
  • [39] Person Search by Text Attribute Query as Zero-Shot Learning
    Dong, Qi
    Gong, Shaogang
    Zhu, Xiatian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3651 - 3660
  • [40] A Large-scale Attribute Dataset for Zero-shot Learning
    Zhao, Bo
    Fu, Yanwei
    Liang, Rui
    Wu, Jiahong
    Wang, Yonggang
    Wang, Yizhou
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 398 - 407