Generative Model with Semantic Embedding and Integrated Classifier for Generalized Zero-Shot Learning

被引:0
|
作者
Pambala, Ayyappa Kumar [1 ]
Dutta, Titir [1 ]
Biswas, Soma [1 ]
机构
[1] Indian Inst Sci, Bangalore, Karnataka, India
关键词
D O I
10.1109/wacv45572.2020.9093625
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative models have achieved impressive performance for the generalized zero-shot learning task by learning the mapping from attributes to feature space. In this work, we propose to derive semantic inferences from images and use them for the generation, which enables us to capture the bidirectional information i.e., visual to semantic and semantic to visual spaces. Specifically, we propose a Semantic Embedding module which not only gives image specific semantic information to the generative model for generation of better features, but also makes sure that the generated features can be mapped to the correct semantic space. We also propose an Integrated Classifier, which is trained along with the generator. This module not only eliminates the requirement of additional classifier for new object categories which is required by the existing generative approaches, but also facilitates the generation of more discriminative and useful features. This approach can be used seamlessly for the task of few-shot learning. Extensive experiments on four benchmark datasets, namely, CUB, SUN, AWA1, AWA2 for both zero-shot learning and few-shot setting show the effectiveness of the proposed approach.
引用
收藏
页码:1226 / 1235
页数:10
相关论文
共 50 条
  • [11] Generalized Zero-Shot Recognition based on Visually Semantic Embedding
    Zhu, Pengkai
    Wang, Hanxiao
    Saligrama, Venkatesh
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2990 - 2998
  • [12] Discriminative comparison classifier for generalized zero-shot learning
    Hou, Mingzhen
    Xia, Wei
    Zhang, Xiangdong
    Gao, Quanxue
    [J]. NEUROCOMPUTING, 2020, 414 (414) : 10 - 17
  • [13] A Variational Autoencoder with Deep Embedding Model for Generalized Zero-Shot Learning
    Ma, Peirong
    Hu, Xiao
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11733 - 11740
  • [14] A Joint Generative Model for Zero-Shot Learning
    Gao, Rui
    Hou, Xingsong
    Qin, Jie
    Liu, Li
    Zhu, Fan
    Zhang, Zhao
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 631 - 646
  • [15] Evolving Semantic Prototype Improves Generative Zero-Shot Learning
    Chen, Shiming
    Hou, Wenjin
    Hong, Ziming
    Ding, Xiaohan
    Song, Yibing
    You, Xinge
    Liu, Tongliang
    Zhang, Kun
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [16] Learning a Deep Embedding Model for Zero-Shot Learning
    Zhang, Li
    Xiang, Tao
    Gong, Shaogang
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3010 - 3019
  • [17] Semantic Feature Extraction for Generalized Zero-Shot Learning
    Kim, Junhan
    Shim, Kyuhong
    Shim, Byonghyo
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1166 - 1173
  • [18] Learning discriminative visual semantic embedding for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Yuan, Jianying
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 115
  • [19] Transductive Visual-Semantic Embedding for Zero-shot Learning
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    Shao, Jie
    Huang, Zi
    [J]. PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 41 - 49
  • [20] Domain-Oriented Semantic Embedding for Zero-Shot Learning
    Min, Shaobo
    Yao, Hantao
    Xie, Hongtao
    Zha, Zheng-Jun
    Zhang, Yongdong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3919 - 3930