Deep Unbiased Embedding Transfer for Zero-Shot Learning

被引:32
|
作者
Jia, Zhen [1 ]
Zhang, Zhang [1 ]
Wang, Liang [1 ,2 ,3 ]
Shan, Caifeng [4 ]
Tan, Tieniu [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Ctr Res Intelligent Percept & Comp, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, AIR, Beijing 100190, Peoples R China
基金
国家重点研发计划;
关键词
Visualization; Feature extraction; Semantics; Training; Seals; Prototypes; Indexes; Zero-shot learning; image classification; projection domain shift; convolutional neural network; generative adversarial network;
D O I
10.1109/TIP.2019.2947780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning aims to recognize objects which do not appear in the training dataset. Previous prevalent mapping-based zero-shot learning methods suffer from the projection domain shift problem due to the lack of image classes in the training stage. In order to alleviate the projection domain shift problem, a deep unbiased embedding transfer (DUET) model is proposed in this paper. The DUET model is composed of a deep embedding transfer (DET) module and an unseen visual feature generation (UVG) module. In the DET module, a novel combined embedding transfer net which integrates the complementary merits of the linear and nonlinear embedding mapping functions is proposed to connect the visual space and semantic space. Whats more, the end-to-end joint training process is implemented to train the visual feature extractor and the combined embedding transfer net simultaneously. In the UVG module, a visual feature generator trained with a conditional generative adversarial framework is used to synthesize the visual features of the unseen classes to ease the disturbance of the projection domain shift problem. Furthermore, a quantitative index, namely the score of resistance on domain shift (ScoreRDS), is proposed to evaluate different models regarding their resistance capability on the projection domain shift problem. The experiments on five zero-shot learning benchmarks verify the effectiveness of the proposed DUET model. As demonstrated by the qualitative and quantitative analysis, the unseen class visual feature generation, the combined embedding transfer net and the end-to-end joint training process all contribute to alleviating projection domain shift in zero-shot learning.
引用
收藏
页码:1958 / 1971
页数:14
相关论文
共 50 条
  • [1] Transductive Unbiased Embedding for Zero-Shot Learning
    Song, Jie
    Shen, Chengchao
    Yang, Yezhou
    Liu, Yang
    Song, Mingli
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1024 - 1033
  • [2] Learning a Deep Embedding Model for Zero-Shot Learning
    Zhang, Li
    Xiang, Tao
    Gong, Shaogang
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3010 - 3019
  • [3] Towards Effective Deep Embedding for Zero-Shot Learning
    Zhang, Lei
    Wang, Peng
    Liu, Lingqiao
    Shen, Chunhua
    Wei, Wei
    Zhang, Yanning
    van den Hengel, Anton
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) : 2843 - 2852
  • [4] A Variational Autoencoder with Deep Embedding Model for Generalized Zero-Shot Learning
    Ma, Peirong
    Hu, Xiao
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11733 - 11740
  • [5] Contrastive Embedding for Generalized Zero-Shot Learning
    Han, Zongyan
    Fu, Zhenyong
    Chen, Shuo
    Yang, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2371 - 2381
  • [6] Learning Unbiased Zero-Shot Semantic Segmentation Networks Via Transductive Transfer
    Lv, Fengmao
    Liu, Haiyang
    Wang, Yichen
    Zhao, Jiayi
    Yang, Guowu
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1640 - 1644
  • [7] Disentangled Ontology Embedding for Zero-shot Learning
    Geng, Yuxia
    Chen, Jiaoyan
    Zhang, Wen
    Xu, Yajing
    Chen, Zhuo
    Pan, Jeff Z.
    Huang, Yufeng
    Xiong, Feiyu
    Chen, Huajun
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 443 - 453
  • [8] Unbiased hybrid generation network for zero-shot learning
    Wang, Zong-Hui
    Lu, Zi-Qian
    Lu, Zhe-Ming
    [J]. ELECTRONICS LETTERS, 2020, 56 (18) : 929 - +
  • [9] Unbiased feature generating for generalized zero-shot learning
    Niu, Chang
    Shang, Junyuan
    Huang, Junchu
    Yang, Junmei
    Song, Yuting
    Zhou, Zhiheng
    Zhou, Guoxu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89
  • [10] Region interaction and attribute embedding for zero-shot learning
    Hu, Zhengwei
    Zhao, Haitao
    Peng, Jingchao
    Gu, Xiaojing
    [J]. INFORMATION SCIENCES, 2022, 609 : 984 - 995