Multi-Modality Adversarial Auto-Encoder for Zero-Shot Learning

被引:3
|
作者
Ji, Zhong [1 ]
Dai, Guangwen [1 ]
Yu, Yunlong [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
基金
中国国家自然科学基金;
关键词
Zero-shot learning; adversarial network; auto-encoder; image recognition;
D O I
10.1109/ACCESS.2019.2962298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing generative Zero-Shot Learning (ZSL) methods only consider the unidirectional alignment from the class semantics to the visual features while ignoring the alignment from the visual features to the class semantics, which fails to construct the visual-semantic interactions well. In this paper, we propose to generate visual features based on an auto-encoder framework paired with multi-modality adversarial networks respectively for visual and semantic modalities to reinforce the visual-semantic interactions with a bidirectional alignment, which ensures the generated visual features to fit the real visual distribution and to be highly related to the semantics. The encoder aims at generating real-like visual features while the decoder forces both the real and the generated visual features to be more related to the class semantics. To further capture the discriminative information of the generated visual features, both the real and generated visual features are forced to be classified into the correct classes via a classification network. Experimental results on four benchmark datasets show that the proposed approach is particularly competitive on both the traditional ZSL and the generalized ZSL tasks.
引用
收藏
页码:9287 / 9295
页数:9
相关论文
共 50 条
  • [31] Adversarial Learning for Zero-Shot Stance Detection on Social Media
    Allaway, Emily
    Srikanth, Malavika
    McKeown, Kathleen
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 4756 - 4767
  • [32] Cross-Domain Adversarial Learning for Zero-Shot Classification
    Liu H.
    Zheng Q.
    Luo M.
    Zhao H.
    Xiao Y.
    Lü Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (12): : 2521 - 2535
  • [33] Inductive Generalized Zero-Shot Learning with Adversarial Relation Network
    Yang, Guanyu
    Huang, Kaizhu
    Zhang, Rui
    Goulermas, John Y.
    Hussain, Amir
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 724 - 739
  • [34] Adversarial unseen visual feature synthesis for Zero-shot Learning
    Zhang, Haofeng
    Long, Yang
    Liu, Li
    Shao, Ling
    NEUROCOMPUTING, 2019, 329 : 12 - 20
  • [35] Generative Dual Adversarial Network for Generalized Zero-shot Learning
    Huang, He
    Wang, Changhu
    Yu, Philip S.
    Wang, Chang-Dong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 801 - 810
  • [36] JS']JSE: Joint Semantic Encoder for zero-shot gesture learning
    Madapana, Naveen
    Wachs, Juan
    PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (03) : 679 - 692
  • [37] Few-Shot Learning Based on Self-Attention and Auto-Encoder
    Ji, Zhong
    Chai, Xingliang
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2021, 54 (04): : 338 - 345
  • [38] Coupled generative adversarial stacked Auto-encoder: CoGASA
    Kiasari, Mohammad Ahangar
    Moirangthem, Dennis Singh
    Lee, Minho
    NEURAL NETWORKS, 2018, 100 : 1 - 9
  • [39] Active Learning with Multi-Granular Graph Auto-Encoder
    He, Yi
    Yuan, Xu
    Tzeng, Nian-Feng
    Wu, Xindong
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1058 - 1063
  • [40] Adversarial auto-encoder for rating prediction with ratings and reviews
    Yi, Jin
    Huang, Jiajin
    Qin, Jin
    WEB INTELLIGENCE, 2020, 18 (04) : 285 - 294