Multi-Modality Adversarial Auto-Encoder for Zero-Shot Learning

被引:3
|
作者
Ji, Zhong [1 ]
Dai, Guangwen [1 ]
Yu, Yunlong [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
基金
中国国家自然科学基金;
关键词
Zero-shot learning; adversarial network; auto-encoder; image recognition;
D O I
10.1109/ACCESS.2019.2962298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing generative Zero-Shot Learning (ZSL) methods only consider the unidirectional alignment from the class semantics to the visual features while ignoring the alignment from the visual features to the class semantics, which fails to construct the visual-semantic interactions well. In this paper, we propose to generate visual features based on an auto-encoder framework paired with multi-modality adversarial networks respectively for visual and semantic modalities to reinforce the visual-semantic interactions with a bidirectional alignment, which ensures the generated visual features to fit the real visual distribution and to be highly related to the semantics. The encoder aims at generating real-like visual features while the decoder forces both the real and the generated visual features to be more related to the class semantics. To further capture the discriminative information of the generated visual features, both the real and generated visual features are forced to be classified into the correct classes via a classification network. Experimental results on four benchmark datasets show that the proposed approach is particularly competitive on both the traditional ZSL and the generalized ZSL tasks.
引用
收藏
页码:9287 / 9295
页数:9
相关论文
共 50 条
  • [1] Variational Auto-Encoder Combined with Knowledge Graph Zero-Shot Learning
    Zhang, Haitao
    Su, Lin
    Computer Engineering and Applications, 2023, 59 (01): : 236 - 243
  • [2] Zero-Shot Learning via Discriminative Dual Semantic Auto-Encoder
    Xing, Nan
    Liu, Yang
    Zhu, Hong
    Wang, Jing
    Han, Jungong
    IEEE ACCESS, 2021, 9 : 733 - 742
  • [3] Bi-shifting semantic auto-encoder for zero-shot learning
    Wang, Yu
    ELECTRONIC RESEARCH ARCHIVE, 2022, 30 (01): : 140 - 167
  • [4] Double Discriminative Graph Regularized Semantic Auto-Encoder for Zero-shot Learning
    Tai, Debao
    Zhang, Zhonghao
    PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [5] GENERALIZED ZERO-SHOT LEARNING USING MULTIMODAL VARIATIONAL AUTO-ENCODER WITH SEMANTIC CONCEPTS
    Bendre, Nihar
    Desai, Kevin
    Najafirad, Peyman
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1284 - 1288
  • [6] A Simple Discriminative Dual Semantic Auto-encoder for Zero-shot Classification
    Liu, Yang
    Li, Jin
    Gao, Xinbo
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4053 - 4057
  • [7] Domain-aware multi-modality fusion network for generalized zero-shot learning
    Wang, Jia
    Wang, Xiao
    Zhang, Han
    NEUROCOMPUTING, 2022, 488 : 23 - 35
  • [8] Modal-nexus auto-encoder for multi-modality cellular data integration and imputation
    Tang, Zhenchao
    Chen, Guanxing
    Chen, Shouzhi
    Yao, Jianhua
    You, Linlin
    Chen, Calvin Yu-Chian
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [9] Zero-Shot Image Recognition Algorithm via Semantic Auto-Encoder Combining Relation Network
    Lin K.
    Li H.
    Bai J.
    Li A.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (03): : 214 - 224
  • [10] Adversarial strategy for transductive zero-shot learning
    Liu, Youfa
    Du, Bo
    Ni, Fuchuan
    INFORMATION SCIENCES, 2021, 578 : 750 - 761