Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network

Cited by: 36
Authors
Chi, Jingze [1]
Peng, Yuxin [1]
Affiliations
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Gallium nitride; Semantics; Media; Correlation; Training; Dogs; Measurement; Cross-media retrieval; zero-shot learning; generative adversarial networks; maximum mean discrepancy; REPRESENTATION; RETRIEVAL;
DOI
10.1109/TCSVT.2019.2900171
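Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract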
Existing cross-media retrieval methods mainly assume that the training set covers all the categories in the testing set, so they lack the extensibility to retrieve data of new categories. Zero-shot cross-media retrieval is therefore a promising direction for practical applications: it aims to retrieve data of new categories (unseen categories) using only data of a limited set of known categories (seen categories) for training. The task is challenging because both the heterogeneous distributions across different media types and the inconsistent semantics across seen and unseen categories need to be handled. To address these issues, we propose the dual adversarial distribution network (DADN), which learns common embeddings and exploits the knowledge from word embeddings of different categories. The main contributions are as follows. First, a zero-shot cross-media dual generative adversarial network architecture is proposed, in which two kinds of generative adversarial networks (GANs), for common embedding generation and representation reconstruction, form dual processes. The dual GANs mutually promote each other to model semantic and underlying structure information, which generalizes across different categories on heterogeneous distributions and boosts correlation learning. Second, distribution matching with the maximum mean discrepancy criterion is proposed to combine with the dual GANs, which enhances distribution matching between common embeddings and category word embeddings. Finally, an adversarial inter-media metric constraint is proposed with an inter-media loss and a quadruplet loss, which further models the inter-media correlation information and improves semantic ranking ability. Experiments on four widely used cross-media datasets demonstrate the effectiveness of our DADN approach.
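The abstract names maximum mean discrepancy (MMD) as the criterion for matching the distribution of common embeddings to that of category word embeddings. The following is a minimal sketch of a generic biased squared-MMD estimate, not the DADN implementation: the Gaussian kernel, the `bandwidth` value, the function names, and the toy data are all assumptions made for illustration.

```python
import math
import random


def rbf_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel between two equal-length vectors (assumed kernel choice)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mmd_squared(xs, ys, bandwidth=1.0):
    """Biased estimate of the squared MMD between two samples of vectors."""
    def mean_kernel(ps, qs):
        # Average kernel value over all pairs drawn from ps and qs.
        total = sum(rbf_kernel(p, q, bandwidth) for p in ps for q in qs)
        return total / (len(ps) * len(qs))

    return mean_kernel(xs, xs) + mean_kernel(ys, ys) - 2.0 * mean_kernel(xs, ys)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical stand-ins for common embeddings and word embeddings.
    embeddings = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(50)]
    words_near = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(50)]
    words_far = [[rng.gauss(3.0, 1.0) for _ in range(4)] for _ in range(50)]
    print("MMD^2, similar distributions:  ", mmd_squared(embeddings, words_near))
    print("MMD^2, different distributions:", mmd_squared(embeddings, words_far))
```

In a training setting, a quantity of this kind would typically be minimized as a loss term so that the two sets of vectors become harder to tell apart. How DADN combines it with the dual GANs is not specified in this record.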
Pages: 1173-1187
Number of pages: 15
Related papers
50 in total
  • [41] Incremental Embedding Learning via Zero-Shot Translation
    Wei, Kun
    Deng, Cheng
    Yang, Xu
    Li, Maosen
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10254 - 10262
  • [42] ENCYCLOPEDIA ENHANCED SEMANTIC EMBEDDING FOR ZERO-SHOT LEARNING
    Jia, Zhen
    Zhang, Junge
    Huang, Kaiqi
    Tan, Tieniu
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1287 - 1291
  • [43] Transductive Zero-Shot Learning With Adaptive Structural Embedding
    Yu, Yunlong
    Ji, Zhong
    Guo, Jichang
    Pang, Yanwei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 4116 - 4127
  • [44] Zero-Shot Learning via Semantic Similarity Embedding
    Zhang, Ziming
    Saligrama, Venkatesh
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4166 - 4174
  • [45] Deep Unbiased Embedding Transfer for Zero-Shot Learning
    Jia, Zhen
    Zhang, Zhang
    Wang, Liang
    Shan, Caifeng
    Tan, Tieniu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1958 - 1971
  • [46] Semantic Contrastive Embedding for Generalized Zero-Shot Learning
    Han, Zongyan
    Fu, Zhenyong
    Chen, Shuo
    Yang, Jian
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (11) : 2606 - 2622
  • [47] Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
    Xu, Tingting
    Zhao, Ye
    Liu, Xueliang
    COMPLEXITY, 2021, 2021
  • [48] Towards Effective Deep Embedding for Zero-Shot Learning
    Zhang, Lei
    Wang, Peng
    Liu, Lingqiao
    Shen, Chunhua
    Wei, Wei
    Zhang, Yanning
    van den Hengel, Anton
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) : 2843 - 2852
  • [49] GNDAN: Graph Navigated Dual Attention Network for Zero-Shot Learning
    Chen, Shiming
    Hong, Ziming
    Xie, Guosen
    Peng, Qinmu
    You, Xinge
    Ding, Weiping
    Shao, Ling
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4516 - 4529
  • [50] Dual-stream generative adversarial networks for distributionally robust zero-shot learning
    Liu, Huan
    Yao, Lina
    Zheng, Qinghua
    Luo, Minnan
    Zhao, Hongke
    Lyu, Yanzhang
    INFORMATION SCIENCES, 2020, 519 : 407 - 422