Deep Semantic Correlation with Adversarial Learning for Cross-Modal Retrieval

Authors
Hua, Yan [1]
Du, Jianhe [1]
Affiliations
[1] Commun Univ China, Sch Informat & Commun Engn, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Adversarial network; Deep learning; Correlation learning; Cross-modal retrieval
DOI
10.1109/iceiec.2019.8784597
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Correlation learning typically maps heterogeneous data into a common subspace to enable cross-modal retrieval. Thanks to the success of deep learning in recent years, cross-modal retrieval performance has improved considerably. However, bridging the modality gap remains the key problem. In this paper, we propose a deep semantic correlation learning method with a generative adversarial network to handle cross-modal data annotated with multiple labels. With adversarial learning, the generative network tries to produce common semantic representations for the image and text modalities, while the discriminative model tries to identify the differences between them. In addition, we propose a classification loss, applicable to one or multiple categories, for semantic subspace learning to improve cross-modal retrieval. The adversarial network and the classification network are jointly optimized. Experiments on two widely used datasets verify the effectiveness of the proposed model.
Pages: 252 - 255
Number of pages: 4
Related papers
  • [1] Adversarial Learning-Based Semantic Correlation Representation for Cross-Modal Retrieval
    Zhu, Lei
    Song, Jiayu
    Zhu, Xiaofeng
    Zhang, Chengyuan
    Zhang, Shichao
    Yuan, Xinpan
    [J]. IEEE MULTIMEDIA, 2020, 27 (04) : 79 - 90
  • [2] Deep semantic similarity adversarial hashing for cross-modal retrieval
    Qiang, Haopeng
    Wan, Yuan
    Xiang, Lun
    Meng, Xiaojing
    [J]. NEUROCOMPUTING, 2020, 400 : 24 - 33
  • [3] Deep adversarial metric learning for cross-modal retrieval
    Xu, Xing
    He, Li
    Lu, Huimin
    Gao, Lianli
    Ji, Yanli
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02) : 657 - 672
  • [4] Deep adversarial metric learning for cross-modal retrieval
    Xing Xu
    Li He
    Huimin Lu
    Lianli Gao
    Yanli Ji
    [J]. World Wide Web, 2019, 22 : 657 - 672
  • [5] Deep Semantic Correlation Learning based Hashing for Multimedia Cross-Modal Retrieval
    Gong, Xiaolong
    Huang, Linpeng
    Wang, Fuwei
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018 : 117 - 126
  • [6] Modal-adversarial Semantic Learning Network for Extendable Cross-modal Retrieval
    Xu, Xing
    Song, Jingkuan
    Lu, Huimin
    Yang, Yang
    Shen, Fumin
    Huang, Zi
    [J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018 : 46 - 54
  • [7] Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval
    Meng, Min
    Sun, Jiaxuan
    Liu, Jigang
    Yu, Jun
    Wu, Jigang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1914 - 1926
  • [8] Analyzing semantic correlation for cross-modal retrieval
    Liang Xie
    Peng Pan
    Yansheng Lu
    [J]. Multimedia Systems, 2015, 21 : 525 - 539
  • [9] Analyzing semantic correlation for cross-modal retrieval
    Xie, Liang
    Pan, Peng
    Lu, Yansheng
    [J]. MULTIMEDIA SYSTEMS, 2015, 21 (06) : 525 - 539
  • [10] Deep Semantic Mapping for Cross-Modal Retrieval
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    [J]. 2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015 : 234 - 241