Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval

被引:363
|
作者
Li, Chao [1 ]
Deng, Cheng [1 ]
Li, Ning [1 ]
Liu, Wei [2 ]
Gao, Xinbo [1 ]
Tao, Dacheng [3 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Shaanxi, Peoples R China
[2] Tencent AI Lab, Shenzhen, Peoples R China
[3] Univ Sydney, UBTECH Sydney AI Ctr, SIT, FEIT, Sydney, NSW, Australia
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR.2018.00446
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
UThanks to the success of deep learning, cross-modal retrieval has made significant progress recently. However, there still remains a crucial bottleneck: how to bridge the modality gap to further enhance the retrieval accuracy. In this paper, we propose a self-supervised adversarial hashing (SSAH) approach, which lies among the early attempts to incorporate adversarial learning into cross-modal hashing in a self-supervised fashion. The primary contribution of this work is that two adversarial networks are leveraged to maximize the semantic correlation and consistency of the representations between different modalities. In addition, we harness a self-supervised semantic network to discover high-level semantic information in the form of multi-label annotations. Such information guides the feature learning process and preserves the modality relationships in both the common semantic space and the Hamming space. Extensive experiments carried out on three benchmark datasets validate that the proposed SSAH surpasses the state-of-the-art methods.
引用
下载
收藏
页码:4242 / 4251
页数:10
相关论文
共 50 条
  • [31] Deep supervised fused similarity hashing for cross-modal retrieval
    Ng W.W.Y.
    Xu Y.
    Tian X.
    Wang H.
    Multimedia Tools and Applications, 2024, 83 (39) : 86537 - 86555
  • [32] Multi-label enhancement based self-supervised deep cross-modal hashing
    Zou, Xitao
    Wu, Song
    Bakker, Erwin M.
    Wang, Xinzhi
    NEUROCOMPUTING, 2022, 467 : 138 - 162
  • [33] Multi-label enhancement based self-supervised deep cross-modal hashing
    Zou, Xitao
    Wu, Song
    Bakker, Erwin M.
    Wang, Xinzhi
    Neurocomputing, 2022, 467 : 138 - 162
  • [34] Separated Variational Hashing Networks for Cross-Modal Retrieval
    Hu, Peng
    Wang, Xu
    Zhen, Liangli
    Peng, Dezhong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1721 - 1729
  • [35] Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
    Salvador, Amaia
    Gundogdu, Erhan
    Bazzani, Loris
    Donoser, Michael
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15470 - 15479
  • [36] A NOVEL SELF-SUPERVISED CROSS-MODAL IMAGE RETRIEVAL METHOD IN REMOTE SENSING
    Sumbul, Gencer
    Mueller, Markus
    Demir, Beguem
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2426 - 2430
  • [37] AN ADVERSARIAL AND DEEP HASHING-BASED HIERARCHICAL SUPERVISED CROSS-MODAL IMAGE AND TEXT RETRIEVAL ALGORITHM
    Chen, Ruidong
    Qiang, Baohua
    Zhou, Mingliang
    Zhang, Shihao
    Zheng, Hong
    Tang, Chenghua
    International Journal of Robotics and Automation, 2024, 39 (01): : 77 - 86
  • [38] Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval
    Ma, Xinhong
    Zhang, Tianzhu
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (12) : 3101 - 3114
  • [39] Discriminant Adversarial Hashing Transformer for Cross-modal Vessel Image Retrieval
    Guan X.
    Guo J.
    Lu Y.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (12): : 4411 - 4420
  • [40] Attention-Aware Deep Adversarial Hashing for Cross-Modal Retrieval
    Zhang, Xi
    Lai, Hanjiang
    Feng, Jiashi
    COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 614 - 629