Structures Aware Fine-Grained Contrastive Adversarial Hashing for Cross-Media Retrieval

被引:0
|
作者
Liang, Meiyu [1 ]
Li, Yawen [2 ]
Yu, Yang [1 ]
Cao, Xiaowen [1 ]
Xue, Zhe [1 ]
Li, Ang [1 ]
Lu, Kangkang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing Key Lab Intelligent Telecommun Software &, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Contrastive adversarial hashing; cross-media retrieval; cross-media contrastive learning; graph convolutional network; NETWORK;
D O I
10.1109/TKDE.2024.3356258
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep cross-media hashing provides an efficient semantic representation learning solution for large-scale cross-media retrieval. The existing methods only consider the inter-media or intra-media semantic association learning, ignore the guiding of semantic structure information, and have weak reasoning ability for implicit fine-grained semantic associations. To tackle this problem, we propose a novel structures aware fine-grained contrastive adversarial hashing method for cross-media retrieval. A novel cross-media contrastive adversarial hash network is constructed for the first time, which integrates the cross-media and intra-media contrastive learning and multi-modal adversarial learning, aiming at maximizing the semantic association between different modalities, and improving the semantic discrimination and consistency of cross-media unified hash representation, thereby the inter-media and intra-media semantic preserving ability can be well enhanced; A fine-grained cross-media semantic feature learning method based on fine-grained semantic reasoning with transformers is proposed, which captures fine-grained salient features of different modalities for semantic association learning, and enhances the reasoning ability of fine-grained implicit semantic association; A semantic label graph convolutional network guided cross-media semantic association learning strategy is proposed, which makes full use of semantic structure information to enhance the learning ability of implicit cross-media semantic associations. Extensive experiments on several large-scale cross-media benchmark datasets demonstrate that the proposed method outperforms the state-of-the-art methods.
引用
收藏
页码:3514 / 3528
页数:15
相关论文
共 50 条
  • [1] A New Benchmark and Approach for Fine-grained Cross-media Retrieval
    He, Xiangteng
    Peng, Yuxin
    Xie, Liu
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1740 - 1748
  • [2] Cross-media retrieval by exploiting fine-grained correlation at entity level
    Huang, Lei
    Peng, Yuxin
    [J]. NEUROCOMPUTING, 2017, 236 : 123 - 133
  • [3] DBFC-Net: a uniform framework for fine-grained cross-media retrieval
    Qiong Wang
    Youdong Guo
    Yazhou Yao
    [J]. Multimedia Systems, 2022, 28 : 423 - 432
  • [4] DBFC-Net: a uniform framework for fine-grained cross-media retrieval
    Wang, Qiong
    Guo, Youdong
    Yao, Yazhou
    [J]. MULTIMEDIA SYSTEMS, 2022, 28 (02) : 423 - 432
  • [5] Cross-media Deep Fine-grained Correlation Learning
    Zhuo Y.-K.
    Qi J.-W.
    Peng Y.-X.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2019, 30 (04): : 884 - 895
  • [6] Supervised Coarse-to-Fine Semantic Hashing for cross-media retrieval
    Yao, Tao
    Kong, Xiangwei
    Fu, Haiyan
    Tian, Qi
    [J]. DIGITAL SIGNAL PROCESSING, 2017, 63 : 135 - 144
  • [7] Deep Saliency Hashing for Fine-Grained Retrieval
    Jin, Sheng
    Yao, Hongxun
    Sun, Xiaoshuai
    Zhou, Shangchen
    Zhang, Lei
    Hua, Xiansheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5336 - 5351
  • [8] Unsupervised Deep Hashing With Fine-Grained Similarity-Preserving Contrastive Learning for Image Retrieval
    Cao, Hu
    Huang, Lei
    Nie, Jie
    Wei, Zhiqiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 4095 - 4108
  • [9] Self-Attention based fine-grained cross-media hybrid network
    Shan, Wei
    Huang, Dan
    Wang, Jiangtao
    Zou, Feng
    Li, Suwen
    [J]. PATTERN RECOGNITION, 2022, 130
  • [10] Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network
    Liang, Meiyu
    Du, Junping
    Liu, Wu
    Xue, Zhe
    Geng, Yue
    Yang, Congxian
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1313 - 1321