Unsupervised Cross-Modal Hashing With Modality-Interaction

被引:13
|
作者
Tu, Rong-Cheng [1 ,2 ]
Jiang, Jie [3 ]
Lin, Qinghong [4 ]
Cai, Chengfei [3 ]
Tian, Shangxuan [3 ]
Wang, Hongfa [3 ]
Liu, Wei [3 ]
机构
[1] Tencent, Shenzhen 518100, Peoples R China
[2] Beijing Inst Technol, Dept Comp Sci & Technol, Beijing 100081, Peoples R China
[3] Tencent Data Platform, Shenzhen 518051, Guangdong, Peoples R China
[4] Natl Univ Singapore, Elect & Comp Engn, Singapore 138600, Singapore
关键词
Cross-modal Retrieval; Hashing; Modality-interaction; Bit-selection; ATTENTION; NETWORK;
D O I
10.1109/TCSVT.2023.3251395
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, numerous unsupervised cross-modal hashing methods have been proposed to deal the image-text retrieval tasks for the unlabeled cross-modal data. However, when these methods learn to generate hash codes, almost all of them lack modality-interaction in the following two aspects: 1) The instance similarity matrix used to guide the hashing networks training is constructed without image-text interaction, which fails to capture the fine-grained cross-modal cues to elaborately characterize the intrinsic semantic similarity among the datapoints. 2) The binary codes used for quantization loss are inferior because they are generated by directly quantizing a simple combination of continuous hash codes from different modalities without the interaction among these continuous hash codes. Such problems will cause the generated hash codes to be of poor quality and degrade the retrieval performance. Hence, in this paper, we propose a novel Unsupervised Cross-modal Hashing with Modality-interaction, termed UCHM. Specifically, by optimizing a novel hash-similarity-friendly loss, a modality-interaction-enabled (MIE) similarity generator is first trained to generate a superior MIE similarity matrix for the training set. Then, the generated MIE similarity matrix is utilized as guiding information to train the deep hashing networks. Furthermore, during the process of training the hashing networks, a novel bit-selection module is proposed to generate high-quality unified binary codes for the quantization loss with the interaction among continuous codes from different modalities, thereby further enhancing the retrieval performance. Extensive experiments on two widely used datasets show that the proposed UCHM outperforms state-of-the-art techniques on cross-modal retrieval tasks.
引用
收藏
页码:5296 / 5308
页数:13
相关论文
共 50 条
  • [21] MODALITY-SPECIFIC STRUCTURE PRESERVING HASHING FOR CROSS-MODAL RETRIEVAL
    Liu, Xingbo
    Nie, Xiushan
    Sun, Haoliang
    Cui, Chaoran
    Yin, Yilong
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1678 - 1682
  • [22] Unsupervised cross-modal hashing retrieval via Dynamic Contrast and Optimization
    Xie, Xiumin
    Li, Zhixin
    Li, Bo
    Zhang, Canlong
    Ma, Huifang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [23] High-order nonlocal Hashing for unsupervised cross-modal retrieval
    Peng-Fei Zhang
    Yadan Luo
    Zi Huang
    Xin-Shun Xu
    Jingkuan Song
    World Wide Web, 2021, 24 : 563 - 583
  • [24] Dark knowledge association guided hashing for unsupervised cross-modal retrieval
    Kang, Han
    Zhang, Xiaowei
    Han, Wenpeng
    Zhou, Mingliang
    Multimedia Systems, 2024, 30 (06)
  • [25] Self-Attentive CLIP Hashing for Unsupervised Cross-Modal Retrieval
    Yu, Heng
    Ding, Shuyan
    Li, Lunbo
    Wu, Jiexin
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [26] Scalable Unsupervised Hashing via Exploiting Robust Cross-Modal Consistency
    Liu, Xingbo
    Li, Jiamin
    Nie, Xiushan
    Zhang, Xuening
    Wang, Shaohua
    Yin, Yilong
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (04) : 514 - 527
  • [27] Cluster-wise unsupervised hashing for cross-modal similarity search
    Wang, Lu
    Yang, Jie
    Zareapoor, Masoumeh
    Zheng, Zhonglong
    PATTERN RECOGNITION, 2021, 111
  • [28] Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing
    Yang, Xiaohan
    Wang, Zhen
    Wu, Nannan
    Li, Guokun
    Feng, Chuang
    Liu, Pingping
    MATHEMATICS, 2022, 10 (15)
  • [29] High-order nonlocal Hashing for unsupervised cross-modal retrieval
    Zhang, Peng-Fei
    Luo, Yadan
    Huang, Zi
    Xu, Xin-Shun
    Song, Jingkuan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (02): : 563 - 583
  • [30] Structure-aware contrastive hashing for unsupervised cross-modal retrieval
    Cui, Jinrong
    He, Zhipeng
    Huang, Qiong
    Fu, Yulu
    Li, Yuting
    Wen, Jie
    NEURAL NETWORKS, 2024, 174