Unsupervised Contrastive Cross-Modal Hashing

被引:76
|
作者
Hu, Peng [1 ]
Zhu, Hongyuan [2 ]
Lin, Jie [2 ]
Peng, Dezhong [1 ,3 ,4 ]
Zhao, Yin-Ping [5 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
[3] Chengdu Ruibei Yingte Informat Technol Ltd Co, Chengdu 610094, Peoples R China
[4] Sichuan Zhiqian Technol Ltd Co, Chengdu 610094, Peoples R China
[5] Northwestern Polytech Univ, Sch Software, Xian 710072, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Semantics; Bridges; Optimization; Correlation; Task analysis; Degradation; Binary codes; Common hamming space; contrastive hashing network; cross-modal retrieval; unsupervised cross-modal hashing; NETWORK;
D O I
10.1109/TPAMI.2022.3177356
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study how to make unsupervised cross-modal hashing (CMH) benefit from contrastive learning (CL) by overcoming two challenges. To be exact, i) to address the performance degradation issue caused by binary optimization for hashing, we propose a novel momentum optimizer that performs hashing operation learnable in CL, thus making on-the-shelf deep cross-modal hashing possible. In other words, our method does not involve binary-continuous relaxation like most existing methods, thus enjoying better retrieval performance; ii) to alleviate the influence brought by false-negative pairs (FNPs), we propose a Cross-modal Ranking Learning loss (CRL) which utilizes the discrimination from all instead of only the hard negative pairs, where FNP refers to the within-class pairs that were wrongly treated as negative pairs. Thanks to such a global strategy, CRL endows our method with better performance because CRL will not overuse the FNPs while ignoring the true-negative pairs. To the best of our knowledge, the proposed method could be one of the first successful contrastive hashing methods. To demonstrate the effectiveness of the proposed method, we carry out experiments on five widely-used datasets compared with 13 state-of-the-art methods. The code is available at https://github.com/penghu-cs/UCCH.
引用
收藏
页码:3877 / 3889
页数:13
相关论文
共 50 条
  • [21] Joint-Modal Graph Convolutional Hashing for unsupervised cross-modal retrieval
    Meng, Hui
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Lu, Xu
    Guo, Xinru
    [J]. NEUROCOMPUTING, 2024, 595
  • [22] Unsupervised cross-modal hashing retrieval via Dynamic Contrast and Optimization
    Xie, Xiumin
    Li, Zhixin
    Li, Bo
    Zhang, Canlong
    Ma, Huifang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [23] Dark knowledge association guided hashing for unsupervised cross-modal retrieval
    Kang, Han
    Zhang, Xiaowei
    Han, Wenpeng
    Zhou, Mingliang
    [J]. Multimedia Systems, 2024, 30 (06)
  • [24] High-order nonlocal Hashing for unsupervised cross-modal retrieval
    Peng-Fei Zhang
    Yadan Luo
    Zi Huang
    Xin-Shun Xu
    Jingkuan Song
    [J]. World Wide Web, 2021, 24 : 563 - 583
  • [25] Cluster-wise unsupervised hashing for cross-modal similarity search
    Wang, Lu
    Yang, Jie
    Zareapoor, Masoumeh
    Zheng, Zhonglong
    [J]. PATTERN RECOGNITION, 2021, 111
  • [26] Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing
    Yang, Xiaohan
    Wang, Zhen
    Wu, Nannan
    Li, Guokun
    Feng, Chuang
    Liu, Pingping
    [J]. MATHEMATICS, 2022, 10 (15)
  • [27] Self-Attentive CLIP Hashing for Unsupervised Cross-Modal Retrieval
    Yu, Heng
    Ding, Shuyan
    Li, Lunbo
    Wu, Jiexin
    [J]. PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [28] Scalable Unsupervised Hashing via Exploiting Robust Cross-Modal Consistency
    Liu, Xingbo
    Li, Jiamin
    Nie, Xiushan
    Zhang, Xuening
    Wang, Shaohua
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (04) : 514 - 527
  • [29] High-order nonlocal Hashing for unsupervised cross-modal retrieval
    Zhang, Peng-Fei
    Luo, Yadan
    Huang, Zi
    Xu, Xin-Shun
    Song, Jingkuan
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (02): : 563 - 583
  • [30] Continuous cross-modal hashing
    Zheng, Hao
    Wang, Jinbao
    Zhen, Xiantong
    Song, Jingkuan
    Zheng, Feng
    Lu, Ke
    Qi, Guo-Jun
    [J]. PATTERN RECOGNITION, 2023, 142