Unsupervised cross-modal hashing retrieval via Dynamic Contrast and Optimization

Authors
Xie, Xiumin [1, 2]
Li, Zhixin [1, 2]
Li, Bo [1, 2]
Zhang, Canlong [1, 2]
Ma, Huifang [3]
Affiliations
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
[3] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Unsupervised cross-modal hashing retrieval; Contrastive learning; Adversarial learning; Cross-modal ranking learning; Dynamic optimization
DOI
10.1016/j.engappai.2024.108969
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Cross-modal hashing encodes multimodal data into a common binary space in which correlations between cross-modal instances can be measured efficiently. However, most existing cross-modal hashing retrieval methods struggle to handle the heterogeneity between modalities, and their performance drops because the binary codes cannot be learned during binary hash optimization. To solve these problems, we propose a Dynamic Contrast and Optimization (DCO) method for unsupervised cross-modal hashing retrieval, which implements an adaptive hash optimizer to strengthen the consistency of each modality's representation and maintain the correlations between different modalities. Specifically, we propose a novel adaptive memory optimization mechanism that enables the memory unit to learn and optimize adaptively, memorize during dynamic learning, and learn from memory, thereby narrowing the gap between the original features and the binary representations. Furthermore, we combine cross-modal ranking learning with adversarial learning. This not only ensures the modality invariance of correlated binary codes but also generates continuous values that more closely approximate the discrete binary codes. To verify the effectiveness of the proposed method, we conduct a series of experiments on three widely used benchmark datasets. The experimental results demonstrate the superiority of the proposed method over several state-of-the-art methods.
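As a rough illustration of the first sentence of the abstract only, the sketch below shows how items from two modalities can be compared once both are mapped into a common binary space. It is not the DCO method: the feature values are made up, the sign-based binarization is a generic assumption, and the helper names (binarize, hamming, retrieve) are hypothetical.

```python
from typing import List, Sequence


def binarize(features: Sequence[float]) -> int:
    """Pack sign(features) into an integer: bit i is 1 when features[i] > 0."""
    code = 0
    for i, value in enumerate(features):
        if value > 0:
            code |= 1 << i
    return code


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two binary codes differ."""
    return bin(a ^ b).count("1")


def retrieve(query_code: int, database_codes: Sequence[int], top_k: int) -> List[int]:
    """Return indices of the top_k database codes closest to the query code."""
    order = sorted(
        range(len(database_codes)),
        key=lambda i: hamming(query_code, database_codes[i]),
    )
    return order[:top_k]


# Hypothetical continuous encoder outputs (assumed values, not from the paper).
image_features = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.8, -0.1, 0.3],
    [0.6, 0.1, -0.9, -0.4],
]
text_query = [0.7, -0.3, 0.2, -0.6]

image_codes = [binarize(f) for f in image_features]
ranking = retrieve(binarize(text_query), image_codes, top_k=2)
print(ranking)  # indices of the two images nearest to the text query
```

Because the comparison reduces to an XOR and a bit count, a text query can be ranked against image codes without touching the original high-dimensional features, which is where the efficiency mentioned in the abstract comes from.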
Pages: 13