Unsupervised cross-modal hashing retrieval via Dynamic Contrast and Optimization

被引:0
|
作者
Xie, Xiumin [1 ,2 ]
Li, Zhixin [1 ,2 ]
Li, Bo [1 ,2 ]
Zhang, Canlong [1 ,2 ]
Ma, Huifang [3 ]
机构
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
[3] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Peoples R China
基金
中国国家自然科学基金;
关键词
Unsupervised cross-modal hashing retrieval; Contrastive learning; Adversarial learning; Cross-modal ranking learning; Dynamic optimization;
D O I
10.1016/j.engappai.2024.108969
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-modal hashing encodes multimodal data into a common binary space, which can efficiently measure correlations between cross-modal instances. However, most existing cross-modal hashing retrieval methods are difficult to handle the heterogeneity problem between different modalities, and the performance drops because the binary code cannot be learned in the process of hash binary optimization. To solve these problems, we propose a Dynamic Contrast and Optimization (DCO) method for unsupervised cross-modal hashing retrieval, which implements an adaptive hash optimizer to strengthen the consistency of each modal representation and maintain the correlations between different modalities. Specifically, we propose a novel adaptive memory optimization mechanism. It enables the memory unit to learn and optimize adaptively, memorize in dynamic learning, and learn from memory, thereby narrowing the gap between original features and binary representations. Furthermore, we combine cross-modal ranking learning and adversarial learning. This not only ensures the modal invariance of correlated binary codes, but also allows for better approximation of generating continuous values close to discrete binary codes. To verify the effectiveness of the proposed method, we conduct a series of experiments on three widely used benchmark datasets. Through experimental results, we demonstrate the superiority of the proposed method in comparison with some state-of-the-art methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Unsupervised Multi-modal Hashing for Cross-Modal Retrieval
    Jun Yu
    Xiao-Jun Wu
    Donglin Zhang
    [J]. Cognitive Computation, 2022, 14 : 1159 - 1171
  • [2] Unsupervised Multi-modal Hashing for Cross-Modal Retrieval
    Yu, Jun
    Wu, Xiao-Jun
    Zhang, Donglin
    [J]. COGNITIVE COMPUTATION, 2022, 14 (03) : 1159 - 1171
  • [3] Robust Unsupervised Cross-modal Hashing for Multimedia Retrieval
    Cheng, Miaomiao
    Jing, Liping
    Ng, Michael K.
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 38 (03)
  • [4] Unsupervised Deep Imputed Hashing for Partial Cross-modal Retrieval
    Chen, Dong
    Cheng, Miaomiao
    Min, Chen
    Jing, Liping
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] UNSUPERVISED CONTRASTIVE HASHING FOR CROSS-MODAL RETRIEVAL IN REMOTE SENSING
    Mikriukov, Georgii
    Ravanbakhsh, Mahdyar
    Demir, Begum
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4463 - 4467
  • [6] Deep Unsupervised Momentum Contrastive Hashing for Cross-modal Retrieval
    Lu, Kangkang
    Yu, Yanhua
    Liang, Meiyu
    Zhang, Min
    Cao, Xiaowen
    Zhao, Zehua
    Yin, Mengran
    Xue, Zhe
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 126 - 131
  • [7] Coupled CycleGAN: Unsupervised Hashing Network for Cross-Modal Retrieval
    Li, Chao
    Deng, Cheng
    Wang, Lei
    Xie, De
    Liu, Xianglong
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 176 - 183
  • [8] Adaptive Graph Attention Hashing for Unsupervised Cross-Modal Retrieval via Multimodal Transformers
    Li, Yewen
    Ge, Mingyuan
    Ji, Yucheng
    Li, Mingyong
    [J]. WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 1 - 15
  • [9] Joint-Modal Graph Convolutional Hashing for unsupervised cross-modal retrieval
    Meng, Hui
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Lu, Xu
    Guo, Xinru
    [J]. NEUROCOMPUTING, 2024, 595
  • [10] Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval
    Li, Mingyong
    Wang, Hongya
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 183 - 191