Self-Supervised Cluster-Contrast Distillation Hashing Network for Cross-Modal Retrieval

被引:0
|
作者
Sun, Haoxuan [1 ]
Cao, Yudong [1 ]
Liu, Guangyuan [2 ]
机构
[1] Liaoning Univ Technol, Sch Elect & Informat Engn, Jinzhou 121001, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 1116024, Peoples R China
关键词
Hashing; cross-modal; contrastive learning;
D O I
10.1109/ACCESS.2023.3308931
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional cross-modal hash models enable efficient and fast retrieval between multimodal data by training high-quality hash representations. The key to the cross-modal hashing model is feature extraction. However, the quality of the features largely depends on the semantic similarity between the multi-modal data, and the existing methods do not effectively utilize the semantic information between the data. In this paper, we attempt to explore the semantic information inherent within the data using contrastive learning. Specifically, we propose a end-to-end cluster-level contrastive learning method (SCCDH) for cross-modal hashing. The method utilizes the clustering results to guide feature learning in an appropriately designed contrast framework. In SCCDH, feature-level and hash cluster-level contrastive learning are used to help the model learn discriminative features among multimodal data. In addition, we propose a distillation filtering method to filter out a large amount of noise in the data. Extensive experiments were conducted on the MIRFLICKR-25K, NUS-WIDE, and MS-COCO datasets, and the results demonstrate that the proposed method outperformed several state-of-the-art methods.
引用
收藏
页码:96584 / 96593
页数:10
相关论文
共 50 条
  • [1] Self-supervised incomplete cross-modal hashing retrieval
    Peng, Shouyong
    Yao, Tao
    Li, Ying
    Wang, Gang
    Wang, Lili
    Yan, Zhiming
    [J]. Expert Systems with Applications, 2025, 262
  • [2] Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
    Li, Chao
    Deng, Cheng
    Li, Ning
    Liu, Wei
    Gao, Xinbo
    Tao, Dacheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4242 - 4251
  • [3] Graph Convolutional Network Semantic Enhancement Hashing for Self-supervised Cross-Modal Retrieval
    Hu, Jinyu
    Li, Mingyong
    Zhang, Jiayan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 410 - 422
  • [4] Autoencoder-based self-supervised hashing for cross-modal retrieval
    Li, Yifan
    Wang, Xuan
    Cui, Lei
    Zhang, Jiajia
    Huang, Chengkai
    Luo, Xuan
    Qi, Shuhan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 17257 - 17274
  • [5] Autoencoder-based self-supervised hashing for cross-modal retrieval
    Yifan Li
    Xuan Wang
    Lei Cui
    Jiajia Zhang
    Chengkai Huang
    Xuan Luo
    Shuhan Qi
    [J]. Multimedia Tools and Applications, 2021, 80 : 17257 - 17274
  • [6] Self-supervised deep semantics-preserving Hashing for cross-modal retrieval
    Lu B.
    Duan X.
    Yuan Y.
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (09): : 1442 - 1449
  • [7] Self-Supervised Correlation Learning for Cross-Modal Retrieval
    Liu, Yaxin
    Wu, Jianlong
    Qu, Leigang
    Gan, Tian
    Yin, Jianhua
    Nie, Liqiang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2851 - 2863
  • [8] Self-Supervised Visual Representations for Cross-Modal Retrieval
    Patel, Yash
    Gomez, Lluis
    Rusinol, Marcal
    Karatzas, Dimosthenis
    Jawahar, C., V
    [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 182 - 186
  • [9] Deep Self-Supervised Hashing With Fine-Grained Similarity Mining for Cross-Modal Retrieval
    Han, Lijun
    Wang, Renlin
    Chen, Chunlei
    Zhang, Huihui
    Zhang, Yujie
    Zhang, Wenfeng
    [J]. IEEE ACCESS, 2024, 12 : 31756 - 31770
  • [10] Self-supervised learning-based weight adaptive hashing for fast cross-modal retrieval
    Yifan Li
    Xuan Wang
    Shuhan Qi
    Chengkai Huang
    Zoe. L Jiang
    Qing Liao
    Jian Guan
    Jiajia Zhang
    [J]. Signal, Image and Video Processing, 2021, 15 : 673 - 680