Autoencoder-based self-supervised hashing for cross-modal retrieval

被引:2
|
作者
Li, Yifan [1 ]
Wang, Xuan [1 ]
Cui, Lei [1 ]
Zhang, Jiajia [1 ]
Huang, Chengkai [1 ]
Luo, Xuan [1 ]
Qi, Shuhan [1 ]
机构
[1] Harbin Inst Technol Shenzhen, Comp Sci & Technol, Shenzhen, Peoples R China
关键词
Cross-modal retrieval; Hash learning; Autoencoder; Self-supervised;
D O I
10.1007/s11042-020-09599-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-modal retrieval has gained lots of attention in the era of the multimedia data explosion. Taking advantage of low storage cost and fast retrieval speed, hash learning-based methods become more and more popular in this field. The crucial bottlenecks of cross-modal retrieval are twofold: the heterogeneous gap in different modalities and the semantic gap among similar data with various modalities. To address these issues, we adopt self-supervised fashion to bridge the heterogeneous gap by generating the cohesive features of different instances. To mitigate the semantic gap, we use triplet sampling to optimize the semantic loss in inter-modal and intra-modal, which increase the discriminability of our approach. Experimental on two benchmark datasets show the efficiency and robustness of our method, and the extended experiments show the scalability.
引用
收藏
页码:17257 / 17274
页数:18
相关论文
共 50 条
  • [1] Autoencoder-based self-supervised hashing for cross-modal retrieval
    Yifan Li
    Xuan Wang
    Lei Cui
    Jiajia Zhang
    Chengkai Huang
    Xuan Luo
    Shuhan Qi
    [J]. Multimedia Tools and Applications, 2021, 80 : 17257 - 17274
  • [2] Self-supervised incomplete cross-modal hashing retrieval
    Peng, Shouyong
    Yao, Tao
    Li, Ying
    Wang, Gang
    Wang, Lili
    Yan, Zhiming
    [J]. Expert Systems with Applications, 2025, 262
  • [3] Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
    Li, Chao
    Deng, Cheng
    Li, Ning
    Liu, Wei
    Gao, Xinbo
    Tao, Dacheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4242 - 4251
  • [4] Self-supervised deep semantics-preserving Hashing for cross-modal retrieval
    Lu, Bo
    Duan, Xiaodong
    Yuan, Ye
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (09): : 1442 - 1449
  • [5] Self-supervised learning-based weight adaptive hashing for fast cross-modal retrieval
    Yifan Li
    Xuan Wang
    Shuhan Qi
    Chengkai Huang
    Zoe. L Jiang
    Qing Liao
    Jian Guan
    Jiajia Zhang
    [J]. Signal, Image and Video Processing, 2021, 15 : 673 - 680
  • [6] Self-supervised learning-based weight adaptive hashing for fast cross-modal retrieval
    Li, Yifan
    Wang, Xuan
    Qi, Shuhan
    Huang, Chengkai
    Jiang, Zoe L.
    Liao, Qing
    Guan, Jian
    Zhang, Jiajia
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (04) : 673 - 680
  • [7] Graph Convolutional Network Semantic Enhancement Hashing for Self-supervised Cross-Modal Retrieval
    Hu, Jinyu
    Li, Mingyong
    Zhang, Jiayan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 410 - 422
  • [8] Self-Supervised Cluster-Contrast Distillation Hashing Network for Cross-Modal Retrieval
    Sun, Haoxuan
    Cao, Yudong
    Liu, Guangyuan
    [J]. IEEE ACCESS, 2023, 11 : 96584 - 96593
  • [9] Self-Supervised Correlation Learning for Cross-Modal Retrieval
    Liu, Yaxin
    Wu, Jianlong
    Qu, Leigang
    Gan, Tian
    Yin, Jianhua
    Nie, Liqiang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2851 - 2863
  • [10] Self-Supervised Visual Representations for Cross-Modal Retrieval
    Patel, Yash
    Gomez, Lluis
    Rusinol, Marcal
    Karatzas, Dimosthenis
    Jawahar, C., V
    [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 182 - 186