Autoencoder-based self-supervised hashing for cross-modal retrieval

Cited by: 2

Authors:

Li, Yifan [1]

Wang, Xuan [1]

Cui, Lei [1]

Zhang, Jiajia [1]

Huang, Chengkai [1]

Luo, Xuan [1]

Qi, Shuhan [1]

Affiliation:

[1] Harbin Inst Technol Shenzhen, Comp Sci & Technol, Shenzhen, Peoples R China

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2021, Vol. 80, No. 11

Keywords:

Cross-modal retrieval; Hash learning; Autoencoder; Self-supervised

DOI:

10.1007/s11042-020-09599-7

Chinese Library Classification:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Cross-modal retrieval has attracted considerable attention in the era of the multimedia data explosion. Thanks to their low storage cost and fast retrieval speed, hash learning-based methods have become increasingly popular in this field. Cross-modal retrieval faces two crucial bottlenecks: the heterogeneous gap between different modalities and the semantic gap among similar data of various modalities. To bridge the heterogeneous gap, we adopt a self-supervised approach that generates cohesive features for different instances. To mitigate the semantic gap, we use triplet sampling to optimize the semantic loss in both the inter-modal and intra-modal settings, which increases the discriminability of our approach. Experiments on two benchmark datasets show the efficiency and robustness of our method, and extended experiments show its scalability.
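
The abstract mentions triplet sampling over inter-modal and intra-modal pairs but does not give the loss itself. As a rough illustration only, the sketch below shows a generic margin-based triplet loss on binary hash codes compared by Hamming distance. It is not the paper's formulation: the function names, the margin value, and the 8-bit codes are all assumptions made for this example.

```python
from typing import Sequence

Code = Sequence[int]  # a hash code with entries in {-1, +1}


def hamming(a: Code, b: Code) -> int:
    """Number of positions where two equal-length codes differ."""
    if len(a) != len(b):
        raise ValueError("codes must have the same length")
    return sum(1 for x, y in zip(a, b) if x != y)


def triplet_loss(anchor: Code, positive: Code, negative: Code,
                 margin: float = 2.0) -> float:
    """Hinge loss: zero once the negative is at least `margin` bits
    farther from the anchor than the positive is."""
    return max(0.0, hamming(anchor, positive) - hamming(anchor, negative) + margin)


# Hypothetical 8-bit codes, invented for illustration (not from the paper).
image_anchor = [1, 1, -1, 1, -1, -1, 1, 1]
text_positive = [1, 1, -1, 1, -1, 1, 1, 1]      # same semantics, other modality
text_negative = [-1, -1, 1, 1, -1, -1, -1, 1]   # different semantics, other modality
image_positive = [1, 1, -1, -1, -1, -1, 1, 1]   # same semantics, same modality
image_negative = [1, -1, -1, 1, 1, -1, 1, 1]    # different semantics, same modality

# Inter-modal term: image anchor against text codes.
inter_modal = triplet_loss(image_anchor, text_positive, text_negative)
# Intra-modal term: image anchor against other image codes.
intra_modal = triplet_loss(image_anchor, image_positive, image_negative)
total = inter_modal + intra_modal
```

In this toy setup the inter-modal triplet already satisfies the margin, so only the intra-modal triplet contributes to the total. A trainable version would typically replace the discrete codes with a continuous relaxation so that gradients can flow.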


Pages: 17257-17274

Page count: 18
