Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing

被引：1

作者：

Yang, Xiaohan ^{[1
]}

Wang, Zhen ^{[1
,2
]}

Wu, Nannan ^{[1
]}

Li, Guokun ^{[1
]}

Feng, Chuang ^{[1
]}

Liu, Pingping ^{[2
]}

机构：

[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255000, Peoples R China

[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China

来源：

MATHEMATICS | 2022年 / 10卷 / 15期

基金：

中国国家自然科学基金;

关键词：

cross-modal retrieval; image-text retrieval; cross-modal similarity preserving; hashing algorithm; unsupervised learning; NETWORK; VGG-16;

D O I：

10.3390/math10152644

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

The image-text cross-modal retrieval task, which aims to retrieve the relevant image from text and vice versa, is now attracting widespread attention. To quickly respond to the large-scale task, we propose an Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing (DRNPH) to achieve cross-modal retrieval in the common Hamming space, which has the advantages of storage and efficiency. To fulfill the nearest neighbor search in the Hamming space, we demand to reconstruct both the original intra- and inter-modal neighbor matrix according to the binary feature vectors. Thus, we can compute the neighbor relationship among different modal samples directly based on the Hamming distances. Furthermore, the cross-modal pair-wise similarity preserving constraint requires the similar sample pair have an identical Hamming distance to the anchor. Therefore, the similar sample pairs own the same binary code, and they have minimal Hamming distances. Unfortunately, the pair-wise similarity preserving constraint may lead to an imbalanced code problem. Therefore, we propose the cross-modal triplet relative similarity preserving constraint, which demands the Hamming distances of similar pairs should be less than those of dissimilar pairs to distinguish the samples' ranking orders in the retrieval results. Moreover, a large similarity marginal can boost the algorithm's noise robustness. We conduct the cross-modal retrieval comparative experiments and ablation study on two public datasets, MIRFlickr and NUS-WIDE, respectively. The experimental results show that DRNPH outperforms the state-of-the-art approaches in various image-text retrieval scenarios, and all three proposed constraints are necessary and effective for boosting cross-modal retrieval performance.

引用

页数：17

共 50 条

[41] Multimodal Mutual Information Maximization: A Novel Approach for Unsupervised Deep Cross-Modal Hashing
Hoang, Tuan
Do, Thanh-Toan
Nguyen, Tam V.
Cheung, Ngai-Man
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6289 - 6302
[42] Clustering-driven Deep Adversarial Hashing for scalable unsupervised cross-modal retrieval
Shen, Xiao
Zhang, Haofeng
Li, Lunbo
Zhang, Zheng
Chen, Debao
Liu, Li
NEUROCOMPUTING, 2021, 459 : 152 - 164
[43] Fine-grained similarity semantic preserving deep hashing for cross-modal retrieval
Li, Guoyou
Peng, Qingjun
Zou, Dexu
Yang, Jinyue
Shu, Zhenqiu
FRONTIERS IN PHYSICS, 2023, 11
[44] Self-supervised deep semantics-preserving Hashing for cross-modal retrieval
Lu B.
Duan X.
Yuan Y.
Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (09): : 1442 - 1449
[45] Global and local semantics-preserving based deep hashing for cross-modal retrieval
Ma, Lei
Li, Hongliang
Meng, Fanman
Wu, Qingbo
Ngan, King Ngi
NEUROCOMPUTING, 2018, 312 : 49 - 62
[46] UNSUPERVISED CONTRASTIVE HASHING FOR CROSS-MODAL RETRIEVAL IN REMOTE SENSING
Mikriukov, Georgii
Ravanbakhsh, Mahdyar
Demir, Begum
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4463 - 4467
[47] Unsupervised Cross-Modal Hashing via Semantic Text Mining
Tu, Rong-Cheng
Mao, Xian-Ling
Lin, Qinghong
Ji, Wenjin
Qin, Weize
Wei, Wei
Huang, Heyan
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8946 - 8957
[48] Coupled CycleGAN: Unsupervised Hashing Network for Cross-Modal Retrieval
Li, Chao
Deng, Cheng
Wang, Lei
Xie, De
Liu, Xianglong
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 176 - 183
[49] Deep Feature-Based Neighbor Similarity Hashing With Adversarial Learning for Cross-Modal Retrieval
Li, Kun
Zhang, Yonghui
Wang, Feng
Liu, Guoxu
Wei, Xianmin
IEEE ACCESS, 2024, 12 : 128559 - 128569
[50] Joint-Modal Graph Convolutional Hashing for unsupervised cross-modal retrieval
Meng, Hui
Zhang, Huaxiang
Liu, Li
Liu, Dongmei
Lu, Xu
Guo, Xinru
NEUROCOMPUTING, 2024, 595

← 1 2 3 4 5 →