Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing

被引:1
|
作者
Yang, Xiaohan [1 ]
Wang, Zhen [1 ,2 ]
Wu, Nannan [1 ]
Li, Guokun [1 ]
Feng, Chuang [1 ]
Liu, Pingping [2 ]
机构
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255000, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
基金
中国国家自然科学基金;
关键词
cross-modal retrieval; image-text retrieval; cross-modal similarity preserving; hashing algorithm; unsupervised learning; NETWORK; VGG-16;
D O I
10.3390/math10152644
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The image-text cross-modal retrieval task, which aims to retrieve the relevant image from text and vice versa, is now attracting widespread attention. To quickly respond to the large-scale task, we propose an Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing (DRNPH) to achieve cross-modal retrieval in the common Hamming space, which has the advantages of storage and efficiency. To fulfill the nearest neighbor search in the Hamming space, we demand to reconstruct both the original intra- and inter-modal neighbor matrix according to the binary feature vectors. Thus, we can compute the neighbor relationship among different modal samples directly based on the Hamming distances. Furthermore, the cross-modal pair-wise similarity preserving constraint requires the similar sample pair have an identical Hamming distance to the anchor. Therefore, the similar sample pairs own the same binary code, and they have minimal Hamming distances. Unfortunately, the pair-wise similarity preserving constraint may lead to an imbalanced code problem. Therefore, we propose the cross-modal triplet relative similarity preserving constraint, which demands the Hamming distances of similar pairs should be less than those of dissimilar pairs to distinguish the samples' ranking orders in the retrieval results. Moreover, a large similarity marginal can boost the algorithm's noise robustness. We conduct the cross-modal retrieval comparative experiments and ablation study on two public datasets, MIRFlickr and NUS-WIDE, respectively. The experimental results show that DRNPH outperforms the state-of-the-art approaches in various image-text retrieval scenarios, and all three proposed constraints are necessary and effective for boosting cross-modal retrieval performance.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Multimodal Mutual Information Maximization: A Novel Approach for Unsupervised Deep Cross-Modal Hashing
    Hoang, Tuan
    Do, Thanh-Toan
    Nguyen, Tam V.
    Cheung, Ngai-Man
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6289 - 6302
  • [42] Clustering-driven Deep Adversarial Hashing for scalable unsupervised cross-modal retrieval
    Shen, Xiao
    Zhang, Haofeng
    Li, Lunbo
    Zhang, Zheng
    Chen, Debao
    Liu, Li
    NEUROCOMPUTING, 2021, 459 : 152 - 164
  • [43] Fine-grained similarity semantic preserving deep hashing for cross-modal retrieval
    Li, Guoyou
    Peng, Qingjun
    Zou, Dexu
    Yang, Jinyue
    Shu, Zhenqiu
    FRONTIERS IN PHYSICS, 2023, 11
  • [44] Self-supervised deep semantics-preserving Hashing for cross-modal retrieval
    Lu B.
    Duan X.
    Yuan Y.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (09): : 1442 - 1449
  • [45] Global and local semantics-preserving based deep hashing for cross-modal retrieval
    Ma, Lei
    Li, Hongliang
    Meng, Fanman
    Wu, Qingbo
    Ngan, King Ngi
    NEUROCOMPUTING, 2018, 312 : 49 - 62
  • [46] UNSUPERVISED CONTRASTIVE HASHING FOR CROSS-MODAL RETRIEVAL IN REMOTE SENSING
    Mikriukov, Georgii
    Ravanbakhsh, Mahdyar
    Demir, Begum
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4463 - 4467
  • [47] Unsupervised Cross-Modal Hashing via Semantic Text Mining
    Tu, Rong-Cheng
    Mao, Xian-Ling
    Lin, Qinghong
    Ji, Wenjin
    Qin, Weize
    Wei, Wei
    Huang, Heyan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8946 - 8957
  • [48] Coupled CycleGAN: Unsupervised Hashing Network for Cross-Modal Retrieval
    Li, Chao
    Deng, Cheng
    Wang, Lei
    Xie, De
    Liu, Xianglong
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 176 - 183
  • [49] Deep Feature-Based Neighbor Similarity Hashing With Adversarial Learning for Cross-Modal Retrieval
    Li, Kun
    Zhang, Yonghui
    Wang, Feng
    Liu, Guoxu
    Wei, Xianmin
    IEEE ACCESS, 2024, 12 : 128559 - 128569
  • [50] Joint-Modal Graph Convolutional Hashing for unsupervised cross-modal retrieval
    Meng, Hui
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Lu, Xu
    Guo, Xinru
    NEUROCOMPUTING, 2024, 595