Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing

Cited by: 1
Authors
Yang, Xiaohan [1]
Wang, Zhen [1,2]
Wu, Nannan [1]
Li, Guokun [1]
Feng, Chuang [1]
Liu, Pingping [2]
Affiliations
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255000, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
cross-modal retrieval; image-text retrieval; cross-modal similarity preserving; hashing algorithm; unsupervised learning; NETWORK; VGG-16;
DOI
10.3390/math10152644
Chinese Library Classification number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Image-text cross-modal retrieval, which aims to retrieve relevant images from a text query and vice versa, is attracting widespread attention. To respond quickly in large-scale settings, we propose Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing (DRNPH), which performs cross-modal retrieval in a common Hamming space and therefore offers low storage cost and efficient search. To support nearest neighbor search in the Hamming space, we require that both the original intra-modal and inter-modal neighbor matrices be reconstructed from the binary feature vectors. The neighbor relationships among samples of different modalities can then be computed directly from Hamming distances. Furthermore, the cross-modal pair-wise similarity preserving constraint requires similar sample pairs to have identical Hamming distances to the anchor. As a result, similar sample pairs share the same binary code and have minimal Hamming distances. Unfortunately, the pair-wise similarity preserving constraint may lead to an imbalanced code problem. We therefore propose a cross-modal triplet relative similarity preserving constraint, which requires the Hamming distances of similar pairs to be smaller than those of dissimilar pairs, so that the ranking order of samples in the retrieval results can be distinguished. Moreover, a large similarity margin can improve the algorithm's robustness to noise. We conduct comparative cross-modal retrieval experiments and an ablation study on two public datasets, MIRFlickr and NUS-WIDE. The experimental results show that DRNPH outperforms state-of-the-art approaches in various image-text retrieval scenarios, and that all three proposed constraints are necessary and effective for improving cross-modal retrieval performance.
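The abstract does not give the loss formulas, so the following is only a minimal sketch of the two ideas it names: ranking items of one modality by Hamming distance to a query from the other modality, and a triplet constraint with a margin. The hinge form of the triplet term, the function names, and the toy 8-bit codes are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Sequence


def hamming(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of positions at which two equal-length binary codes differ."""
    if len(a) != len(b):
        raise ValueError("codes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def rank_by_hamming(query: Sequence[int], database: List[Sequence[int]]) -> List[int]:
    """Database indices sorted by increasing Hamming distance to the query."""
    return sorted(range(len(database)), key=lambda i: hamming(query, database[i]))


def triplet_hinge(anchor: Sequence[int], positive: Sequence[int],
                  negative: Sequence[int], margin: int) -> int:
    """Assumed hinge-style triplet term: zero once the negative is at least
    `margin` bits farther from the anchor than the positive."""
    return max(0, margin + hamming(anchor, positive) - hamming(anchor, negative))


# Toy 8-bit codes (made up): one text query and three image codes.
text_query = [1, 0, 1, 1, 0, 0, 1, 0]
image_codes = [
    [0, 1, 0, 0, 1, 1, 0, 1],  # differs from the query in all 8 bits
    [1, 0, 1, 1, 0, 0, 1, 1],  # differs in 1 bit
    [1, 0, 0, 1, 0, 1, 1, 0],  # differs in 2 bits
]

ranking = rank_by_hamming(text_query, image_codes)
# Positive = code 1, negative = code 0: the gap of 7 bits exceeds the margin.
loss_ok = triplet_hinge(text_query, image_codes[1], image_codes[0], margin=4)
# Positive = code 1, negative = code 2: the gap of 1 bit is below the margin.
loss_violated = triplet_hinge(text_query, image_codes[1], image_codes[2], margin=4)
print(ranking, loss_ok, loss_violated)
```

With these toy codes the script prints `[1, 2, 0] 0 3`: the second triplet is penalized because the dissimilar code is only one bit farther from the anchor than the similar one, which is the kind of ordering violation a margin-based triplet constraint is meant to discourage.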
Pages: 17
Related papers
50 in total
  • [21] Cross-Modal Deep Variational Hashing
    Liong, Venice Erin
    Lu, Jiwen
    Tan, Yap-Peng
    Zhou, Jie
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4097 - 4105
  • [22] Unsupervised Multi-modal Hashing for Cross-Modal Retrieval
    Yu, Jun
    Wu, Xiao-Jun
    Zhang, Donglin
    COGNITIVE COMPUTATION, 2022, 14 (03) : 1159 - 1171
  • [23] Unsupervised Multi-modal Hashing for Cross-Modal Retrieval
    Jun Yu
    Xiao-Jun Wu
    Donglin Zhang
    Cognitive Computation, 2022, 14 : 1159 - 1171
  • [24] Pseudo-label driven deep hashing for unsupervised cross-modal retrieval
    Zeng, XianHua
    Xu, Ke
    Xie, YiCai
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3437 - 3456
  • [25] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Cheng Zhang
    Yuan Wan
    Haopeng Qiang
    Neural Computing and Applications, 2024, 36 : 5383 - 5397
  • [26] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Zhang, Cheng
    Wan, Yuan
    Qiang, Haopeng
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (10) : 5383 - 5397
  • [27] Generative Adversarial Network Based Asymmetric Deep Cross-Modal Unsupervised Hashing
    Cao, Yuan
    Gao, Yaru
    Chen, Na
    Lin, Jiacheng
    Chen, Sheng
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT I, 2024, 14487 : 30 - 48
  • [28] Pseudo-label driven deep hashing for unsupervised cross-modal retrieval
    XianHua Zeng
    Ke Xu
    YiCai Xie
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3437 - 3456
  • [29] Multi-label semantics preserving based deep cross-modal hashing
    Zou, Xitao
    Wang, Xinzhi
    Bakker, Erwin M.
    Wu, Song
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 93
  • [30] Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search
    Jin, Lu
    Li, Kai
    Li, Zechao
    Xiao, Fu
    Qi, Guo-Jun
    Tang, Jinhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1429 - 1440