Semantic-rebased cross-modal hashing for scalable unsupervised text-visual retrieval

被引:11
|
作者
Wang, Weiwei [1 ]
Shen, Yuming [2 ]
Zhang, Haofeng [1 ]
Liu, Li [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Incept Inst Artificial Intelligence IIAI, Abu Dhabi, U Arab Emirates
基金
中国国家自然科学基金;
关键词
Sparse graph; Semantic rebasing; Cross-modal hashing; Unsupervised text-visual retrieval;
D O I
10.1016/j.ipm.2020.102374
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, learning-based cross-modal hashing has gained increasing research interests for its low computation complexity and memory requirement. Among existing cross-modal techniques, supervised algorithms can gain better performance. However, due to the cost of acquiring labeled data, unsupervised methods become our choice when faced with large scale unlabeled web images. The label-free nature of unsupervised cross-modal hashing hinders models from exploiting the exact semantic data similarity. Existing research typically simulates the semantics by a heuristic geometric prior in the original feature space with pseudo labels or traditional dense graph structures. However, this introduces heavy bias into the model as the original features are not fully representing the underlying multi-view data relations, and these two structures may face with issues like interference noise or high sensitivity to cluster number. To address the problem above, in this paper, we propose a novel unsupervised sparse-graph based hashing method called Semantic-Rebased Cross-modal Hashing (SRCH). A novel `Set-and-Rebase' process is defined to initialize and update the cross-modal similarity graph of training data. In particular, we set the graph according to the intra-modal feature geometric basis and then alternately rebase it to update the edges within according to the hashing results. We develop an alternating optimization routine to rebase the graph and train the hashing auto-encoders with closed-form solutions so that the overall framework is efficiently trained. Our experimental results on benchmarked datasets demonstrate the superiority of our model against state-of-the-art algorithms.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Unsupervised Cross-Modal Hashing via Semantic Text Mining
    Tu, Rong-Cheng
    Mao, Xian-Ling
    Lin, Qinghong
    Ji, Wenjin
    Qin, Weize
    Wei, Wei
    Huang, Heyan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8946 - 8957
  • [2] Discrete semantic embedding hashing for scalable cross-modal retrieval
    Liu, Junjie
    Fei, Lunke
    Jia, Wei
    Zhao, Shuping
    Wen, Jie
    Teng, Shaohua
    Zhang, Wei
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1461 - 1467
  • [3] Deep Visual-Semantic Hashing for Cross-Modal Retrieval
    Cao, Yue
    Long, Mingsheng
    Wang, Jianmin
    Yang, Qiang
    Yu, Philip S.
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1445 - 1454
  • [4] Efficient discrete latent semantic hashing for scalable cross-modal retrieval
    Lu, Xu
    Zhu, Lei
    Cheng, Zhiyong
    Song, Xuemeng
    Zhang, Huaxiang
    [J]. SIGNAL PROCESSING, 2019, 154 : 217 - 231
  • [5] Scalable semantic-enhanced supervised hashing for cross-modal retrieval
    Yang, Fan
    Ding, Xiaojian
    Liu, Yufeng
    Ma, Fumin
    Cao, Jie
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [6] Semantic consistency hashing for cross-modal retrieval
    Yao, Tao
    Kong, Xiangwei
    Fu, Haiyan
    Tian, Qi
    [J]. NEUROCOMPUTING, 2016, 193 : 250 - 259
  • [7] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Cheng Zhang
    Yuan Wan
    Haopeng Qiang
    [J]. Neural Computing and Applications, 2024, 36 : 5383 - 5397
  • [8] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Zhang, Cheng
    Wan, Yuan
    Qiang, Haopeng
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (10): : 5383 - 5397
  • [9] Deep Semantic-Preserving Reconstruction Hashing for Unsupervised Cross-Modal Retrieval
    Cheng, Shuli
    Wang, Liejun
    Du, Anyu
    [J]. ENTROPY, 2020, 22 (11) : 1 - 22
  • [10] Object-Level Visual-Text Correlation Graph Hashing for Unsupervised Cross-Modal Retrieval
    Shi, Ge
    Li, Feng
    Wu, Lifang
    Chen, Yukun
    [J]. SENSORS, 2022, 22 (08)