Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval

被引:0
|
作者
Zhang, Donglin [1 ]
Xiao-Jun Wu [1 ]
Xu, Tianyang [1 ,2 ]
Kittler, Josef [2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
Semantics; Binary codes; Hash functions; Optimization; Quantization (signal); Task analysis; Costs; Cross-modal retrieval; discrete optimization; hashing;
D O I
10.1109/TSMC.2021.3130939
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, hashing-based multimodal learning systems have received increasing attention due to their query efficiency and parsimonious storage costs. However, impeded by the quantization loss caused by numerical optimization, the existing cross-media hashing approaches are unable to capture all the discriminative information present in the original multimodal data. Besides, most cross-modal methods belong to the one-step paradigm, which learn the binary codes and hash function simultaneously, increasing the complexity of optimization. To address these issues, we propose a novel two-stage approach, named the two-stage supervised discrete hashing (TSDH) method. In particular, in the first phase, TSDH generates a latent representation for each modality. These representations are then mapped to a common Hamming space to generate the binary codes. In addition, TSDH directly endows the hash codes with the semantic labels, enhancing the discriminatory power of the learned binary codes. A discrete hash optimization approach is developed to learn the binary codes without relaxation, avoiding the large quantization loss. The proposed hash function learning scheme reuses the semantic information contained by the embeddings, endowing the hash functions with enhanced discriminability. Extensive experiments on several databases demonstrate the effectiveness of the developed TSDH, outperforming several recent competitive cross-media algorithms.
引用
收藏
页码:7014 / 7026
页数:13
相关论文
共 50 条
  • [1] Supervised Contrastive Discrete Hashing for cross-modal retrieval
    Li, Ze
    Yao, Tao
    Wang, Lili
    Li, Ying
    Wang, Gang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [2] Discrete Robust Supervised Hashing for Cross-Modal Retrieval
    Yao, Tao
    Zhang, Zhiwang
    Yan, Lianshan
    Yue, Jun
    Tian, Qi
    [J]. IEEE ACCESS, 2019, 7 : 39806 - 39814
  • [3] Two-stage deep learning for supervised cross-modal retrieval
    Jie Shao
    Zhicheng Zhao
    Fei Su
    [J]. Multimedia Tools and Applications, 2019, 78 : 16615 - 16631
  • [4] Two-stage deep learning for supervised cross-modal retrieval
    Shao, Jie
    Zhao, Zhicheng
    Su, Fei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 16615 - 16631
  • [5] Supervised Discrete Matrix Factorization Hashing For Cross-Modal Retrieval
    Wu, Fei
    Wu, Zhiyong
    Feng, Yujian
    Zhou, Jun
    Huang, He
    Li, Xinwei
    Dong, Xiwei
    Jing, Xiao Yuan
    [J]. PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 855 - 859
  • [6] Two-Stage Asymmetric Similarity Preserving Hashing for Cross-Modal Retrieval
    Huang, Junfan
    Kang, Peipei
    Han, Na
    Chen, Yonghao
    Fang, Xiaozhao
    Gao, Hongbo
    Zhou, Guoxu
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (01) : 429 - 444
  • [7] Semi-supervised discrete hashing for efficient cross-modal retrieval
    Xingzhi Wang
    Xin Liu
    Shu-Juan Peng
    Bineng Zhong
    Yewang Chen
    Ji-Xiang Du
    [J]. Multimedia Tools and Applications, 2020, 79 : 25335 - 25356
  • [8] Semi-supervised discrete hashing for efficient cross-modal retrieval
    Wang, Xingzhi
    Liu, Xin
    Peng, Shu-Juan
    Zhong, Bineng
    Chen, Yewang
    Du, Ji-Xiang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 25335 - 25356
  • [9] Two-Step Discrete Hashing for Cross-Modal Retrieval
    Tu, Junfeng
    Liu, Xueliang
    Hao, Yanbin
    Hong, Richang
    Wang, Meng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8730 - 8741
  • [10] Efficient discrete supervised hashing for large-scale cross-modal retrieval
    Yao, Tao
    Han, Yaru
    Wang, Ruxin
    Kong, Xiangwei
    Yan, Lianshan
    Fu, Haiyan
    Tian, Qi
    [J]. NEUROCOMPUTING, 2020, 385 (385) : 358 - 367