FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval

Cited by: 43
Authors
Liu, Xin [1, 2, 3]
Wang, Xingzhi [4]
Cheung, Yiu-ming [5]
Affiliations
[1] Huaqiao Univ, Dept Comp Sci, Xiamen 361021, Peoples R China
[2] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361021, Peoples R China
[3] Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361021, Peoples R China
[4] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[5] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
Funding
National Science Foundation (USA)
Keywords
Semantics; Training; Optimization; Correlation; Sparse matrices; Quantization (signal); Media; epsilon-dragging; bi-Lipschitz continuity; cross-modal hashing; online strategy; orthogonal basis; semantic margin; BINARY-CODES
DOI
10.1109/TNNLS.2021.3076684
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cross-modal hashing, favored for its effectiveness and efficiency, has received wide attention for facilitating efficient retrieval across different modalities. Nevertheless, most existing methods do not sufficiently exploit the discriminative power of semantic information when learning the hash codes, and they often involve a time-consuming training procedure when handling large-scale datasets. To tackle these issues, we formulate the learning of similarity-preserving hash codes in terms of orthogonally rotating the semantic data so as to minimize the quantization loss of mapping such data to Hamming space, and we propose an efficient fast discriminative discrete hashing (FDDH) approach for large-scale cross-modal retrieval. More specifically, FDDH introduces an orthogonal basis to regress the targeted hash codes of training examples to their corresponding semantic labels and utilizes the epsilon-dragging technique to provide provable large semantic margins. Accordingly, the discriminative power of semantic information can be explicitly captured and maximized. Moreover, an orthogonal transformation scheme is further proposed to map the nonlinear embedding data into the semantic subspace, which guarantees semantic consistency between the data feature and its semantic representation. Consequently, a closed-form solution is derived for discriminative hash code learning, which is computationally efficient. In addition, an effective and stable online learning strategy is presented for optimizing modality-specific projection functions, featuring adaptivity to different training sizes and streaming data. The proposed FDDH approach theoretically approximates the bi-Lipschitz continuity, runs sufficiently fast, and significantly improves retrieval performance over state-of-the-art methods. The source code is released at https://github.com/starxliu/FDDH.
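The abstract describes hash-code learning as orthogonally rotating data to minimize the quantization loss of mapping it to Hamming space. The sketch below illustrates only that general idea and is not the FDDH algorithm: it is a generic alternating scheme (binarize, then solve an orthogonal Procrustes problem) of the kind used in iterative-quantization methods. The function name `rotate_and_quantize`, its parameters, and the random test data are all assumptions made for illustration; the authors' actual implementation is in the repository linked above.

```python
import numpy as np

def rotate_and_quantize(V, n_iter=20, seed=0):
    """Illustrative only: find an orthogonal R that reduces ||B - V R||_F^2,
    where B = sign(V R) holds +/-1 codes. V is an (n, d) real matrix."""
    rng = np.random.default_rng(seed)
    d = V.shape[1]
    # Random orthogonal initialization via QR decomposition.
    R, _ = np.linalg.qr(rng.standard_normal((d, d)))
    losses = []
    for _ in range(n_iter):
        # Step 1: with R fixed, the best binary codes are the signs of V R.
        B = np.where(V @ R >= 0, 1.0, -1.0)
        # Step 2: with B fixed, the best orthogonal R solves an orthogonal
        # Procrustes problem: R = U W^T, where V^T B = U S W^T.
        U, _, Wt = np.linalg.svd(V.T @ B)
        R = U @ Wt
        # Quantization loss for the current (B, R) pair.
        losses.append(float(np.linalg.norm(B - V @ R) ** 2))
    B = np.where(V @ R >= 0, 1.0, -1.0)
    return B, R, losses

if __name__ == "__main__":
    V = np.random.default_rng(1).standard_normal((200, 8))
    B, R, losses = rotate_and_quantize(V)
    print("first loss:", losses[0], "last loss:", losses[-1])
```

Because each step minimizes the same objective over one variable while the other is held fixed, the recorded loss cannot increase from one iteration to the next. According to the abstract, FDDH itself derives a closed-form solution rather than relying on a generic loop like this one.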
Pages: 6306-6320
Number of pages: 15