NSDH: A Nonlinear Supervised Discrete Hashing framework for large-scale cross-modal retrieval

被引:20
|
作者
Yang, Zhan [1 ,2 ]
Yang, Liu [1 ]
Raymond, Osolo Ian [1 ,2 ]
Zhu, Lei [1 ,2 ]
Huang, Wenti [1 ,2 ]
Liao, Zhifang [1 ]
Long, Jun [2 ,3 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Network Resources Management & Trust Evaluat Key, Changsha 410083, Hunan, Peoples R China
[3] Cent South Univ, Big Data Inst, Changsha 410083, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; Supervised hashing; Semantic enhancement descriptors;
D O I
10.1016/j.knosys.2021.106818
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hashing technology has been widely used in approximate nearest neighbor search algorithms for large-scale cross-modal retrieval due to its significantly reduced storage and high-speed search capabilities. However, most existing supervised cross-modal hashing methods either mainly rely on binary pairwise similarity and fail to exploit the rich semantic information contained in the label matrix, or the use of single linear projections which suffer from limited information completeness. In this paper, we propose a novel method, named Nonlinear Supervised Discrete Hashing (NSDH). Specifically, NSDH consists of two components, (1) a semantic enhancement descriptor consisting of multiple linear projections that is used to extract comprehensive latent representations of heterogeneous multimedia data, which aligns the original heterogeneous features and integrates the rich semantic label matrix; (2) a fast discrete optimization module used to learn discriminative compact hash codes, which preserves the similarity information using an inner product between the real-valued embeddings of the output of the semantic enhancement descriptors. Therefore, NSDH leverages both the label matrix and similarity information in order to enhance the semantic information of the learned hash codes. In this way, the representation learning capability of the output layer of semantic enhancement descriptors can be greatly enhanced and as a result the learned hash codes are more discriminative. In addition, we present a fast discrete optimization algorithm to efficiently learn the binary hash codes. Results from our experiments on two benchmark datasets highlight the superiority of NSDH in comparison to many state-of-the-art cross-modal hashing methods. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Joint and individual matrix factorization hashing for large-scale cross-modal retrieval
    Wang, Di
    Wang, Quan
    He, Lihuo
    Gao, Xinbo
    Tian, Yumin
    [J]. PATTERN RECOGNITION, 2020, 107
  • [22] Semantic-guided autoencoder adversarial hashing for large-scale cross-modal retrieval
    Mingyong Li
    Qiqi Li
    Yan Ma
    Degang Yang
    [J]. Complex & Intelligent Systems, 2022, 8 : 1603 - 1617
  • [23] DEEP SEMANTIC ADVERSARIAL HASHING BASED ON AUTOENCODER FOR LARGE-SCALE CROSS-MODAL RETRIEVAL
    Li, Mingyong
    Wang, Hongya
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [24] Supervised Hierarchical Online Hashing for Cross-modal Retrieval
    Han, Kai
    Liu, Yu
    Wei, Rukai
    Zhou, Ke
    Xu, Jinhui
    Long, Kun
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [25] Unsupervised multi-graph cross-modal hashing for large-scale multimedia retrieval
    Xie, Liang
    Zhu, Lei
    Chen, Guoqi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 9185 - 9204
  • [26] Unsupervised multi-graph cross-modal hashing for large-scale multimedia retrieval
    Liang Xie
    Lei Zhu
    Guoqi Chen
    [J]. Multimedia Tools and Applications, 2016, 75 : 9185 - 9204
  • [27] Supervised Matrix Factorization Hashing for Cross-Modal Retrieval
    Tang, Jun
    Wang, Ke
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3157 - 3166
  • [28] Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval
    Zhan, Yu-Wei
    Luo, Xin
    Wang, Yongxin
    Xu, Xin-Shun
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3386 - 3394
  • [29] Discriminative correlation hashing for supervised cross-modal retrieval
    Lu, Xu
    Zhang, Huaxiang
    Sun, Jiande
    Wang, Zhenhua
    Guo, Peilian
    Wan, Wenbo
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 65 : 221 - 230
  • [30] Semantic-guided autoencoder adversarial hashing for large-scale cross-modal retrieval
    Li, Mingyong
    Li, Qiqi
    Ma, Yan
    Yang, Degang
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (02) : 1603 - 1617