Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval

Cited by: 37
Authors
Zhong, Fangming [1]
Chen, Zhikui [1,2]
Min, Geyong [3]
Affiliations
[1] Dalian Univ Technol, Sch Software, Dalian, Peoples R China
[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian, Peoples R China
[3] Univ Exeter, Coll Engn Math & Phys Sci, Exeter, Devon, England
Keywords
Cross-modal retrieval; deep learning; discrete hashing; alternative optimization; QUANTIZATION;
DOI
10.1016/j.patcog.2018.05.018
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cross-modal hashing has drawn increasing research interest in multimedia retrieval due to the explosive growth of multimedia big data. It is a challenging topic because of the heterogeneity gap and high storage cost. However, most previous methods, which are based on conventional linear projections and relaxation schemes, fail to capture the nonlinear relationships among samples and suffer from large quantization loss, which results in unsatisfactory cross-modal retrieval performance. To address these issues, this paper is dedicated to learning discrete nonlinear hash functions by deep learning. A novel framework of cross-modal deep neural networks is proposed to learn binary codes directly. We formulate similarity preservation in the framework, and impose bit-independence and binary constraints on the hash codes. Specifically, we consider intra-modality similarity preservation at each hidden layer of the networks. Inter-modality similarity preservation is formulated on the output of each individual network. In this way, the cross correlation can be encoded into network training (i.e., hash function learning) by the back-propagation algorithm. The final objective is solved by alternative optimization in an iterative fashion. Experimental results on four datasets, i.e., NUS-WIDE, MIR Flickr, Pascal VOC, and LabelMe, demonstrate the effectiveness of the proposed method, which is significantly superior to state-of-the-art cross-modal hashing approaches. (C) 2018 Elsevier Ltd. All rights reserved.
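The ingredients named in the abstract (inter-modality similarity preservation, bit independence, binary constraints, and retrieval over binary codes) can be illustrated with a small NumPy sketch. This is not the authors' formulation: the abstract gives no equations, so the loss forms, the {-1, +1} code convention, and every function name below are assumptions chosen for illustration only.

```python
import numpy as np

def sign_codes(outputs):
    """Quantize real-valued network outputs to binary codes in {-1, +1}."""
    return np.where(outputs >= 0, 1, -1)

def inter_modal_loss(img_out, txt_out, sim):
    """Assumed form: mean squared gap between scaled inner products of the
    two modalities' outputs and a similarity matrix with entries +1 / -1."""
    n_bits = img_out.shape[1]
    inner = img_out @ txt_out.T / n_bits
    return float(np.mean((inner - sim) ** 2))

def bit_independence_penalty(outputs):
    """Assumed form: squared Frobenius distance between the bit
    correlation matrix and the identity (zero when bits are uncorrelated)."""
    n_samples, n_bits = outputs.shape
    corr = outputs.T @ outputs / n_samples
    return float(np.sum((corr - np.eye(n_bits)) ** 2))

def quantization_loss(outputs):
    """Mean squared gap between real-valued outputs and their binary codes."""
    return float(np.mean((outputs - sign_codes(outputs)) ** 2))

def hamming_distance(a, b):
    """Pairwise Hamming distances between two sets of {-1, +1} codes.
    a has shape (n, k), b has shape (m, k); the result has shape (n, m)."""
    n_bits = a.shape[1]
    return (n_bits - a @ b.T) // 2

# Toy usage: one image code queried against two text codes.
query = np.array([[1, -1, 1, 1]])
database = np.array([[1, 1, 1, -1], [1, -1, 1, 1]])
ranking = np.argsort(hamming_distance(query, database)[0])
```

In this toy query the second database code is identical to the query, so it ranks first. In a cross-modal setting the query and database codes would come from different networks (for example, image and text), which is why the similarity term is computed across the two outputs.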
Pages: 64-77
Number of pages: 14
Related papers
50 in total
  • [21] Discrete semantic embedding hashing for scalable cross-modal retrieval
    Liu, Junjie
    Fei, Lunke
    Jia, Wei
    Zhao, Shuping
    Wen, Jie
    Teng, Shaohua
    Zhang, Wei
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1461 - 1467
  • [22] Robust and discrete matrix factorization hashing for cross-modal retrieval
    Zhang, Donglin
    Wu, Xiao-Jun
    [J]. PATTERN RECOGNITION, 2022, 122
  • [23] Supervised Discrete Matrix Factorization Hashing For Cross-Modal Retrieval
    Wu, Fei
    Wu, Zhiyong
    Feng, Yujian
    Zhou, Jun
    Huang, He
    Li, Xinwei
    Dong, Xiwei
    Jing, Xiao Yuan
    [J]. PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 855 - 859
  • [24] Two-Step Discrete Hashing for Cross-Modal Retrieval
    Tu, Junfeng
    Liu, Xueliang
    Hao, Yanbin
    Hong, Richang
    Wang, Meng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8730 - 8741
  • [25] Discrete Semantic Matrix Factorization Hashing for Cross-Modal Retrieval
    Qin, Jianyang
    Fei, Lunke
    Teng, Shaohua
    Zhang, Wei
    Liu, Dongning
    Zhao, Genping
    Yuan, Haoliang
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1550 - 1557
  • [26] Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
    Yan, Ting-Kun
    Xu, Xin-Shun
    Guo, Shanqing
    Huang, Zi
    Wang, Xiao-Lin
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1271 - 1280
  • [27] DAH: Discrete Asymmetric Hashing for Efficient Cross-Media Retrieval
    Zhang, Donglin
    Wu, Xiao-Jun
    Xu, Tianyang
    Yin, He-Feng
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1365 - 1378
  • [28] Hashing for Cross-Modal Similarity Retrieval
    Liu, Yao
    Yuan, Yanhong
    Huang, Qiaoli
    Huang, Zhixing
    [J]. 2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 1 - 8
  • [29] Deep Unsupervised Momentum Contrastive Hashing for Cross-modal Retrieval
    Lu, Kangkang
    Yu, Yanhua
    Liang, Meiyu
    Zhang, Min
    Cao, Xiaowen
    Zhao, Zehua
    Yin, Mengran
    Xue, Zhe
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 126 - 131
  • [30] Cross-Modal Hashing Retrieval Based on Deep Residual Network
    Li, Zhiyi
    Xu, Xiaomian
    Zhang, Du
    Zhang, Peng
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 36 (02) : 383 - 405