Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval

被引：37

作者：

Zhong, Fangming ^{[1
]}

Chen, Zhikui ^{[1
,2
]}

Min, Geyong ^{[3
]}

机构：

[1] Dalian Univ Technol, Sch Software, Dalian, Peoples R China

[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian, Peoples R China

[3] Univ Exeter, Coll Engn Math & Phys Sci, Exeter, Devon, England

来源：

PATTERN RECOGNITION | 2018年 / 83卷

关键词：

Cross-modal retrieval; deep learning; discrete hashing; alternative optimization; QUANTIZATION;

D O I：

10.1016/j.patcog.2018.05.018

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-modal hashing has drawn increasing research interests in multimedia retrieval due to the explosive growth of multimedia big data. It is such a challenging topic due to the heterogeneity gap and high storage cost. However, most of the previous methods based on conventional linear projections and relaxation scheme fail to capture the nonlinear relationship among samples and suffers from large quantization loss, which result in an unsatisfactory performance of cross-modal retrieval. To address these issues, this paper is dedicated to learning discrete nonlinear hash functions by deep learning. A novel framework of cross-modal deep neural networks is proposed to learn binary codes directly. We formulate the similarity preserving in the framework, and also bit-independent as well as binary constraints are imposed on the hash codes. Specifically, we consider intra-modality similarity preserving at each hidden layer of the networks. Inter-modality similarity preserving is formulated by the output of each individual network. By so doing, the cross correlation can be encoded into the network training (i.e. hash functions learning) by back propagation algorithm. The final objective is solved by alternative optimization in an iterative fashion. Experimental results on four datasets i.e. NUS-WIDE, MIR Flickr, Pascal VOC, and LabelMe demonstrate the effectiveness of the proposed method, which is significantly superior to state-of-the-art cross-modal hashing approaches. (C) 2018 Elsevier Ltd. All rights reserved.

引用

页码：64 / 77

页数：14

共 50 条

[21] Discrete semantic embedding hashing for scalable cross-modal retrieval
Liu, Junjie
Fei, Lunke
Jia, Wei
Zhao, Shuping
Wen, Jie
Teng, Shaohua
Zhang, Wei
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1461 - 1467
[22] Robust and discrete matrix factorization hashing for cross-modal retrieval
Zhang, Donglin
Wu, Xiao-Jun
[J]. PATTERN RECOGNITION, 2022, 122
[23] Supervised Discrete Matrix Factorization Hashing For Cross-Modal Retrieval
Wu, Fei
Wu, Zhiyong
Feng, Yujian
Zhou, Jun
Huang, He
Li, Xinwei
Dong, Xiwei
Jing, Xiao Yuan
[J]. PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 855 - 859
[24] Two-Step Discrete Hashing for Cross-Modal Retrieval
Tu, Junfeng
Liu, Xueliang
Hao, Yanbin
Hong, Richang
Wang, Meng
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8730 - 8741
[25] Discrete Semantic Matrix Factorization Hashing for Cross-Modal Retrieval
Qin, Jianyang
Fei, Lunke
Teng, Shaohua
Zhang, Wei
Liu, Dongning
Zhao, Genping
Yuan, Haoliang
[J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1550 - 1557
[26] Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
Yan, Ting-Kun
Xu, Xin-Shun
Guo, Shanqing
Huang, Zi
Wang, Xiao-Lin
[J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1271 - 1280
[27] DAH: Discrete Asymmetric Hashing for Efficient Cross-Media Retrieval
Zhang, Donglin
Wu, Xiao-Jun
Xu, Tianyang
Yin, He-Feng
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1365 - 1378
[28] Hashing for Cross-Modal Similarity Retrieval
Liu, Yao
Yuan, Yanhong
Huang, Qiaoli
Huang, Zhixing
[J]. 2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 1 - 8
[29] Deep Unsupervised Momentum Contrastive Hashing for Cross-modal Retrieval
Lu, Kangkang
Yu, Yanhua
Liang, Meiyu
Zhang, Min
Cao, Xiaowen
Zhao, Zehua
Yin, Mengran
Xue, Zhe
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 126 - 131
[30] Cross-Modal Hashing Retrieval Based on Deep Residual Network
Li, Zhiyi
Xu, Xiaomian
Zhang, Du
Zhang, Peng
[J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 36 (02): : 383 - 405

← 1 2 3 4 5 →