Deep Multiscale Fusion Hashing for Cross-Modal Retrieval

Cited: 56
Authors
Nie, Xiushan [1 ]
Wang, Bowei [2 ]
Li, Jiajia [2 ]
Hao, Fanchang [1 ]
Jian, Muwei [2 ]
Yin, Yilong [3 ]
Affiliations
[1] Shandong Jianzhu Univ, Sch Comp Sci & Technol, Jinan 250101, Peoples R China
[2] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan 250014, Peoples R China
[3] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semantics; Machine learning; Training data; Media; Electronic mail; Correlation; Retrieval; hashing; deep learning; cross-modal;
DOI
10.1109/TCSVT.2020.2974877
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline Codes
0808; 0809;
Abstract
Owing to the rapid development of deep learning and the high efficiency of hashing, hashing methods based on deep learning models have been widely adopted for cross-modal retrieval. In existing deep-model-based methods, modality-specific features generally play an important role during hash learning. However, most existing methods use only the modality-specific features from the final fully connected layer, ignoring the semantic relevance among modality-specific features at different scales across multiple layers. To address this issue, we propose an end-to-end deep hashing method called deep multiscale fusion hashing (DMFH) for cross-modal retrieval. In DMFH, we first design a separate network branch for each of the two modalities and then apply a multiscale fusion model to each branch to fuse the multiscale semantics, which is used to explore the semantic relevance. Furthermore, the multiscale fusion models embed the multiscale semantics into the final hash codes, making them more representative. In addition, DMFH learns the common hash codes directly, without relaxation, thereby avoiding the accuracy loss that relaxation incurs during hash learning. Experimental results on three benchmark datasets demonstrate the superiority of the proposed method.
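The abstract is the only description of the method available in this record, so the following PyTorch sketch is illustrative rather than a reproduction of DMFH: every layer size, the concatenation-based fusion, the 4096-d/1386-d input dimensions, and the tanh placeholder are assumptions (the paper itself reports learning the codes directly without a relaxation, and that discrete optimisation is not reproduced here).

```python
# Illustrative sketch only (not from the paper): the record above gives no
# architectural details, so the branch depth, layer widths, fusion operator,
# and feature dimensions below are all assumptions. The tanh is a standard
# continuous relaxation used here as a placeholder; the abstract states that
# DMFH learns hash codes directly without such a relaxation.
import torch
import torch.nn as nn

class MultiscaleFusionBranch(nn.Module):
    """One modality branch: features taken at several depths are projected
    to a common size and fused before the hash layer, so the codes reflect
    multiscale semantics rather than only the final layer's output."""

    def __init__(self, in_dim, hidden_dims=(512, 256, 128), code_len=64):
        super().__init__()
        dims = [in_dim, *hidden_dims]
        # Stacked stages whose intermediate outputs serve as the "scales".
        self.stages = nn.ModuleList(
            nn.Sequential(nn.Linear(dims[i], dims[i + 1]), nn.ReLU())
            for i in range(len(hidden_dims))
        )
        # Project every scale to the code length, then fuse by concatenation.
        self.proj = nn.ModuleList(nn.Linear(d, code_len) for d in hidden_dims)
        self.fuse = nn.Linear(code_len * len(hidden_dims), code_len)

    def forward(self, x):
        scales = []
        for stage, proj in zip(self.stages, self.proj):
            x = stage(x)
            scales.append(proj(x))
        return torch.tanh(self.fuse(torch.cat(scales, dim=1)))

# Hypothetical input sizes: e.g. 4096-d CNN features, 1386-d bag-of-words.
image_branch = MultiscaleFusionBranch(in_dim=4096)
text_branch = MultiscaleFusionBranch(in_dim=1386)

img, txt = torch.randn(8, 4096), torch.randn(8, 1386)
codes_img = torch.sign(image_branch(img))  # binarise at retrieval time
codes_txt = torch.sign(text_branch(txt))
print(codes_img.shape, codes_txt.shape)    # torch.Size([8, 64]) twice
```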
Pages: 401 - 410
Page count: 10
Related Papers
50 in total
  • [1] A triple fusion model for cross-modal deep hashing retrieval
    Wang, Hufei
    Zhao, Kaiqiang
    Zhao, Dexin
    [J]. MULTIMEDIA SYSTEMS, 2023, 29 (01) : 347 - 359
  • [2] Deep Label Feature Fusion Hashing for Cross-Modal Retrieval
    Ren, Dongxiao
    Xu, Weihua
    Wang, Zhonghua
    Sun, Qinxiu
    [J]. IEEE ACCESS, 2022, 10 : 100276 - 100285
  • [3] Unsupervised Deep Fusion Cross-modal Hashing
    Huang, Jiaming
    Min, Chen
    Jing, Liping
    [J]. ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 358 - 366
  • [4] Joint feature fusion hashing for cross-modal retrieval
    Cao, Yuxia
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [5] Discrete Fusion Adversarial Hashing for cross-modal retrieval
    Li, Jing
    Yu, En
    Ma, Jianhua
    Chang, Xiaojun
    Zhang, Huaxiang
    Sun, Jiande
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [6] Deep Multiscale Fine-Grained Hashing for Remote Sensing Cross-Modal Retrieval
    Huang, Jiaxiang
    Feng, Yong
    Zhou, Mingliang
    Xiong, Xiancai
    Wang, Yongheng
    Qiang, Baohua
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [7] Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval
    Zhan, Yu-Wei
    Luo, Xin
    Wang, Yongxin
    Xu, Xin-Shun
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3386 - 3394
  • [8] Deep Hashing Similarity Learning for Cross-Modal Retrieval
    Ma, Ying
    Wang, Meng
    Lu, Guangyun
    Sun, Yajun
    [J]. IEEE ACCESS, 2024, 12 : 8609 - 8618
  • [9] Fusion-Supervised Deep Cross-Modal Hashing
    Wang, Li
    Zhu, Lei
    Yu, En
    Sun, Jiande
    Zhang, Huaxiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 37 - 42