Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval

Cited by: 18

Authors:

Luo, Kaiyi [1]

Zhang, Chao [1]

Li, Huaxiong [1]

Jia, Xiuyi [2]

Chen, Chunlin [1]

Affiliations:

[1] Nanjing Univ, Dept Control Sci & Intelligence Engn, Nanjing 210093, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210014, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2023 / Vol. 25

Funding:

National Natural Science Foundation of China;

Keywords:

Cross-modal retrieval; unpaired hashing; adaptive margins; CODES;

DOI:

10.1109/TMM.2023.3245400

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In recent years, Cross-Modal Hashing (CMH) has attracted much attention due to its fast query speed and efficient storage. Previous studies have achieved promising results for Cross-Modal Retrieval (CMR) by discovering discriminative hash codes and modality-specific hash functions. Nonetheless, most existing CMR works are subject to two restrictions: 1) data of different modalities are assumed to be fully paired, which is impractical in real applications because of missing samples and false data alignment, and 2) binary regression targets, including the label matrix and binary codes, are too rigid to effectively learn semantic-preserving hash codes and hash functions. To address these problems, this paper proposes an Adaptive Marginalized Semantic Hashing (AMSH) method, which not only enhances the discrimination of latent representations and hash codes through adaptive margins but can also be used for both paired and unpaired CMR. AMSH is a two-step method. In the first step, it generates semantic-aware, modality-specific latent representations with adaptively marginalized labels, which enlarges the distances between different classes, and exploits the labels to preserve the inter-modal and intra-modal semantic similarities in the latent representations and hash codes. In the second step, adaptive margin matrices are embedded into the hash codes to enlarge the gaps between positive and negative bits, which improves the discrimination and robustness of the hash functions. On this basis, AMSH generates similarity-preserving hash codes and robust hash functions without requiring strict one-to-one data correspondence. Experiments on several benchmark datasets demonstrate the superiority and flexibility of AMSH over several state-of-the-art CMR methods.
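The abstract attributes the appeal of cross-modal hashing to fast queries and compact storage. The sketch below illustrates only that generic retrieval step, not the AMSH training procedure: a query from one modality is hashed and ranked against database codes from another modality by Hamming distance. The function names, feature dimensions, and random projection matrices are assumptions made for illustration; in an actual method the hash functions would be learned.

```python
import numpy as np

def hash_codes(features, projection):
    """Binarize a linear projection of the features into {0, 1} codes."""
    return (features @ projection > 0).astype(np.uint8)

def hamming_rank(query_code, database_codes):
    """Rank database items by Hamming distance to a single query code."""
    distances = np.count_nonzero(database_codes != query_code, axis=1)
    order = np.argsort(distances, kind="stable")
    return order, distances

rng = np.random.default_rng(0)
n_bits = 16

# Hypothetical modality-specific hash functions (random here, learned in practice).
W_img = rng.standard_normal((128, n_bits))
W_txt = rng.standard_normal((32, n_bits))

# Assumed toy data: 100 image feature vectors and one text query.
# No image-text pairing is needed at query time.
image_db = rng.standard_normal((100, 128))
text_query = rng.standard_normal((1, 32))

db_codes = hash_codes(image_db, W_img)
q_code = hash_codes(text_query, W_txt)[0]
order, distances = hamming_rank(q_code, db_codes)
```

Because both modalities map into the same 16-bit code space, a text query can be compared directly with image codes; `order` lists the image indices from nearest to farthest.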

Citation

Pages: 9082-9095

Page count: 14

