Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval

被引:18
|
作者
Luo, Kaiyi [1 ]
Zhang, Chao [1 ]
Li, Huaxiong [1 ]
Jia, Xiuyi [2 ]
Chen, Chunlin [1 ]
机构
[1] Nanjing Univ, Dept Control Sci & Intelligence Engn, Nanjing 210093, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210014, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; unpaired hashing; adaptive margins; CODES;
D O I
10.1109/TMM.2023.3245400
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, Cross-Modal Hashing (CMH) has attracted much attention due to its fast query speed and efficient storage. Previous studies have achieved promising results for Cross-Modal Retrieval (CMR) by discovering discriminative hash codes and modality-specific hash functions. Nonetheless, most existing CMR works are subjected to some restrictions: 1) It is assumed that data of different modalities are fully paired, which is impractical in real applications due to sample missing and false data alignment, and 2) binary regression targets including the label matrix and binary codes are too rigid to effectively learn semantic-preserving hash codes and hash functions. To address these problems, this paper proposes an Adaptive Marginalized Semantic Hashing (AMSH) method which not only enhances the discrimination of latent representations and hash codes by adaptive margins, but can also be used for both paired and unpaired CMR. As a two-step method, in the first step, AMSH generates semantic-aware modality-specific latent representations with adaptively marginalized labels, thereby enlarging the distances between different classes, and exploiting the labels to preserve the inter-modal and intra-modal semantic similarities into latent representations and hash codes. In the second step, adaptive margin matrices are embedded into the hash codes, and enlarge the gaps between positive and negative bits, which improves the discrimination and robustness of hash functions. On this basis, AMSH generates similarity-preserving hash codes and robust hash functions without the strict one-to-one data correspondence requirement. Experiments are conducted on several benchmark datasets to demonstrate the superiority and flexibility of AMSH over some state-of-the-art CMR methods.
引用
收藏
页码:9082 / 9095
页数:14
相关论文
共 50 条
  • [31] Efficient discrete latent semantic hashing for scalable cross-modal retrieval
    Lu, Xu
    Zhu, Lei
    Cheng, Zhiyong
    Song, Xuemeng
    Zhang, Huaxiang
    SIGNAL PROCESSING, 2019, 154 : 217 - 231
  • [32] Scalable semantic-enhanced supervised hashing for cross-modal retrieval
    Yang, Fan
    Ding, Xiaojian
    Liu, Yufeng
    Ma, Fumin
    Cao, Jie
    KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [33] Label-Based Deep Semantic Hashing for Cross-Modal Retrieval
    Weng, Weiwei
    Wu, Jiagao
    Yang, Lu
    Liu, Linfeng
    Hu, Bin
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 24 - 36
  • [34] UCMH: Unpaired cross-modal hashing with matrix factorization
    Gao, Jing
    Zhang, Wenjun
    Zhong, Fangming
    Chen, Zhikui
    NEUROCOMPUTING, 2020, 418 : 178 - 190
  • [35] Category correlations embedded semantic centers hashing for cross-modal retrieval
    Fan, Wentao
    Yang, Chenwen
    Luo, Kaiyi
    Zhang, Min
    Li, Huaxiong
    INFORMATION SCIENCES, 2024, 683
  • [36] MESH: A Flexible Manifold-Embedded Semantic Hashing for Cross-Modal Retrieval
    Zhong, Fangming
    Wang, Guangze
    Chen, Zhikui
    Xia, Feng
    IEEE ACCESS, 2020, 8 : 147569 - 147579
  • [37] Multilevel Deep Semantic Feature Asymmetric Network for Cross-Modal Hashing Retrieval
    Jiang, Xiaolong
    Fan, Jiabao
    Zhang, Jie
    Lin, Ziyong
    Li, Mingyong
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (08) : 621 - 631
  • [38] Label-Semantic-Enhanced Online Hashing for Efficient Cross-modal Retrieval
    Jiang, Xueting
    Liu, Xin
    Cheung, Yiu-ming
    Xu, Xing
    Zheng, Shukai
    Li, Taihao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 984 - 989
  • [39] Generalized Semantic Preserving Hashing for n-Label Cross-Modal Retrieval
    Mandal, Devraj
    Chaudhury, Kunal N.
    Biswas, Soma
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2633 - 2641
  • [40] Deep noise mitigation and semantic reconstruction hashing for unsupervised cross-modal retrieval
    Zhang, Cheng
    Wan, Yuan
    Qiang, Haopeng
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (10): : 5383 - 5397