Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval

被引:19
|
作者
Wang, Di [1 ,2 ]
Zhang, Caiping [3 ]
Wang, Quan [3 ]
Tian, Yumin [3 ]
He, Lihuo [4 ]
Zhao, Lin [2 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China
[2] Nanjing Univ Sci & Technol, Jiangsu Key Lab Image & Video Understanding Social, Nanjing 210093, Jiangsu, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Key Labratory Smart Human Comp Interact & Wearable, Xian 710071, Peoples R China
[4] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Codes; Binary codes; Representation learning; Correlation; Hash functions; Feature extraction; Cross-modal retrieval; deep hashing; semantic preserving; hierarchical learning;
D O I
10.1109/TMM.2022.3140656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-modal hashing has become a vital technique in cross-modal retrieval due to its fast query speed and low storage cost in recent years. Generally, most of the priors supervised cross-modal hashing methods are flat methods which are designed for non-hierarchical labeled data. They treat different categories independently and ignore the inter-category correlations. In practical applications, many instances are labeled with hierarchical categories. The hierarchical label structure provides rich information among different categories. To rationally take use of category correlations, hierarchical cross-modal hashing is proposed. However, existing methods intend to preserve instance-pairwise or class-pairwise similarities, which cannot fully explore the semantic correlations among different categories and make the learned hash codes less discriminative. In this paper, we propose a deep cross-modal hashing method named hierarchical semantic structure preserving hashing (HSSPH), which directly exploits the label hierarchy information to learn discriminative hash codes. Specifically, HSSPH learns a set of class-wise hash codes for each layer. By augmenting class-wise codes with labels, it generates layer-wise prototype codes which reflect the semantic structure of each layer. In order to enhance the discriminative ability of hash codes, HSSPH supervises the hash codes learning with both labels and semantic structures to preserve the hierarchical semantics. Besides, efficient optimization algorithms are developed to directly learn the discrete hash codes for each instance and each class. Extensive experiments on two benchmark datasets show the superiority of HSSPH over several state-of-the-art methods.
引用
收藏
页码:1217 / 1229
页数:13
相关论文
共 50 条
  • [1] Generalized Semantic Preserving Hashing for Cross-Modal Retrieval
    Mandal, Devraj
    Chaudhury, Kunal N.
    Biswas, Soma
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 102 - 112
  • [2] Joint Semantic Preserving Sparse Hashing for Cross-Modal Retrieval
    Hu, Zhikai
    Cheung, Yiu-Ming
    Li, Mengke
    Lan, Weichao
    Zhang, Donglin
    Liu, Qiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2989 - 3002
  • [3] Semantic preserving asymmetric discrete hashing for cross-modal retrieval
    Fan Yang
    Qiao-xi Zhang
    Xiao-jian Ding
    Fu-min Ma
    Jie Cao
    De-yu Tong
    Applied Intelligence, 2023, 53 : 15352 - 15371
  • [4] Semantic preserving asymmetric discrete hashing for cross-modal retrieval
    Yang, Fan
    Zhang, Qiao-xi
    Ding, Xiao-jian
    Ma, Fu-min
    Cao, Jie
    Tong, De-yu
    APPLIED INTELLIGENCE, 2023, 53 (12) : 15352 - 15371
  • [5] An efficient dual semantic preserving hashing for cross-modal retrieval
    Liu, Yun
    Ji, Shujuan
    Fu, Qiang
    Chiu, Dickson K. W.
    Gong, Maoguo
    NEUROCOMPUTING, 2022, 492 : 264 - 277
  • [6] Semantic ranking structure preserving for cross-modal retrieval
    Liu, Hui
    Feng, Yong
    Zhou, Mingliang
    Qiang, Baohua
    APPLIED INTELLIGENCE, 2021, 51 (03) : 1802 - 1812
  • [7] Semantic ranking structure preserving for cross-modal retrieval
    Hui Liu
    Yong Feng
    Mingliang Zhou
    Baohua Qiang
    Applied Intelligence, 2021, 51 : 1802 - 1812
  • [8] Semantic consistency hashing for cross-modal retrieval
    Yao, Tao
    Kong, Xiangwei
    Fu, Haiyan
    Tian, Qi
    NEUROCOMPUTING, 2016, 193 : 250 - 259
  • [9] Hierarchical Consensus Hashing for Cross-Modal Retrieval
    Sun, Yuan
    Ren, Zhenwen
    Hu, Peng
    Peng, Dezhong
    Wang, Xu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 824 - 836
  • [10] Generalized Semantic Preserving Hashing for n-Label Cross-Modal Retrieval
    Mandal, Devraj
    Chaudhury, Kunal N.
    Biswas, Soma
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2633 - 2641