Semantic Neighbor Graph Hashing for Multimodal Retrieval

被引:36
|
作者
Jin, Lu [1 ]
Li, Kai [2 ]
Hu, Hao [2 ]
Qi, Guo-Jun [2 ]
Tang, Jinhui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[2] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
基金
中国国家自然科学基金;
关键词
Multimodal hashing; multimedia retrieval; semantic supervision; local neighborhood structure; graph; fine-grained similarity metric; REPRESENTATION;
D O I
10.1109/TIP.2017.2776745
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hashing methods have been widely used for approximate nearest neighbor search in recent years due to its computational and storage effectiveness. Most existing multimodal hashing methods try to preserve the similarity relationship based on either metric distances or semantic labels in a procrustean way, while ignoring the intra-class and interclass variations inherent in the metric space. In this paper, we propose a novel multimodal hashing method, termed as semantic neighbor graph hashing (SNGH), which aims to preserve the fine-grained similarity metric based on the semantic graph that is constructed by jointly pursuing the semantic supervision and the local neighborhood structure. Specifically, the semantic graph is constructed to capture the local similarity structure for the image modality and the text modality, respectively. Furthermore, we define a function based on the local similarity in particular to adaptively calculate multi-level similarities by encoding the intra-class and inter-class variations. After obtaining the unified hash codes, the logistic regression with kernel trick is employed to learn view-specific hash functions independently for each modality. Extensive experiments are conducted on four widely used multimodal data sets. The experimental results demonstrate the superiority of the proposed SNGH method compared with the state-of-the-art multimodal hashing methods.
引用
收藏
页码:1405 / 1417
页数:13
相关论文
共 50 条
  • [41] Concept Preserving Hashing for Semantic Image Retrieval With Concept Drift
    Tian, Xing
    Ng, Wing W. Y.
    Wang, Hui
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (10) : 5184 - 5197
  • [42] Deep semantic preserving hashing for large scale image retrieval
    Masoumeh Zareapoor
    Jie Yang
    Deepak Kumar Jain
    Pourya Shamsolmoali
    Neha Jain
    Surya Kant
    Multimedia Tools and Applications, 2019, 78 : 23831 - 23846
  • [43] Discrete Semantic Alignment Hashing for Cross-Media Retrieval
    Yao, Tao
    Kong, Xiangwei
    Fu, Haiyan
    Tian, Qi
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (12) : 4896 - 4907
  • [44] Semantic-guided hashing learning for domain adaptive retrieval
    Zhang, Wei
    Yang, Xiaoqiong
    Teng, Shaohua
    Wu, NaiQi
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (03): : 1093 - 1112
  • [45] Deep semantic preserving hashing for large scale image retrieval
    Zareapoor, Masoumeh
    Yang, Jie
    Jain, Deepak Kumar
    Shamsolmoali, Pourya
    Jain, Neha
    Kant, Surya
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 23831 - 23846
  • [46] Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval
    Meng, Min
    Sun, Jiaxuan
    Liu, Jigang
    Yu, Jun
    Wu, Jigang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1914 - 1926
  • [47] Correlation embedding semantic-enhanced hashing for multimedia retrieval
    Big Data Institute, School of Computer Science and Engineering, Central South University, Hunan, ChangSha
    410000, China
    不详
    TN
    37235, United States
    Image Vision Comput, 1600, (February 2025):
  • [48] Deep Incremental Hashing for Semantic Image Retrieval With Concept Drift
    Tian, Xing
    Ng, Wing W. Y.
    Xu, Huihui
    IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (04) : 1102 - 1115
  • [49] Intelligent Indexing and Semantic Retrieval of Multimodal Documents
    Rohini K. Srihari
    Zhongfei Zhang
    Aibing Rao
    Information Retrieval, 2000, 2 (2-3): : 245 - 275
  • [50] A survey on multimodal video representation for semantic retrieval
    Calic, J
    Campbell, N
    Dasiopoulou, S
    Kompatsiaris, Y
    EUROCON 2005: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOL 1 AND 2 , PROCEEDINGS, 2005, : 135 - 138