Semi-supervised cross-modal hashing with joint hyperboloid mapping

被引:0
|
作者
Fu, Hao [1 ,2 ]
Gu, Guanghua [1 ,2 ]
Dou, Yiyang [1 ,2 ]
Li, Zhuoyi [1 ,2 ]
Zhao, Yao [3 ]
机构
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao, Peoples R China
[2] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao, Hebei, Peoples R China
[3] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
关键词
Cross-modal retrieval; Semi-supervised hash learning; Diffusion model; Knowledge distillation; Quintet loss;
D O I
10.1016/j.knosys.2024.112547
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By using a small amount of label information to achieve favorable performance, semi-supervised methods are more practical in real-world application scenarios. However, existing semi-supervised cross-modal retrieval methods mainly focus on preserving similarities and learning more consistent hash codes yet overlook the importance of constructing a joint abstract space shared by multi-modal embeddings. In this paper, we propose a novel Semi-supervised Cross-modal Hashing with Joint Hyperboloid Mapping (SCH-JHM). Firstly, we present a diffusion-based teacher model in SCH-JHM to learn the generalized semantic knowledge and output the pseudolabels for unlabeled data. Secondly, SCH-JHM establishes a five-tuple plane, resembling an hourglass, for each retrieval task based on the queries, positive pairs, negative pairs, semi-supervised positive pairs, and semisupervised negative pairs included in the semi-supervised cross-modal retrieval task. Furthermore, it projects the 12 tasks from the image, text, video, and audio modalities into a joint hyperboloid space. Finally, the student model in SCH-JHM is employed to explore the latent semantic relevance between filtered heterogeneous entities, which can be considered as a supervised process. Comprehensive experiments compared with state-of-the-art methods on three widely used datasets verify the effectiveness of our proposed approach.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] SUPERVISED CROSS-MODAL HASHING WITHOUT RELAXATION
    Huang, Hua-Junjie
    Yang, Rui
    Li, Chuan-Xiang
    Shi, Yuliang
    Guo, Shanqing
    Xu, Xin-Shun
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1159 - 1164
  • [32] Semantic Consistency Cross-Modal Retrieval With Semi-Supervised Graph Regularization
    Xu, Gongwen
    Li, Xiaomei
    Zhang, Zhijun
    IEEE ACCESS, 2020, 8 : 14278 - 14288
  • [33] Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval
    Zhang, Liang
    Ma, Bingpeng
    Li, Guorong
    Huang, Qingming
    Tian, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (01) : 128 - 141
  • [34] Semi-supervised cross-modal image generation with generative adversarial networks
    Li, Dan
    Du, Changde
    He, Huiguang
    PATTERN RECOGNITION, 2020, 100
  • [35] Semi-supervised constrained graph convolutional network for cross-modal retrieval
    Zhang, Lei
    Chen, Leiting
    Ou, Weihua
    Zhou, Chuan
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [36] Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching
    Liang, Jingjun
    Li, Ruichen
    Jin, Qin
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2852 - 2861
  • [37] Weakly Supervised Hashing with Reconstructive Cross-modal Attention
    Du, Yongchao
    Wang, Min
    Lu, Zhenbo
    Zhou, Wengang
    Li, Houqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [38] Supervised Hierarchical Online Hashing for Cross-modal Retrieval
    Han, Kai
    Liu, Yu
    Wei, Rukai
    Zhou, Ke
    Xu, Jinhui
    Long, Kun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [39] Correlation Autoencoder Hashing for Supervised Cross-Modal Search
    Cao, Yue
    Long, Mingsheng
    Wang, Jianmin
    Zhu, Han
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 197 - 204
  • [40] Supervised Contrastive Discrete Hashing for cross-modal retrieval
    Li, Ze
    Yao, Tao
    Wang, Lili
    Li, Ying
    Wang, Gang
    KNOWLEDGE-BASED SYSTEMS, 2024, 295