Semi-supervised cross-modal hashing with joint hyperboloid mapping

被引:0
|
作者
Fu, Hao [1 ,2 ]
Gu, Guanghua [1 ,2 ]
Dou, Yiyang [1 ,2 ]
Li, Zhuoyi [1 ,2 ]
Zhao, Yao [3 ]
机构
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao, Peoples R China
[2] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao, Hebei, Peoples R China
[3] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
关键词
Cross-modal retrieval; Semi-supervised hash learning; Diffusion model; Knowledge distillation; Quintet loss;
D O I
10.1016/j.knosys.2024.112547
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By using a small amount of label information to achieve favorable performance, semi-supervised methods are more practical in real-world application scenarios. However, existing semi-supervised cross-modal retrieval methods mainly focus on preserving similarities and learning more consistent hash codes yet overlook the importance of constructing a joint abstract space shared by multi-modal embeddings. In this paper, we propose a novel Semi-supervised Cross-modal Hashing with Joint Hyperboloid Mapping (SCH-JHM). Firstly, we present a diffusion-based teacher model in SCH-JHM to learn the generalized semantic knowledge and output the pseudolabels for unlabeled data. Secondly, SCH-JHM establishes a five-tuple plane, resembling an hourglass, for each retrieval task based on the queries, positive pairs, negative pairs, semi-supervised positive pairs, and semisupervised negative pairs included in the semi-supervised cross-modal retrieval task. Furthermore, it projects the 12 tasks from the image, text, video, and audio modalities into a joint hyperboloid space. Finally, the student model in SCH-JHM is employed to explore the latent semantic relevance between filtered heterogeneous entities, which can be considered as a supervised process. Comprehensive experiments compared with state-of-the-art methods on three widely used datasets verify the effectiveness of our proposed approach.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Enhancing Semi-Supervised Learning with Cross-Modal Knowledge
    Zhu, Hui
    Lu, Yongchun
    Wang, Hongbin
    Zhou, Xunyi
    Ma, Qin
    Liu, Yanhong
    Jiang, Ning
    Wei, Xin
    Zeng, Linchengxi
    Zhao, Xiaofang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4456 - 4465
  • [22] Supervised Hierarchical Cross-Modal Hashing
    Sun, Changchang
    Song, Xuemeng
    Feng, Fuli
    Zhao, Wayne Xin
    Zhang, Hao
    Nie, Liqiang
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 725 - 734
  • [23] Weakly Supervised Cross-Modal Hashing
    Liu, Xuanwu
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Xiao, Guoqiang
    Guo, Maozu
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (02) : 552 - 563
  • [24] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
    Fuhao Zou
    Xingqiang Bai
    Chaoyang Luan
    Kai Li
    Yunfei Wang
    Hefei Ling
    World Wide Web, 2019, 22 : 825 - 841
  • [25] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
    Zou, Fuhao
    Bai, Xingqiang
    Luan, Chaoyang
    Li, Kai
    Wang, Yunfei
    Ling, Hefei
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 825 - 841
  • [26] Deep Cross-Modal Supervised Hashing Based on Joint Semantic Matrix
    Chen, Na
    Cao, Yuan
    Liu, Chao
    NETWORK AND SYSTEM SECURITY, NSS 2021, 2021, 13041 : 258 - 274
  • [27] Semi-Supervised Cross-Modal Retrieval Based on Discriminative Comapping
    Liu, Li
    Dong, Xiao
    Wang, Tianshi
    COMPLEXITY, 2020, 2020
  • [28] Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval
    Zhang, Liang
    Ma, Bingpeng
    He, Jianfeng
    Li, Guorong
    Huang, Qingming
    Tian, Qi
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3406 - 3412
  • [29] Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval
    Yu, En
    Sun, Jiande
    Li, Jing
    Chang, Xiaojun
    Han, Xian-Hua
    Hauptmann, Alexander G.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (05) : 1276 - 1288
  • [30] LABEL PREDICTION FRAMEWORK FOR SEMI-SUPERVISED CROSS-MODAL RETRIEVAL
    Mandal, Devraj
    Rao, Pramod
    Biswas, Soma
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2311 - 2315