Semi-supervised cross-modal hashing with joint hyperboloid mapping

被引：0

作者：

Fu, Hao ^{[1
,2
]}

Gu, Guanghua ^{[1
,2
]}

Dou, Yiyang ^{[1
,2
]}

Li, Zhuoyi ^{[1
,2
]}

Zhao, Yao ^{[3
]}

机构：

[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao, Peoples R China

[2] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao, Hebei, Peoples R China

[3] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 304卷

关键词：

Cross-modal retrieval; Semi-supervised hash learning; Diffusion model; Knowledge distillation; Quintet loss;

D O I：

10.1016/j.knosys.2024.112547

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

By using a small amount of label information to achieve favorable performance, semi-supervised methods are more practical in real-world application scenarios. However, existing semi-supervised cross-modal retrieval methods mainly focus on preserving similarities and learning more consistent hash codes yet overlook the importance of constructing a joint abstract space shared by multi-modal embeddings. In this paper, we propose a novel Semi-supervised Cross-modal Hashing with Joint Hyperboloid Mapping (SCH-JHM). Firstly, we present a diffusion-based teacher model in SCH-JHM to learn the generalized semantic knowledge and output the pseudolabels for unlabeled data. Secondly, SCH-JHM establishes a five-tuple plane, resembling an hourglass, for each retrieval task based on the queries, positive pairs, negative pairs, semi-supervised positive pairs, and semisupervised negative pairs included in the semi-supervised cross-modal retrieval task. Furthermore, it projects the 12 tasks from the image, text, video, and audio modalities into a joint hyperboloid space. Finally, the student model in SCH-JHM is employed to explore the latent semantic relevance between filtered heterogeneous entities, which can be considered as a supervised process. Comprehensive experiments compared with state-of-the-art methods on three widely used datasets verify the effectiveness of our proposed approach.

引用

页数：15

共 50 条

[31] SUPERVISED CROSS-MODAL HASHING WITHOUT RELAXATION
Huang, Hua-Junjie
Yang, Rui
Li, Chuan-Xiang
Shi, Yuliang
Guo, Shanqing
Xu, Xin-Shun
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1159 - 1164
[32] Semantic Consistency Cross-Modal Retrieval With Semi-Supervised Graph Regularization
Xu, Gongwen
Li, Xiaomei
Zhang, Zhijun
IEEE ACCESS, 2020, 8 : 14278 - 14288
[33] Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval
Zhang, Liang
Ma, Bingpeng
Li, Guorong
Huang, Qingming
Tian, Qi
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (01) : 128 - 141
[34] Semi-supervised cross-modal image generation with generative adversarial networks
Li, Dan
Du, Changde
He, Huiguang
PATTERN RECOGNITION, 2020, 100
[35] Semi-supervised constrained graph convolutional network for cross-modal retrieval
Zhang, Lei
Chen, Leiting
Ou, Weihua
Zhou, Chuan
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
[36] Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching
Liang, Jingjun
Li, Ruichen
Jin, Qin
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2852 - 2861
[37] Weakly Supervised Hashing with Reconstructive Cross-modal Attention
Du, Yongchao
Wang, Min
Lu, Zhenbo
Zhou, Wengang
Li, Houqiang
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
[38] Supervised Hierarchical Online Hashing for Cross-modal Retrieval
Han, Kai
Liu, Yu
Wei, Rukai
Zhou, Ke
Xu, Jinhui
Long, Kun
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
[39] Correlation Autoencoder Hashing for Supervised Cross-Modal Search
Cao, Yue
Long, Mingsheng
Wang, Jianmin
Zhu, Han
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 197 - 204
[40] Supervised Contrastive Discrete Hashing for cross-modal retrieval
Li, Ze
Yao, Tao
Wang, Lili
Li, Ying
Wang, Gang
KNOWLEDGE-BASED SYSTEMS, 2024, 295

← 1 2 3 4 5 →