Semi-supervised cross-modal hashing with joint hyperboloid mapping

被引：0

作者：

Fu, Hao ^{[1
,2
]}

Gu, Guanghua ^{[1
,2
]}

Dou, Yiyang ^{[1
,2
]}

Li, Zhuoyi ^{[1
,2
]}

Zhao, Yao ^{[3
]}

机构：

[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao, Peoples R China

[2] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao, Hebei, Peoples R China

[3] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 304卷

关键词：

Cross-modal retrieval; Semi-supervised hash learning; Diffusion model; Knowledge distillation; Quintet loss;

D O I：

10.1016/j.knosys.2024.112547

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

By using a small amount of label information to achieve favorable performance, semi-supervised methods are more practical in real-world application scenarios. However, existing semi-supervised cross-modal retrieval methods mainly focus on preserving similarities and learning more consistent hash codes yet overlook the importance of constructing a joint abstract space shared by multi-modal embeddings. In this paper, we propose a novel Semi-supervised Cross-modal Hashing with Joint Hyperboloid Mapping (SCH-JHM). Firstly, we present a diffusion-based teacher model in SCH-JHM to learn the generalized semantic knowledge and output the pseudolabels for unlabeled data. Secondly, SCH-JHM establishes a five-tuple plane, resembling an hourglass, for each retrieval task based on the queries, positive pairs, negative pairs, semi-supervised positive pairs, and semisupervised negative pairs included in the semi-supervised cross-modal retrieval task. Furthermore, it projects the 12 tasks from the image, text, video, and audio modalities into a joint hyperboloid space. Finally, the student model in SCH-JHM is employed to explore the latent semantic relevance between filtered heterogeneous entities, which can be considered as a supervised process. Comprehensive experiments compared with state-of-the-art methods on three widely used datasets verify the effectiveness of our proposed approach.

引用

页数：15

共 50 条

[21] Enhancing Semi-Supervised Learning with Cross-Modal Knowledge
Zhu, Hui
Lu, Yongchun
Wang, Hongbin
Zhou, Xunyi
Ma, Qin
Liu, Yanhong
Jiang, Ning
Wei, Xin
Zeng, Linchengxi
Zhao, Xiaofang
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4456 - 4465
[22] Supervised Hierarchical Cross-Modal Hashing
Sun, Changchang
Song, Xuemeng
Feng, Fuli
Zhao, Wayne Xin
Zhang, Hao
Nie, Liqiang
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 725 - 734
[23] Weakly Supervised Cross-Modal Hashing
Liu, Xuanwu
Yu, Guoxian
Domeniconi, Carlotta
Wang, Jun
Xiao, Guoqiang
Guo, Maozu
IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (02) : 552 - 563
[24] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
Fuhao Zou
Xingqiang Bai
Chaoyang Luan
Kai Li
Yunfei Wang
Hefei Ling
World Wide Web, 2019, 22 : 825 - 841
[25] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
Zou, Fuhao
Bai, Xingqiang
Luan, Chaoyang
Li, Kai
Wang, Yunfei
Ling, Hefei
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 825 - 841
[26] Deep Cross-Modal Supervised Hashing Based on Joint Semantic Matrix
Chen, Na
Cao, Yuan
Liu, Chao
NETWORK AND SYSTEM SECURITY, NSS 2021, 2021, 13041 : 258 - 274
[27] Semi-Supervised Cross-Modal Retrieval Based on Discriminative Comapping
Liu, Li
Dong, Xiao
Wang, Tianshi
COMPLEXITY, 2020, 2020
[28] Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval
Zhang, Liang
Ma, Bingpeng
He, Jianfeng
Li, Guorong
Huang, Qingming
Tian, Qi
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3406 - 3412
[29] Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval
Yu, En
Sun, Jiande
Li, Jing
Chang, Xiaojun
Han, Xian-Hua
Hauptmann, Alexander G.
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (05) : 1276 - 1288
[30] LABEL PREDICTION FRAMEWORK FOR SEMI-SUPERVISED CROSS-MODAL RETRIEVAL
Mandal, Devraj
Rao, Pramod
Biswas, Soma
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2311 - 2315

← 1 2 3 4 5 →