Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval

Cited by: 4

Authors:

Li, Fengling [1]

Wang, Bowen [2]

Zhu, Lei [3]

Li, Jingjing [4]

Zhang, Zheng [5]

Chang, Xiaojun [1]

Affiliations:

[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia

[2] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China

[3] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China

[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[5] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024, Vol. 34, No. 10

Funding:

National Natural Science Foundation of China

Keywords:

Semantics; Correlation; Training; Adaptation models; Codes; Circuits and systems; Optimization; Cross-modal hashing; cross-domain transfer; dual-pronged approach; weakly-supervised; ROBUST;

DOI:

10.1109/TCSVT.2024.3374791

Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Unsupervised cross-modal hashing presents significant advantages in heterogeneous modality retrieval, offering label scalability, high retrieval efficiency, and low storage costs. However, the lack of explicit semantic supervision in this process results in a noticeable semantic deficit, impacting retrieval performance. In this paper, we address this challenge with a dual-pronged approach: Cross-Domain Transfer Hashing (CDTH), a lightweight weakly-supervised cross-modal hashing model. Our method leverages a semantically rich auxiliary domain to augment the target unsupervised cross-modal hash learning process. Simultaneously, we design a lightweight target cross-modal hashing network to reduce semantic requirements, lessening the burden of parameter optimization. Within the auxiliary domain, we perform direct semantic transfer with hashing network parameter transfer and indirect correlation semantic transfer by constructing an auxiliary semantic correlation graph with the identified cross-domain semantic consistent samples. In the target domain, we generate pseudo-labels using CLIP and establish a target weak semantic correlation graph. These two graphs collaborate to bolster the target cross-modal hashing training process. Extensive experiments on three publicly available datasets affirm the superiority of our approach in both retrieval accuracy and training efficiency. The source code for our method is accessible at: https://github.com/WangBowen7/CDTH.
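The abstract mentions two ingredients that a small example can make concrete: a weak semantic correlation graph built from pseudo-labels, and retrieval over binary hash codes. The following is a minimal sketch under stated assumptions, not the authors' implementation (see the repository linked above for that). The function names, the cosine-similarity graph, the sign-based binarization, and all toy values are hypothetical choices made for illustration; the pseudo-labels are hard-coded here rather than produced by CLIP.

```python
import math

def cosine_graph(labels):
    """Pairwise cosine similarity between pseudo-label vectors (assumed graph form)."""
    unit = []
    for row in labels:
        norm = math.sqrt(sum(v * v for v in row)) or 1.0  # guard against all-zero rows
        unit.append([v / norm for v in row])
    return [[sum(a * b for a, b in zip(u, v)) for v in unit] for u in unit]

def to_hash_code(features):
    """Binarize real-valued network outputs into a {-1, +1} code."""
    return [1 if v >= 0 else -1 for v in features]

def hamming(a, b):
    """Number of positions where two codes differ."""
    return sum(x != y for x, y in zip(a, b))

# Toy pseudo-labels for three target samples over three concepts (hypothetical).
pseudo_labels = [[1, 0, 0], [1, 1, 0], [0, 0, 1]]
S = cosine_graph(pseudo_labels)

# Toy real-valued outputs of an image branch and a text branch (hypothetical).
image_outputs = [[0.9, -0.2, 0.4, -0.7],
                 [0.8, 0.1, 0.3, -0.5],
                 [-0.6, 0.5, -0.9, 0.2]]
text_output = [0.7, -0.1, 0.2, -0.3]

db_codes = [to_hash_code(f) for f in image_outputs]
query_code = to_hash_code(text_output)

# Text-to-image retrieval: rank database images by Hamming distance to the query.
dists = [hamming(query_code, code) for code in db_codes]
ranking = sorted(range(len(db_codes)), key=lambda i: dists[i])
print(dists, ranking)
```

In this toy setup, samples that share a pseudo-label get a positive entry in `S` and samples with disjoint labels get zero. A training objective could use such a graph as a similarity target for the codes, although how CDTH combines it with the auxiliary-domain graph is not specified in the abstract.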

Pages: 9664-9677

Page count: 14
