Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval

被引:4
|
作者
Li, Fengling [1 ]
Wang, Bowen [2 ]
Zhu, Lei [3 ]
Li, Jingjing [4 ]
Zhang, Zheng [5 ]
Chang, Xiaojun [1 ]
机构
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia
[2] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[3] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[5] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Correlation; Training; Adaptation models; Codes; Circuits and systems; Optimization; Cross-modal hashing; cross-domain transfer; dual-pronged approach; weakly-supervised; ROBUST;
D O I
10.1109/TCSVT.2024.3374791
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unsupervised cross-modal hashing presents significant advantages in heterogeneous modality retrieval, offering label scalability, high retrieval efficiency, and low storage costs. However, the lack of explicit semantic supervision in this process results in a noticeable semantic deficit, impacting retrieval performance. In this paper, we address this challenge with a dual-pronged approach: Cross-Domain Transfer Hashing (CDTH), a lightweight weakly-supervised cross-modal hashing model. Our method leverages a semantically rich auxiliary domain to augment the target unsupervised cross-modal hash learning process. Simultaneously, we design a lightweight target cross-modal hashing network to reduce semantic requirements, lessening the burden of parameter optimization. Within the auxiliary domain, we perform direct semantic transfer with hashing network parameter transfer and indirect correlation semantic transfer by constructing an auxiliary semantic correlation graph with the identified cross-domain semantic consistent samples. In the target domain, we generate pseudo-labels using CLIP and establish a target weak semantic correlation graph. These two graphs collaborate to bolster the target cross-modal hashing training process. Extensive experiments on three publicly available datasets affirm the superiority of our approach in both retrieval accuracy and training efficiency. The source code for our method is accessible at: https://github.com/WangBowen7/CDTH.
引用
收藏
页码:9664 / 9677
页数:14
相关论文
共 50 条
  • [1] Cross-domain Cross-modal Food Transfer
    Zhu, Bin
    Ngo, Chong-Wah
    Chen, Jing-jing
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3762 - 3770
  • [2] Efficient Discriminative Hashing for Cross-Modal Retrieval
    Huang, Junfan
    Kang, Peipei
    Fang, Xiaozhao
    Han, Na
    Xie, Shengli
    Gao, Hongbo
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (06): : 3865 - 3878
  • [3] Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval
    Ma, Dekui
    Liang, Jian
    Kong, Xiangwei
    He, Ran
    Li, Ying
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 38 - 43
  • [4] Hashing for Cross-Modal Similarity Retrieval
    Liu, Yao
    Yuan, Yanhong
    Huang, Qiaoli
    Huang, Zhixing
    2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 1 - 8
  • [5] Semantic Boosting Cross-Modal Hashing for efficient multimedia retrieval
    Wang, Ke
    Tang, Jun
    Wang, Nian
    Shao, Ling
    INFORMATION SCIENCES, 2016, 330 : 199 - 210
  • [6] An efficient dual semantic preserving hashing for cross-modal retrieval
    Liu, Yun
    Ji, Shujuan
    Fu, Qiang
    Chiu, Dickson K. W.
    Gong, Maoguo
    NEUROCOMPUTING, 2022, 492 : 264 - 277
  • [7] Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation
    Zhao, Wentian
    Wu, Xinxiao
    Luo, Jiebo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1180 - 1192
  • [8] Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval
    Liu, Yang
    Chen, Qingchao
    Albanie, Samuel
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14949 - 14959
  • [9] Online weighted hashing for cross-modal retrieval
    Jiang, Zining
    Weng, Zhenyu
    Li, Runhao
    Zhuang, Huiping
    Lin, Zhiping
    PATTERN RECOGNITION, 2025, 161
  • [10] Random Online Hashing for Cross-Modal Retrieval
    Jiang, Kaihang
    Wong, Wai Keung
    Fang, Xiaozhao
    Li, Jiaxing
    Qin, Jianyang
    Xie, Shengli
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 677 - 691