Prototype-guided Knowledge Transfer for Federated Unsupervised Cross-modal Hashing

被引:4
|
作者
Li, Jingzhi [1 ]
Li, Fengling [2 ]
Zhu, Lei [1 ]
Cui, Hui [1 ]
Li, Jingjing [3 ]
机构
[1] Shandong Normal Univ, Jinan, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW, Australia
[3] Univ Elect Sci & Technol China, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Federated Learning; Unsupervised Learning; Cross-modal Retrieval; Unsupervised Cross-modal Hashing; Prototype Learning; NETWORK;
D O I
10.1145/3581783.3613837
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although deep cross-modal hashing methods have shown superiorities for cross-modal retrieval recently, there is a concern about potential data privacy leakage when training the models. Federated learning adopts a distributed machine learning strategy, which can collaboratively train models without leaking local private data. It is a promising technique to support privacy-preserving cross-modal hashing. However, existing federated learning-based cross-modal retrieval methods usually rely on a large number of semantic annotations, which limits the scalability of the retrieval models. Furthermore, they mostly update the global models by aggregating local model parameters, ignoring the differences in the quantity and category of multi-modal data from multiple clients. To address these issues, we propose a Prototype Transfer-based Federated Unsupervised Cross-modal Hashing (PT-FUCH) method for solving the privacy leakage problem in cross-modal retrieval model learning. PT-FUCH protects local private data by exploring unified global prototypes for different clients, without relying on any semantic annotations. Global prototypes are used to guide the local cross-modal hash learning and promote the alignment of the feature space, thereby alleviating the model bias caused by the difference in the distribution of local multi-modal data and improving the retrieval accuracy. Additionally, we design an adaptive cross-modal knowledge distillation to transfer valuable semantic knowledge from modal-specific global models to local prototype learning processes, reducing the risk of overfitting. Experimental results on three benchmark cross-modal retrieval datasets validate that our PT-FUCH method can achieve outstanding retrieval performance when trained under distributed privacy-preserving mode. The source codes of our method are available at https://github.com/exquisite1210/PT-FUCH_P.
引用
收藏
页码:1013 / 1022
页数:10
相关论文
共 50 条
  • [41] Flexible Cross-Modal Hashing
    Yu, Guoxian
    Liu, Xuanwu
    Wang, Jun
    Domeniconi, Carlotta
    Zhang, Xiangliang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 304 - 314
  • [42] Continuous cross-modal hashing
    Zheng, Hao
    Wang, Jinbao
    Zhen, Xiantong
    Song, Jingkuan
    Zheng, Feng
    Lu, Ke
    Qi, Guo-Jun
    [J]. PATTERN RECOGNITION, 2023, 142
  • [43] Deep Cross-Modal Hashing
    Jiang, Qing-Yuan
    Li, Wu-Jun
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3270 - 3278
  • [44] Cross-Modal Hamming Hashing
    Cao, Yue
    Liu, Bin
    Long, Mingsheng
    Wang, Jianmin
    [J]. COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 207 - 223
  • [45] Learning From Expert: Vision-Language Knowledge Distillation for Unsupervised Cross-Modal Hashing Retrieval
    Sun, Lina
    Li, Yewen
    Dong, Yumin
    [J]. PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 499 - 507
  • [46] Prototype-Guided Feature Learning for Unsupervised Domain Adaptation
    Du, Yongjie
    Zhou, Deyun
    Xie, Yu
    Lei, Yu
    Shi, Jiao
    [J]. PATTERN RECOGNITION, 2023, 135
  • [47] Semi-Supervised Knowledge Distillation for Cross-Modal Hashing
    Su, Mingyue
    Gu, Guanghua
    Ren, Xianlong
    Fu, Hao
    Zhao, Yao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 662 - 675
  • [48] Cross-Domain Transfer Hashing for Efficient Cross-modal Retrieval
    Li F.
    Wang B.
    Zhu L.
    Li J.
    Zhang Z.
    Chang X.
    [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (10) : 1 - 1
  • [49] Hierarchical modal interaction balance cross-modal hashing for unsupervised image-text retrieval
    Zhang, Jie
    Lin, Ziyong
    Jiang, Xiaolong
    Li, Mingyong
    Wang, Chao
    [J]. Multimedia Tools and Applications, 2024, 83 (42) : 90487 - 90509
  • [50] Multi-Grained Similarity Preserving and Updating for Unsupervised Cross-Modal Hashing
    Wu, Runbing
    Zhu, Xinghui
    Yi, Zeqian
    Zou, Zhuoyang
    Liu, Yi
    Zhu, Lei
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (02):