Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval

Cited by: 14
Authors
Li, Mingyong [1 ,2 ]
Wang, Hongya [1 ,3 ]
Affiliations
[1] Donghua Univ, Coll Comp Sci & Technol, Shanghai, Peoples R China
[2] Chongqing Normal Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
[3] Shanghai Key Lab Comp Software Evaluating & Testing, Shanghai, Peoples R China
Keywords
cross-modal hashing; unsupervised learning; knowledge distillation; cross-modal retrieval;
DOI
10.1145/3460426.3463626
CLC Number
TP18 [Artificial Intelligence Theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Cross-modal hashing (CMH) maps heterogeneous multi-modal data into compact binary codes to achieve fast and flexible retrieval across different modalities, especially at large scale. Because it does not require extensive manual annotation, unsupervised cross-modal hashing has broader application prospects than supervised methods. However, existing unsupervised methods struggle to achieve satisfactory performance owing to the lack of reliable supervisory information. To address this problem, inspired by knowledge distillation, we propose a novel unsupervised Knowledge Distillation Cross-Modal Hashing method (KDCMH), which uses similarity information distilled by an unsupervised method to guide a supervised method. Specifically, the teacher model adopts an unsupervised distribution-based similarity hashing method to construct a modality-fusion similarity matrix. Then, supervised by the teacher model's distilled information, the student model generates more discriminative hash codes. Extensive experiments on two public datasets, NUS-WIDE and MIRFLICKR-25K, demonstrate that KDCMH achieves significant improvements over several representative unsupervised cross-modal hashing methods.
Pages: 183 - 191
Page count: 9
Related Papers
50 items in total
  • [31] Semantic-consistent cross-modal hashing for large-scale image retrieval
    Gu, Xuesong
    Dong, Guohua
    Zhang, Xiang
    Lan, Long
    Luo, Zhigang
    [J]. NEUROCOMPUTING, 2021, 433 : 181 - 198
  • [32] Joint and individual matrix factorization hashing for large-scale cross-modal retrieval
    Wang, Di
    Wang, Quan
    He, Lihuo
    Gao, Xinbo
    Tian, Yumin
    [J]. PATTERN RECOGNITION, 2020, 107
  • [33] Hashing for Cross-Modal Similarity Retrieval
    Liu, Yao
    Yuan, Yanhong
    Huang, Qiaoli
    Huang, Zhixing
    [J]. 2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 1 - 8
  • [34] Deep Multiscale Fusion Hashing for Cross-Modal Retrieval
    Nie, Xiushan
    Wang, Bowei
    Li, Jiajia
    Hao, Fanchang
    Jian, Muwei
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 401 - 410
  • [35] Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval
    Zhan, Yu-Wei
    Luo, Xin
    Wang, Yongxin
    Xu, Xin-Shun
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3386 - 3394
  • [36] Deep Hashing Similarity Learning for Cross-Modal Retrieval
    Ma, Ying
    Wang, Meng
    Lu, Guangyun
    Sun, Yajun
    [J]. IEEE ACCESS, 2024, 12 : 8609 - 8618
  • [37] CKDH: CLIP-Based Knowledge Distillation Hashing for Cross-Modal Retrieval
    Li, Jiaxing
    Wong, Wai Keung
    Jiang, Lin
    Fang, Xiaozhao
    Xie, Shengli
    Xu, Yong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6530 - 6541
  • [38] Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing
    Mikriukov, Georgii
    Ravanbakhsh, Mahdyar
    Demir, Begum
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4463 - 4467
  • [39] Pseudo-label driven deep hashing for unsupervised cross-modal retrieval
    Zeng, XianHua
    Xu, Ke
    Xie, YiCai
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3437 - 3456