Efficient Discriminative Hashing for Cross-Modal Retrieval

Cited by: 0

Authors:

Huang, Junfan [1,2]

Kang, Peipei [1,3]

Fang, Xiaozhao [4,5]

Han, Na [6]

Xie, Shengli [7,8]

Gao, Hongbo [9]

Affiliations:

[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China

[2] Guangdong Univ Technol, Guangdong Key Lab IoT Informat Technol, Guangzhou 510006, Peoples R China

[3] Guangdong Univ Technol, Ctr Intelligent Batch Mfg Based IoT Technol 111, Guangzhou 510006, Peoples R China

[4] Guangdong Univ Technol, Sch Automat, Minist Educ, Guangzhou 510006, Peoples R China

[5] Guangdong Univ Technol, Key Lab Intelligent Detect & Internet Things Mfg, Minist Educ, Guangzhou 510006, Peoples R China

[6] Guangdong Polytech Normal Univ, Sch Comp Sci, Guangzhou 510665, Peoples R China

[7] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China

[8] Guangdong Univ Technol, Guangdong Hong Kong Macao Joint Lab Smart Discrete, Guangzhou 510006, Peoples R China

[9] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei 230027, Peoples R China

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024 / Vol. 54 / Issue 06

Funding:

National Natural Science Foundation of China;

Keywords:

Cross-modal retrieval; discrete optimization; hashing; information complementarity; joint learning strategy; ROBUST;

DOI:

10.1109/TSMC.2024.3373612

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Hashing techniques have been extensively studied in cross-modal retrieval because of their high computational efficiency and low storage cost. However, existing methods overlook the complementary information in multimodal data: they do not learn discriminative hash codes from the perspective of information complementarity, and they often incur time-consuming training. To tackle these issues, we propose efficient discriminative hashing (EDH), which takes information complementarity into account. Specifically, we argue that multimodal features and their corresponding semantic labels describe heterogeneous data from low-level and high-level structures, respectively, and are therefore complementary. To this end, a low-level latent representation and a high-level semantic representation are simply derived. A joint learning strategy is then formulated to exploit both representations simultaneously when generating discriminative hash codes, which is computationally efficient. In addition, EDH decomposes hash learning into two steps. To obtain powerful hash functions that are conducive to retrieval, a regularization term based on pairwise semantic similarity is introduced into hash function learning. An efficient optimization algorithm is also designed to solve the optimization problem in EDH. Extensive experiments on benchmark datasets demonstrate the superiority of EDH in retrieval performance and training efficiency. The source code is available at https://github.com/hjf-hjf/EDH.


Pages: 3865-3878

Page count: 14


