Efficient Discriminative Hashing for Cross-Modal Retrieval

被引:0
|
作者
Huang, Junfan [1 ,2 ]
Kang, Peipei [1 ,3 ]
Fang, Xiaozhao [4 ,5 ]
Han, Na [6 ]
Xie, Shengli [7 ,8 ]
Gao, Hongbo [9 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Guangdong Univ Technol, Guangdong Key Lab IoT Informat Technol, Guangzhou 510006, Peoples R China
[3] Guangdong Univ Technol, Ctr Intelligent Batch Mfg Based IoT Technol 111, Guangzhou 510006, Peoples R China
[4] Guangdong Univ Technol, Sch Automat, Minist Educ, Guangzhou 510006, Peoples R China
[5] Guangdong Univ Technol, Key Lab Intelligent Detect & Internet Things Mfg, Minist Educ, Guangzhou 510006, Peoples R China
[6] Guangdong Polytech Normal Univ, Sch Comp Sci, Guangzhou 510665, Peoples R China
[7] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[8] Guangdong Univ Technol, Guangdong Hong Kong Macao Joint Lab Smart Discrete, Guangzhou 510006, Peoples R China
[9] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; discrete optimization; hashing; information complementarity; joint learning strategy; ROBUST;
D O I
10.1109/TSMC.2024.3373612
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hashing techniques have been extensively studied in cross-modal retrieval due to their advantages in high computational efficiency and low storage cost. However, existing methods unconsciously ignore the complementary information of multimodal data, thus failing to consider learning discriminative hash codes from the perspective of information complementarity while often involving time-consuming training overhead. To tackle the above issues, we propose an efficient discriminative hashing (EDH) with information complementarity consideration. Specifically, we reckon that multimodal features and their corresponding semantic labels describe heterogeneous data viewed from low-and high-level structures, which owns complementarity. To this end, low-level latent representation and high-level semantics representation are simply derived. Then, a joint learning strategy is formulated to simultaneously exploit the above two representations for generating discriminative hash codes, which is quite computationally efficient. Besides, EDH decomposes hash learning into two steps. To obtain powerful hash functions which are conductive to retrieval, a regularization term considering pairwise semantic similarity is introduced into hash functions learning. In addition, an efficient optimization algorithm is designed to solve the optimization problem in EDH. Extensive experiments conducted on benchmark datasets demonstrate the superiority of our EDH in terms of retrieval performance and training efficiency. The source code is available at https://github.com/hjf-hjf/EDH.
引用
收藏
页码:3865 / 3878
页数:14
相关论文
共 50 条
  • [1] Discriminative correlation hashing for supervised cross-modal retrieval
    Lu, Xu
    Zhang, Huaxiang
    Sun, Jiande
    Wang, Zhenhua
    Guo, Peilian
    Wan, Wenbo
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 65 : 221 - 230
  • [2] Equally-Guided Discriminative Hashing for Cross-modal Retrieval
    Shi, Yufeng
    You, Xinge
    Zheng, Feng
    Wang, Shuo
    Peng, Qinmu
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4767 - 4773
  • [3] Discriminative deep asymmetric supervised hashing for cross-modal retrieval
    Qiang, Haopeng
    Wan, Yuan
    Liu, Ziyi
    Xiang, Lun
    Meng, Xiaojing
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 204
  • [4] Discriminative deep asymmetric supervised hashing for cross-modal retrieval
    Qiang, Haopeng
    Wan, Yuan
    Liu, Ziyi
    Xiang, Lun
    Meng, Xiaojing
    [J]. Knowledge-Based Systems, 2022, 204
  • [5] Online Discriminative Cross-Modal Hashing
    Kang, Xiao
    Liu, Xingbo
    Zhang, Xuening
    Nie, Xiushan
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5242 - 5254
  • [6] Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval
    Ma, Dekui
    Liang, Jian
    Kong, Xiangwei
    He, Ran
    Li, Ying
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 38 - 43
  • [7] Discriminative Latent Semantic Regression for Cross-Modal Hashing of Multimedia Retrieval
    Wan, Jianwu
    Wang, Yi
    [J]. 2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [8] Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval
    Li, Fengling
    Wang, Bowen
    Zhu, Lei
    Li, Jingjing
    Zhang, Zheng
    Chang, Xiaojun
    [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (10) : 9664 - 9677
  • [9] Semantic Boosting Cross-Modal Hashing for efficient multimedia retrieval
    Wang, Ke
    Tang, Jun
    Wang, Nian
    Shao, Ling
    [J]. INFORMATION SCIENCES, 2016, 330 : 199 - 210
  • [10] Hashing for Cross-Modal Similarity Retrieval
    Liu, Yao
    Yuan, Yanhong
    Huang, Qiaoli
    Huang, Zhixing
    [J]. 2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 1 - 8