Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

被引:69
|
作者
Wang, Di [1 ]
Gao, Xinbo [1 ]
Wang, Xiumei [1 ]
He, Lihuo [1 ]
Yuan, Bo [1 ]
机构
[1] Xidian Univ, Video & Image Proc Syst Lab, Sch Elect Engn, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimodal; binary code; hashing; cross-modal retrieval; discriminant analysis;
D O I
10.1109/TIP.2016.2592800
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal hashing, which conducts effective and efficient nearest neighbor search across heterogeneous data on large-scale multimedia databases, has been attracting increasing interest, given the explosive growth of multimedia content on the Internet. Recent multimodal hashing research mainly aims at learning the compact binary codes to preserve semantic information given by labels. The overwhelming majority of these methods are similarity preserving approaches which approximate pairwise similarity matrix with Hamming distances between the to-be-learnt binary hash codes. However, these methods ignore the discriminative property in hash learning process, which results in hash codes from different classes undistinguished, and therefore reduces the accuracy and robustness for the nearest neighbor search. To this end, we present a novel multimodal hashing method, named multimodal discriminative binary embedding (MDBE), which focuses on learning discriminative hash codes. First, the proposed method formulates the hash function learning in terms of classification, where the binary codes generated by the learned hash functions are expected to be discriminative. And then, it exploits the label information to discover the shared structures inside heterogeneous data. Finally, the learned structures are preserved for hash codes to produce similar binary codes in the same class. Hence, the proposed MDBE can preserve both discriminability and similarity for hash codes, and will enhance retrieval accuracy. Thorough experiments on benchmark data sets demonstrate that the proposed method achieves excellent accuracy and competitive computational efficiency compared with the state-of-the-art methods for large-scale cross-modal retrieval task.
引用
收藏
页码:4540 / 4554
页数:15
相关论文
共 50 条
  • [1] Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    Shen, Heng Tao
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (05) : 2494 - 2507
  • [2] SCALABLE DISCRIMINATIVE DISCRETE HASHING FOR LARGE-SCALE CROSS-MODAL RETRIEVAL
    Qin, Jianyang
    Fei, Lunke
    Zhu, Jian
    Wen, Jie
    Tian, Chunwei
    Wu, Shuai
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4330 - 4334
  • [3] FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval
    Liu, Xin
    Wang, Xingzhi
    Yiu-ming Cheung
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6306 - 6320
  • [4] MCCN: Multimodal Coordinated Clustering Network for Large-Scale Cross-modal Retrieval
    Zeng, Zhixiong
    Sun, Ying
    Mao, Wenji
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5427 - 5435
  • [5] Binary Set Embedding for Cross-Modal Retrieval
    Yu, Mengyang
    Liu, Li
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 2899 - 2910
  • [6] Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval
    Li, Mingyong
    Wang, Hongya
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 183 - 191
  • [7] Semantic-enhanced discriminative embedding learning for cross-modal retrieval
    Pan, Hao
    Huang, Jun
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 369 - 382
  • [8] Semantic-enhanced discriminative embedding learning for cross-modal retrieval
    Hao Pan
    Jun Huang
    [J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 369 - 382
  • [9] Online Adaptive Supervised Hashing for Large-Scale Cross-Modal Retrieval
    Su, Ruoqi
    Wang, Di
    Huang, Zhen
    Liu, Yuan
    An, Yaqiang
    [J]. IEEE ACCESS, 2020, 8 : 206360 - 206370
  • [10] Efficient discrete supervised hashing for large-scale cross-modal retrieval
    Yao, Tao
    Han, Yaru
    Wang, Ruxin
    Kong, Xiangwei
    Yan, Lianshan
    Fu, Haiyan
    Tian, Qi
    [J]. NEUROCOMPUTING, 2020, 385 : 358 - 367