LEARNING A CROSS-MODAL HASHING NETWORK FOR MULTIMEDIA SEARCH

被引:0
|
作者
Liong, Venice Erin [1 ,3 ]
Lu, Jiwen [2 ]
Tan, Yap-Peng [3 ]
机构
[1] Nanyang Technol Univ, Interdisciplinary Grad Sch, Singapore, Singapore
[2] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
[3] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
关键词
hashing; cross-modal retrieval; binary code learning;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In this paper, we propose a cross-modal hashing network (CMHN) method to learn compact binary codes for cross modality multimedia search. Unlike most existing cross modal hashing methods which learn a single pair of projections to map each example into a binary vector, we design a deep neural network to learn multiple pairs of hierarchical non-linear transformations, under which the nonlinear characteristics of samples can be well exploited and the modality gap is well reduced. Our model is trained under an iterative optimization procedure which learns a (1) unified binary code discretely and discriminatively through a classification-based hinge-loss criterion, and (2) cross-modal hashing network, one deep network for each modality, through minimizing the quantization loss between real-valued neural code and binary code, and maximizing the variance of the learned neural codes. Experimental results on two benchmark datasets show the efficacy of the proposed approach.
引用
收藏
页码:3700 / 3704
页数:5
相关论文
共 50 条
  • [1] Kernelized Cross-Modal Hashing for Multimedia Retrieval
    Tan, Shoubiao
    Hu, Lingyu
    Wang-Xu, Anqi
    Tang, Jun
    Jia, Zhaohong
    [J]. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 1224 - 1228
  • [2] Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval
    Ma, Dekui
    Liang, Jian
    Kong, Xiangwei
    He, Ran
    Li, Ying
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 38 - 43
  • [3] Robust Unsupervised Cross-modal Hashing for Multimedia Retrieval
    Cheng, Miaomiao
    Jing, Liping
    Ng, Michael K.
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 38 (03)
  • [4] Index and Retrieve Multimedia Data: Cross-Modal Hashing by Learning Subspace Relation
    Liu, Luchen
    Yang, Yang
    Hu, Mengqiu
    Xu, Xing
    Shen, Fumin
    Xie, Ning
    Huang, Zi
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2018), PT II, 2018, 10828 : 606 - 621
  • [5] Deep Semantic Correlation Learning based Hashing for Multimedia Cross-Modal Retrieval
    Gong, Xiaolong
    Huang, Linpeng
    Wang, Fuwei
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 117 - 126
  • [6] Correlation Autoencoder Hashing for Supervised Cross-Modal Search
    Cao, Yue
    Long, Mingsheng
    Wang, Jianmin
    Zhu, Han
    [J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 197 - 204
  • [7] Semantic Boosting Cross-Modal Hashing for efficient multimedia retrieval
    Wang, Ke
    Tang, Jun
    Wang, Nian
    Shao, Ling
    [J]. INFORMATION SCIENCES, 2016, 330 : 199 - 210
  • [8] Quantized Correlation Hashing for Fast Cross-Modal Search
    Wu, Botong
    Yang, Qiang
    Zheng, Wei-Shi
    Wang, Yizhou
    Wang, Jingdong
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3946 - 3952
  • [9] Discrete Sparse Hashing for Cross-Modal Similarity Search
    Wang, Lu
    Ma, Chao
    Tu, Enmei
    Yang, Jie
    Kasabov, Nikola
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 256 - 267
  • [10] Discriminative Supervised Hashing for Cross-Modal Similarity Search
    Yu, Jun
    Wu, Xiao-Jun
    Kittler, Josef
    [J]. IMAGE AND VISION COMPUTING, 2019, 89 : 50 - 56