Unsupervised Deep Cross-modality Spectral Hashing

被引:18
|
作者
Hoang, Tuan [1 ]
Do, Thanh-Toan [2 ]
Nguyen, Tam V. [3 ]
Cheung, Ngai-Man [1 ]
机构
[1] Singapore Univ Technol & Design SUTD, Informat Syst Technol & Design, Singapore 487372, Singapore
[2] Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, Merseyside, England
[3] Univ Dayton, Dept Comp Sci, Dayton, OH 45469 USA
基金
新加坡国家研究基金会;
关键词
Binary codes; Semantics; Optimization; Correlation; Sparse matrices; Task analysis; Training data; Cross-modal retrieval; spectral hashing; image search; constraint optimization; BINARY-CODES; QUANTIZATION; SIMILARITY;
D O I
10.1109/TIP.2020.3014727
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel framework, namely Deep Cross-modality Spectral Hashing (DCSH), to tackle the unsupervised learning problem of binary hash codes for efficient cross-modal retrieval. The framework is a two-step hashing approach which decouples the optimization into (1) binary optimization and (2) hashing function learning. In the first step, we propose a novel spectral embedding-based algorithm to simultaneously learn single-modality and binary cross-modality representations. While the former is capable of well preserving the local structure of each modality, the latter reveals the hidden patterns from all modalities. In the second step, to learn mapping functions from informative data inputs (images and word embeddings) to binary codes obtained from the first step, we leverage the powerful CNN for images and propose a CNN-based deep architecture to learn text modality. Quantitative evaluations on three standard benchmark datasets demonstrate that the proposed DCSH method consistently outperforms other state-of-the-art methods.
引用
收藏
页码:8391 / 8406
页数:16
相关论文
共 50 条
  • [1] Deep Unified Cross-Modality Hashing by Pairwise Data Alignment
    Wang, Yimu
    Xue, Bo
    Cheng, Quan
    Chen, Yuhui
    Zhang, Lijun
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1129 - 1135
  • [2] CROSS-MODALITY HASHING WITH PARTIAL CORRESPONDENCE
    Gu, Yun
    Xue, Haoyang
    Yang, Jie
    Shi, Pengfei
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1925 - 1929
  • [3] A Deep Cross-Modality Hashing Network for SAR and Optical Remote Sensing Images Retrieval
    Xiong, Wei
    Xiong, Zhenyu
    Zhang, Yang
    Cui, Yaqi
    Gu, Xiangqi
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 5284 - 5296
  • [4] Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval
    Liu, Li
    Lin, Zijia
    Shao, Ling
    Shen, Fumin
    Ding, Guiguang
    Han, Jungong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (01) : 107 - 118
  • [5] Unsupervised deep consistency learning adaptation network for cardiac cross-modality structural segmentation
    Li, Dapeng
    Peng, Yanjun
    Sun, Jindong
    Guo, Yanfei
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (10) : 2713 - 2732
  • [6] Unsupervised deep consistency learning adaptation network for cardiac cross-modality structural segmentation
    Dapeng Li
    Yanjun Peng
    Jindong Sun
    Yanfei Guo
    [J]. Medical & Biological Engineering & Computing, 2023, 61 : 2713 - 2732
  • [7] Dense Auto-Encoder Hashing for Robust Cross-Modality Retrieval
    Liu, Hong
    Lin, Mingbao
    Zhang, Shengchuan
    Wu, Yongjian
    Huang, Feiyue
    Ji, Rongrong
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1589 - 1597
  • [8] LOCAL CROSS-MODALITY IMAGE ALIGNMENT USING UNSUPERVISED LEARNING
    BERNANDER, O
    KOCH, C
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1990, 427 : 573 - 575
  • [9] Cross-Modality Binary Code Learning via Fusion Similarity Hashing
    Liu, Hong
    Ji, Rongrong
    Wu, Yongjian
    Huang, Feiyue
    Zhang, Baochang
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6345 - 6353
  • [10] Unsupervised Cross-Modal Hashing With Modality-Interaction
    Tu, Rong-Cheng
    Jiang, Jie
    Lin, Qinghong
    Cai, Chengfei
    Tian, Shangxuan
    Wang, Hongfa
    Liu, Wei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5296 - 5308