Graph Convolutional Network Discrete Hashing for Cross-Modal Retrieval

Cited by: 23
Authors
Bai, Cong [1 ,2 ]
Zeng, Chao [1 ,2 ]
Ma, Qing [3 ]
Zhang, Jinglin [4 ]
Affiliations
[1] Zhejiang Univ Technol, College Comp Sci, Hangzhou 310023, Peoples R China
[2] Key Lab Visual Media Intelligent Proc Technol Zhe, Hangzhou 310023, Peoples R China
[3] Zhejiang Univ Technol, Coll Sci, Hangzhou 310023, Peoples R China
[4] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
Keywords
Feature extraction; Codes; Semantics; Convolutional codes; Training; Optimization; Hash functions; Cross-modal hashing; discrete optimization; graph convolutional network (GCN); multilabel similarity
DOI
10.1109/TNNLS.2022.3174970
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the rapid development of deep neural networks, cross-modal hashing has made great progress. However, the information carried by different types of data is asymmetric: an image of sufficiently high resolution can reproduce almost 100% of a real-world scene, whereas text usually carries personal emotion and is not fully objective, so images are generally considered much richer in information than text. Although most existing methods unify the semantic feature extraction and hash function learning modules for end-to-end learning, they ignore this asymmetry and do not use information-rich modalities to support information-poor ones, which leads to suboptimal results. Furthermore, previous methods learn hash functions in a relaxed way that causes nontrivial quantization losses. To address these issues, we propose a new method called graph convolutional network (GCN) discrete hashing. This method uses a GCN to bridge the information gap between different types of data. The GCN represents each label as a word embedding, and the resulting embeddings are treated as a set of interdependent object classifiers. From these classifiers, we obtain predicted labels that enhance feature representations across modalities. In addition, we use an efficient discrete optimization strategy to learn the discrete binary codes without relaxation. Extensive experiments on three commonly used datasets demonstrate that our proposed method, graph convolutional network-based discrete hashing (GCDH), outperforms current state-of-the-art cross-modal hashing methods.
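The two ideas in the abstract, label classifiers produced by a GCN and the quantization loss caused by relaxation, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the architecture, so the two-layer GCN, the symmetric adjacency normalization, the sigmoid scoring, and every name and dimension below (`label_gcn_classifiers`, `predict_labels`, `quantization_error`, `W1`, `W2`, the random label graph) are assumptions chosen for illustration only.

```python
import numpy as np

def normalize_adjacency(A):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} of a label graph (assumed form)."""
    A_hat = A + np.eye(A.shape[0])              # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def label_gcn_classifiers(E, A, W1, W2):
    """Hypothetical two-layer GCN: label embeddings E (c x d_e) -> classifiers (c x d_f)."""
    A_norm = normalize_adjacency(A)
    H = np.maximum(A_norm @ E @ W1, 0.0)        # first layer with ReLU
    return A_norm @ H @ W2                      # one classifier vector per label

def predict_labels(X, classifiers):
    """Sigmoid label scores (n x c) for features X (n x d_f)."""
    return 1.0 / (1.0 + np.exp(-(X @ classifiers.T)))

def quantization_error(U):
    """Mean squared gap between relaxed real-valued codes U and their sign codes."""
    B = np.where(U >= 0, 1.0, -1.0)
    return float(np.mean((U - B) ** 2))

# Toy data: all sizes and values are illustrative assumptions.
rng = np.random.default_rng(0)
c, d_e, d_h, d_f, n = 4, 8, 16, 32, 5
E = rng.normal(size=(c, d_e))                   # stand-in for label word embeddings
A = (rng.random((c, c)) > 0.5).astype(float)
A = np.maximum(A, A.T)                          # make the label graph undirected
np.fill_diagonal(A, 0.0)
W1 = 0.1 * rng.normal(size=(d_e, d_h))
W2 = 0.1 * rng.normal(size=(d_h, d_f))
X = rng.normal(size=(n, d_f))                   # stand-in for image or text features

classifiers = label_gcn_classifiers(E, A, W1, W2)
scores = predict_labels(X, classifiers)         # predicted label scores per sample
relaxed_gap = quantization_error(np.tanh(X))    # nonzero for relaxed codes
```

In this sketch the predicted label scores could be concatenated with or used to reweight the modality features, and `quantization_error` is zero only when the codes are already exactly ±1, which is the gap that a discrete optimization strategy avoids by construction.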
Pages: 4756-4767
Page count: 12