Graph Convolutional Network Discrete Hashing for Cross-Modal Retrieval

Cited by: 23

Authors:

Bai, Cong [1,2]

Zeng, Chao [1,2]

Ma, Qing [3]

Zhang, Jinglin [4]

Affiliations:

[1] Zhejiang Univ Technol, Coll Comp Sci, Hangzhou 310023, Peoples R China

[2] Key Lab Visual Media Intelligent Proc Technol Zhe, Hangzhou 310023, Peoples R China

[3] Zhejiang Univ Technol, Coll Sci, Hangzhou 310023, Peoples R China

[4] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024 / Vol. 35 / Issue 04

Keywords:

Feature extraction; Codes; Semantics; Convolutional codes; Training; Optimization; Hash functions; Cross-modal hashing; discrete optimization; graph convolutional network (GCN); multilabel similarity;

DOI:

10.1109/TNNLS.2022.3174970

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

With the rapid development of deep neural networks, cross-modal hashing has made great progress. However, the information carried by different types of data is asymmetrical: an image of sufficiently high resolution can reproduce almost 100% of a real-world scene, whereas text usually carries personal emotion and is not objective enough, so images are generally considered much richer in information than text. Although most existing methods unify the semantic feature extraction and hash function learning modules for end-to-end learning, they ignore this issue and do not use information-rich modalities to support information-poor modalities, leading to suboptimal results. Furthermore, previous methods learn hash functions in a relaxed way that causes nontrivial quantization losses. To address these issues, we propose a new method called graph convolutional network (GCN) discrete hashing. This method uses a GCN to bridge the information gap between different types of data. The GCN can represent each label as a word embedding, with the embeddings regarded as a set of interdependent object classifiers. From these classifiers, we can obtain predicted labels to enhance feature representations across modalities. In addition, we use an efficient discrete optimization strategy to learn the discrete binary codes without relaxation. Extensive experiments conducted on three commonly used datasets demonstrate that our proposed method, graph convolutional network-based discrete hashing (GCDH), outperforms the current state-of-the-art cross-modal hashing methods.
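
Illustrative note (not part of the published record): the abstract describes a GCN that maps label word embeddings to a set of interdependent classifiers, whose predicted labels then enhance the per-modality features. The following is a minimal NumPy sketch of that idea only. All names, dimensions, the random weights, and the two-layer design are assumptions made for illustration. The final sign step is a simple stand-in and is not the discrete optimization strategy the abstract refers to.

```python
import numpy as np

def normalize_adjacency(A):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} of a label graph."""
    A_hat = A + np.eye(A.shape[0])      # add self-loops so every degree is >= 1
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_label_classifiers(E, A, W1, W2):
    """Two-layer GCN (assumed depth): label embeddings (C x d_e) -> classifiers (C x d_f)."""
    A_norm = normalize_adjacency(A)
    H = np.maximum(A_norm @ E @ W1, 0.0)  # graph convolution followed by ReLU
    return A_norm @ H @ W2

def predict_labels(features, classifiers):
    """Score each sample against each label classifier, then apply a sigmoid."""
    logits = features @ classifiers.T     # (N x d_f) @ (d_f x C) -> (N x C)
    return 1.0 / (1.0 + np.exp(-logits))

def binarize(real_codes):
    """Placeholder binarization to {-1, +1}; NOT the paper's discrete optimization."""
    return np.where(real_codes >= 0, 1, -1)

# Hypothetical sizes: C labels, N samples, d_f feature dims, `bits` code length.
rng = np.random.default_rng(0)
C, d_e, d_h, d_f, N, bits = 4, 8, 16, 32, 5, 12

E = rng.normal(size=(C, d_e))                  # stand-in for label word embeddings
A = (rng.random((C, C)) > 0.5).astype(float)   # stand-in for label co-occurrence
A = np.maximum(A, A.T)                         # make the label graph symmetric
np.fill_diagonal(A, 0.0)

W1 = rng.normal(size=(d_e, d_h)) * 0.1         # untrained weights, illustration only
W2 = rng.normal(size=(d_h, d_f)) * 0.1
features = rng.normal(size=(N, d_f))           # stand-in for image or text features

classifiers = gcn_label_classifiers(E, A, W1, W2)
predicted = predict_labels(features, classifiers)
enhanced = np.concatenate([features, predicted], axis=1)  # label-enhanced features

P = rng.normal(size=(d_f + C, bits))           # hypothetical hash projection
codes = binarize(enhanced @ P)
```

Concatenating the predicted labels to the features is one simple way to realize "enhancement"; the record does not say how the published method combines them.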

Pages: 4756-4767

Number of pages: 12

50 items in total

[1] Graph Convolutional Network Hashing for Cross-Modal Retrieval
Xu, Ruiqing
Li, Chao
Yan, Junchi
Deng, Cheng
Liu, Xianglong
[J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 982 - 988
[2] Joint-Modal Graph Convolutional Hashing for unsupervised cross-modal retrieval
Meng, Hui
Zhang, Huaxiang
Liu, Li
Liu, Dongmei
Lu, Xu
Guo, Xinru
[J]. NEUROCOMPUTING, 2024, 595
[3] Proxy-Based Graph Convolutional Hashing for Cross-Modal Retrieval
Bai, Yibing
Shu, Zhenqiu
Yu, Jun
Yu, Zhengtao
Wu, Xiao-Jun
[J]. IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (04) : 371 - 385
[4] Graph Convolutional Multi-Label Hashing for Cross-Modal Retrieval
Shen, Xiaobo
Chen, Yinfan
Liu, Weiwei
Zheng, Yuhui
Sun, Quan-Sen
Pan, Shirui
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[5] Adversarial Graph Convolutional Network for Cross-Modal Retrieval
Dong, Xinfeng
Liu, Li
Zhu, Lei
Nie, Liqiang
Zhang, Huaxiang
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1634 - 1645
[6] Graph Convolutional Network Semantic Enhancement Hashing for Self-supervised Cross-Modal Retrieval
Hu, Jinyu
Li, Mingyong
Zhang, Jiayan
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 410 - 422
[7] Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval
Zhang, Peng-Fei
Li, Yang
Huang, Zi
Xu, Xin-Shun
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 466 - 479
[8] SEMI-SUPERVISED GRAPH CONVOLUTIONAL HASHING NETWORK FOR LARGE-SCALE CROSS-MODAL RETRIEVAL
Shen, Zhanjian
Zhai, Deming
Liu, Xianming
Jiang, Junjun
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2366 - 2370
[9] Local Graph Convolutional Networks for Cross-Modal Hashing
Chen, Yudong
Wang, Sen
Lu, Jianglin
Chen, Zhi
Zhang, Zheng
Huang, Zi
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1921 - 1928
[10] Collaborative Subspace Graph Hashing for Cross-modal Retrieval
Zhang, Xiang
Dong, Guohua
Du, Yimo
Wu, Chengkun
Luo, Zhigang
Yang, Canqun
[J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 213 - 221
