DEEP SEMANTIC ADVERSARIAL HASHING BASED ON AUTOENCODER FOR LARGE-SCALE CROSS-MODAL RETRIEVAL

被引：0

作者：

Li, Mingyong ^{[1
,2
]}

Wang, Hongya ^{[1
]}

机构：

[1] Donghua Univ, Coll Comp Sci & Technol, Shanghai, Peoples R China

[2] Chongqing Normal Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW) | 2020年

基金：

中国国家自然科学基金;

关键词：

Cross-modal retrieval; deep hashing; Adversarial autoencoder;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Thanks to the powerful feature learning capabilities of deep learning, some studies have introduced GANs into the cross-modal hashing. However, The GAN-based hashing methods are generally unstable and difficult to train in the process of adversarial learning. To address this problem, we propose a novel AutoEncoder Semantic Adversarial Hashing for cross-modal retrieval (AESAH). Specifically, under the guidance of semantic multi-label, two types of adversarial autoencoder networks (inter-modality and intra-modality) are adopted to maximize the semantic relevance and maintain the invariance of cross-modal. Under semantic supervised, the adversarial modules guide the feature learning process, thus the modal relationship in both the common feature space and the common hamming space is maintained. Furthermore, in order to preserve the inter-modal correlation of all similar item pairs is higher than those of dissimilar ones, we use an inter-modal invariance triplet loss and a classification prediction loss to maintain similarity.Comprehensive experiments were carried out on two commonly used cross-modal datasets, compared with several existing cross-modal retrieval methods, AESAH has better retrieval performance.

引用

页数：6

共 50 条

[21] Deep supervised multimodal semantic autoencoder for cross-modal retrieval
Tian, Yu
Yang, Wenjing
Liu, Qingsong
Yang, Qiong
[J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
[22] Multi-attention based semantic deep hashing for cross-modal retrieval
Zhu, Liping
Tian, Gangyi
Wang, Bingyao
Wang, Wenjie
Zhang, Di
Li, Chengyang
[J]. APPLIED INTELLIGENCE, 2021, 51 (08) : 5927 - 5939
[23] Deep Semantic Correlation Learning based Hashing for Multimedia Cross-Modal Retrieval
Gong, Xiaolong
Huang, Linpeng
Wang, Fuwei
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 117 - 126
[24] Multi-attention based semantic deep hashing for cross-modal retrieval
Liping Zhu
Gangyi Tian
Bingyao Wang
Wenjie Wang
Di Zhang
Chengyang Li
[J]. Applied Intelligence, 2021, 51 : 5927 - 5939
[25] Semantic deep cross-modal hashing
Lin, Qiubin
Cao, Wenming
He, Zhihai
He, Zhiquan
[J]. NEUROCOMPUTING, 2020, 396 : 113 - 122
[26] Attention-Aware Deep Adversarial Hashing for Cross-Modal Retrieval
Zhang, Xi
Lai, Hanjiang
Feng, Jiashi
[J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 614 - 629
[27] Deep Adversarial Cascaded Hashing for Cross-Modal Vessel Image Retrieval
Guo, Jiaen
Guan, Xin
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 2205 - 2220
[28] Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval
Wang, Tianshi
Zhu, Lei
Zhang, Zheng
Zhang, Huaxiang
Han, Junwei
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 6159 - 6172
[29] Adversarial Projection Learning Based Hashing for Cross-Modal Retrieval
Zeng, Chao
Bai, Cong
Ma, Qing
Chen, Shengyong
[J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (06): : 904 - 912
[30] FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval
Liu, Xin
Wang, Xingzhi
Yiu-ming Cheung
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6306 - 6320

← 1 2 3 4 5 →