DEEP SEMANTIC ADVERSARIAL HASHING BASED ON AUTOENCODER FOR LARGE-SCALE CROSS-MODAL RETRIEVAL

被引:0
|
作者
Li, Mingyong [1 ,2 ]
Wang, Hongya [1 ]
机构
[1] Donghua Univ, Coll Comp Sci & Technol, Shanghai, Peoples R China
[2] Chongqing Normal Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; deep hashing; Adversarial autoencoder;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Thanks to the powerful feature learning capabilities of deep learning, some studies have introduced GANs into the cross-modal hashing. However, The GAN-based hashing methods are generally unstable and difficult to train in the process of adversarial learning. To address this problem, we propose a novel AutoEncoder Semantic Adversarial Hashing for cross-modal retrieval (AESAH). Specifically, under the guidance of semantic multi-label, two types of adversarial autoencoder networks (inter-modality and intra-modality) are adopted to maximize the semantic relevance and maintain the invariance of cross-modal. Under semantic supervised, the adversarial modules guide the feature learning process, thus the modal relationship in both the common feature space and the common hamming space is maintained. Furthermore, in order to preserve the inter-modal correlation of all similar item pairs is higher than those of dissimilar ones, we use an inter-modal invariance triplet loss and a classification prediction loss to maintain similarity.Comprehensive experiments were carried out on two commonly used cross-modal datasets, compared with several existing cross-modal retrieval methods, AESAH has better retrieval performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Deep supervised multimodal semantic autoencoder for cross-modal retrieval
    Tian, Yu
    Yang, Wenjing
    Liu, Qingsong
    Yang, Qiong
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
  • [22] Multi-attention based semantic deep hashing for cross-modal retrieval
    Zhu, Liping
    Tian, Gangyi
    Wang, Bingyao
    Wang, Wenjie
    Zhang, Di
    Li, Chengyang
    [J]. APPLIED INTELLIGENCE, 2021, 51 (08) : 5927 - 5939
  • [23] Deep Semantic Correlation Learning based Hashing for Multimedia Cross-Modal Retrieval
    Gong, Xiaolong
    Huang, Linpeng
    Wang, Fuwei
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 117 - 126
  • [24] Multi-attention based semantic deep hashing for cross-modal retrieval
    Liping Zhu
    Gangyi Tian
    Bingyao Wang
    Wenjie Wang
    Di Zhang
    Chengyang Li
    [J]. Applied Intelligence, 2021, 51 : 5927 - 5939
  • [25] Semantic deep cross-modal hashing
    Lin, Qiubin
    Cao, Wenming
    He, Zhihai
    He, Zhiquan
    [J]. NEUROCOMPUTING, 2020, 396 : 113 - 122
  • [26] Attention-Aware Deep Adversarial Hashing for Cross-Modal Retrieval
    Zhang, Xi
    Lai, Hanjiang
    Feng, Jiashi
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 614 - 629
  • [27] Deep Adversarial Cascaded Hashing for Cross-Modal Vessel Image Retrieval
    Guo, Jiaen
    Guan, Xin
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 2205 - 2220
  • [28] Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval
    Wang, Tianshi
    Zhu, Lei
    Zhang, Zheng
    Zhang, Huaxiang
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 6159 - 6172
  • [29] Adversarial Projection Learning Based Hashing for Cross-Modal Retrieval
    Zeng, Chao
    Bai, Cong
    Ma, Qing
    Chen, Shengyong
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (06): : 904 - 912
  • [30] FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval
    Liu, Xin
    Wang, Xingzhi
    Yiu-ming Cheung
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6306 - 6320