DEEP SEMANTIC ADVERSARIAL HASHING BASED ON AUTOENCODER FOR LARGE-SCALE CROSS-MODAL RETRIEVAL

Citations: 0
Authors
Li, Mingyong [1 ,2 ]
Wang, Hongya [1 ]
Affiliations
[1] Donghua Univ, Coll Comp Sci & Technol, Shanghai, Peoples R China
[2] Chongqing Normal Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cross-modal retrieval; Deep hashing; Adversarial autoencoder;
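DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract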
Thanks to the powerful feature learning capabilities of deep learning, some studies have introduced GANs into cross-modal hashing. However, GAN-based hashing methods are generally unstable and difficult to train during adversarial learning. To address this problem, we propose a novel AutoEncoder Semantic Adversarial Hashing method for cross-modal retrieval (AESAH). Specifically, under the guidance of semantic multi-labels, two types of adversarial autoencoder networks (inter-modality and intra-modality) are adopted to maximize semantic relevance and maintain cross-modal invariance. Under semantic supervision, the adversarial modules guide the feature learning process, so that the modal relationship is maintained in both the common feature space and the common Hamming space. Furthermore, to ensure that the inter-modal correlation of similar item pairs is higher than that of dissimilar ones, we use an inter-modal invariance triplet loss and a classification prediction loss to maintain similarity. Comprehensive experiments were carried out on two commonly used cross-modal datasets; compared with several existing cross-modal retrieval methods, AESAH achieves better retrieval performance.
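The abstract does not give the form of the triplet loss or the hashing function, so the following is only a minimal sketch of the general idea, under stated assumptions: codes are binarized with the sign function, similarity in Hamming space is measured by counting differing bits, and the triplet term is a standard hinge on squared Euclidean distances between relaxed (real-valued) codes. The function names, the margin value, and the toy vectors are all hypothetical and are not taken from the paper.

```python
import numpy as np

def binarize(features):
    """Map real-valued features to {-1, +1} hash codes (assumed sign binarization)."""
    return np.where(features >= 0, 1, -1)

def hamming_distance(a, b):
    """Number of positions where two equal-length {-1, +1} codes differ."""
    return int(np.sum(a != b))

def triplet_loss(anchor, positive, negative, margin=2.0):
    """Generic hinge triplet loss on squared Euclidean distances.

    Zero once the negative is at least `margin` farther from the anchor
    than the positive is. This is a textbook form, not the paper's loss.
    """
    d_pos = np.sum((anchor - positive) ** 2)
    d_neg = np.sum((anchor - negative) ** 2)
    return float(max(0.0, margin + d_pos - d_neg))

# Hypothetical relaxed codes: an image, a text sharing its labels, and an unrelated text.
image_anchor = np.array([0.9, -0.8, 0.7, -0.6])
text_similar = np.array([0.8, -0.7, 0.6, -0.9])
text_dissimilar = np.array([-0.9, 0.8, 0.7, 0.6])

loss_ok = triplet_loss(image_anchor, text_similar, text_dissimilar)
loss_bad = triplet_loss(image_anchor, text_dissimilar, text_similar)
ham_similar = hamming_distance(binarize(image_anchor), binarize(text_similar))
ham_dissimilar = hamming_distance(binarize(image_anchor), binarize(text_dissimilar))
```

In this toy case the similar text ends up closer to the image in Hamming space than the dissimilar one, and the loss is zero only when the triplet is ordered correctly, which is the ranking property the abstract describes.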
Pages: 6
Related papers
50 in total
  • [11] Efficient discrete supervised hashing for large-scale cross-modal retrieval
    Yao, Tao
    Han, Yaru
    Wang, Ruxin
    Kong, Xiangwei
    Yan, Lianshan
    Fu, Haiyan
    Tian, Qi
    [J]. NEUROCOMPUTING, 2020, 385 : 358 - 367
  • [12] SCALABLE DISCRIMINATIVE DISCRETE HASHING FOR LARGE-SCALE CROSS-MODAL RETRIEVAL
    Qin, Jianyang
    Fei, Lunke
    Zhu, Jian
    Wen, Jie
    Tian, Chunwei
    Wu, Shuai
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4330 - 4334
  • [13] Label guided correlation hashing for large-scale cross-modal retrieval
    Guohua Dong
    Xiang Zhang
    Long Lan
    Shiwei Wang
    Zhigang Luo
    [J]. Multimedia Tools and Applications, 2019, 78 : 30895 - 30922
  • [14] Multiple Information Embedded Hashing for Large-Scale Cross-Modal Retrieval
    Wang, Yongxin
    Zhan, Yu-Wei
    Chen, Zhen-Duo
    Luo, Xin
    Xu, Xin-Shun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5118 - 5131
  • [15] Label guided correlation hashing for large-scale cross-modal retrieval
    Dong, Guohua
    Zhang, Xiang
    Lan, Long
    Wang, Shiwei
    Luo, Zhigang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (21) : 30895 - 30922
  • [16] Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval
    Liu, Song
    Qian, Shengsheng
    Guan, Yang
    Zhan, Jiawei
    Ying, Long
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1379 - 1388
  • [17] Deep semantic hashing with dual attention for cross-modal retrieval
    Jiagao Wu
    Weiwei Weng
    Junxia Fu
    Linfeng Liu
    Bin Hu
    [J]. Neural Computing and Applications, 2022, 34 : 5397 - 5416
  • [18] Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval
    Su, Shupeng
    Zhong, Zhisheng
    Zhang, Chao
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3027 - 3035
  • [19] Deep semantic hashing with dual attention for cross-modal retrieval
    Wu, Jiagao
    Weng, Weiwei
    Fu, Junxia
    Liu, Linfeng
    Hu, Bin
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (07) : 5397 - 5416
  • [20] Deep Visual-Semantic Hashing for Cross-Modal Retrieval
    Cao, Yue
    Long, Mingsheng
    Wang, Jianmin
    Yang, Qiang
    Yu, Philip S.
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1445 - 1454