Discrete Fusion Adversarial Hashing for cross-modal retrieval

被引:11
|
作者
Li, Jing [1 ]
Yu, En [2 ]
Ma, Jianhua [2 ]
Chang, Xiaojun [3 ]
Zhang, Huaxiang [2 ]
Sun, Jiande [2 ]
机构
[1] Shandong Normal Univ, Sch Journalism & Commun, Jinan 250358, Peoples R China
[2] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[3] Univ Technol Sydney, Fac Engn & Informat Technol, Australian Artificial Intelligence Inst, Sydney, NSW 2007, Australia
关键词
Cross-modal retrieval; Deep hashing; Discrete optimization; Fusion learning; Adversarial learning;
D O I
10.1016/j.knosys.2022.109503
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep cross-modal hashing enables a flexible and efficient way for large-scale cross-modal retrieval. Existing cross-modal retrieval methods based on deep hashing aim to learn the unified hashing representation for different modalities with the supervision of pair-wise correlation, and then encode the out-of-samples via modality-specific hashing network. However, the semantic gap and distribution shift were not considered enough, and the hashing codes cannot be unified as expected under different modalities. At the same time, hashing is still a discrete problem that has not been solved well in the deep neural network. Therefore, we propose the Discrete Fusion Adversarial Hashing (DFAH) network for cross-modal retrieval to address these issues. In DFAH, the Modality-Specific Feature Extractor is designed to capture image and text features with pair-wise supervision. Especially, the Fusion Learner is proposed to learn the unified hash code, which enhances the correlation of heterogeneous modalities via the embedding strategy. Meanwhile, the Modality Discriminator is designed to adapt to the distribution shift cooperating with the Modality-Specific Feature Extractor in an adversarial way. In addition, we design an efficient discrete optimization strategy to avoid the relaxing quantization errors in the deep neural framework. Finally, the experiment results and analysis on several popular datasets also show that DFAH outperforms the state-of-the-art methods for cross-modal retrieval. (C) 2022 Published by Elsevier B.V.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Adversarial Tri-Fusion Hashing Network for Imbalanced Cross-Modal Retrieval
    Liu, Xin
    Cheung, Yiu-ming
    Hu, Zhikai
    He, Yi
    Zhong, Bineng
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (04): : 607 - 619
  • [2] Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval
    Meng, Min
    Sun, Jiaxuan
    Liu, Jigang
    Yu, Jun
    Wu, Jigang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1914 - 1926
  • [3] Label Guided Discrete Hashing for Cross-Modal Retrieval
    Lan, Rushi
    Tan, Yu
    Wang, Xiaoqin
    Liu, Zhenbing
    Luo, Xiaonan
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25236 - 25248
  • [4] Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval
    Ma, Dekui
    Liang, Jian
    Kong, Xiangwei
    He, Ran
    Li, Ying
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 38 - 43
  • [5] Supervised Contrastive Discrete Hashing for cross-modal retrieval
    Li, Ze
    Yao, Tao
    Wang, Lili
    Li, Ying
    Wang, Gang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [6] Discrete matrix factorization hashing for cross-modal retrieval
    Fang, Xiaozhao
    Liu, Zhihu
    Han, Na
    Jiang, Lin
    Teng, Shaohua
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (10) : 3023 - 3036
  • [7] Discrete Robust Supervised Hashing for Cross-Modal Retrieval
    Yao, Tao
    Zhang, Zhiwang
    Yan, Lianshan
    Yue, Jun
    Tian, Qi
    [J]. IEEE ACCESS, 2019, 7 : 39806 - 39814
  • [8] Discrete matrix factorization hashing for cross-modal retrieval
    Xiaozhao Fang
    Zhihu Liu
    Na Han
    Lin Jiang
    Shaohua Teng
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 3023 - 3036
  • [9] Nonlinear Robust Discrete Hashing for Cross-Modal Retrieval
    Yang, Zhan
    Long, Jun
    Zhu, Lei
    Huang, Wenti
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1349 - 1358
  • [10] Deep semantic similarity adversarial hashing for cross-modal retrieval
    Qiang, Haopeng
    Wan, Yuan
    Xiang, Lun
    Meng, Xiaojing
    [J]. NEUROCOMPUTING, 2020, 400 : 24 - 33