Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval

Cited by: 7
Authors
Jiao, Shichao [1]
Han, Xie [1]
Xiong, Fengguang [1]
Yang, Xiaowen [1]
Han, Huiyan [1]
He, Ligang [2]
Kuang, Liqun [1]
Affiliations
[1] North Univ China, Sch Data Sci & Technol, Taiyuan, Peoples R China
[2] Univ Warwick, Dept Comp, Warwick, England
Source
NEURAL COMPUTING & APPLICATIONS | 2022 / Vol. 34 / Issue 16
Funding
National Key Research and Development Program of China;
Keywords
Cross-modal retrieval; Sketch-based image retrieval; Zero-shot learning; Correlation learning; CONVOLUTIONAL NEURAL-NETWORKS; SCENE; SHAPE;
DOI
10.1007/s00521-022-07169-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Zero-shot sketch-based image retrieval (ZS-SBIR) is an extension of sketch-based image retrieval (SBIR) that aims to retrieve relevant images using query sketches from unseen categories. Most previous methods focus on preserving semantic knowledge and improving domain alignment, but neglect the correlation between inter-modal features, resulting in unsatisfactory performance. Hence, a sketch-image cross-modal retrieval framework is proposed to maximize the sketch-image correlation. For this framework, we develop a discriminant adversarial learning method that incorporates intra-modal discrimination, inter-modal consistency, and inter-modal correlation into a deep learning network for common feature representation learning. Specifically, sketch and image features are first projected into a shared feature subspace to achieve modality invariance. Subsequently, we adopt a category label predictor to achieve intra-modal discrimination, use adversarial learning to confuse modal information for inter-modal consistency, and introduce correlation learning to maximize inter-modal correlation. Finally, the trained deep learning model is evaluated on unseen categories. Extensive experiments conducted on three zero-shot datasets show that this method outperforms state-of-the-art methods. For retrieval accuracy on unseen categories, this method exceeds the state-of-the-art methods by approximately 0.6% on the RSketch dataset, 5% on the Sketchy dataset, and 7% on the TU-Berlin dataset. We also conduct experiments on a dataset for image-based 3D model scene retrieval, where the proposed method significantly outperforms the state-of-the-art approaches on all standard metrics.
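The three objectives named in the abstract (intra-modal discrimination, inter-modal consistency, inter-modal correlation) could be combined roughly as in the sketch below. It is illustrative only and is not taken from the paper: the function names, the use of Pearson correlation as the correlation term, the weights `alpha` and `beta`, and the sign convention for the adversarial term are all assumptions, since the abstract does not specify them.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, label):
    # Negative log-likelihood of the true label.
    return -math.log(softmax(logits)[label])

def pearson(u, v):
    # Pearson correlation between two equal-length feature vectors.
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = math.sqrt(sum((a - mu) ** 2 for a in u))
    sv = math.sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv + 1e-12)

def encoder_loss(sketch_feat, image_feat,
                 sketch_cls_logits, image_cls_logits, label,
                 sketch_modal_logits, image_modal_logits,
                 alpha=1.0, beta=1.0):
    # Hypothetical combined objective for the shared-subspace encoders.
    # Intra-modal discrimination: category prediction in both modalities.
    cls = (cross_entropy(sketch_cls_logits, label)
           + cross_entropy(image_cls_logits, label))
    # Modality discriminator loss (assumed labels: 0 = sketch, 1 = image).
    disc = (cross_entropy(sketch_modal_logits, 0)
            + cross_entropy(image_modal_logits, 1))
    # Inter-modal correlation: 0 when paired features are perfectly correlated.
    corr = 1.0 - pearson(sketch_feat, image_feat)
    # The encoders try to increase the discriminator loss (confuse it),
    # hence the minus sign on the adversarial term.
    return cls - alpha * disc + beta * corr

loss = encoder_loss([1.0, 2.0, 3.0], [2.0, 4.0, 6.0],
                    [2.0, 0.0], [1.5, 0.0], 0,
                    [0.0, 0.0], [0.0, 0.0])
```

In an actual adversarial setup, the modality discriminator would be trained in alternation to minimize `disc`, while the encoders minimize the combined loss; that training loop is omitted here.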
Pages: 13469-13483
Number of pages: 15
Related papers
50 in total
  • [1] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Shichao Jiao
    Xie Han
    Fengguang Xiong
    Xiaowen Yang
    Huiyan Han
    Ligang He
    Liqun Kuang
    [J]. Neural Computing and Applications, 2022, 34 : 13469 - 13483
  • [2] Cross-modal Self-distillation for Zero-shot Sketch-based Image Retrieval
    Tian, Jia-Lin
    Xu, Xing
    Shen, Fu-Min
    Shen, Heng-Tao
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (09):
  • [3] Progressive Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval
    Deng, Cheng
    Xu, Xinxun
    Wang, Hao
    Yang, Muli
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8892 - 8902
  • [4] Cross-Modal Visual Correspondences Learning Without External Semantic Information for Zero-Shot Sketch-Based Image Retrieval
    Gao, Zhijie
    Wang, Kai
    [J]. ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 342 - 353
  • [5] TOWARDS SKETCH-BASED IMAGE RETRIEVAL WITH DEEP CROSS-MODAL CORRELATION LEARNING
    Huang, Fei
    Jin, Cheng
    Zhang, Yuejie
    Zhang, Tao
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 907 - 912
  • [6] Cross-Domain Alignment for Zero-Shot Sketch-Based Image Retrieval
    Wang, Xu
    Peng, Dezhong
    Hu, Peng
    Gong, Yunhong
    Chen, Yong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 7024 - 7035
  • [7] WAD-CMSN: Wasserstein distance-based cross-modal semantic network for zero-shot sketch-based image retrieval
    Xu, Guanglong
    Hu, Zhensheng
    Cai, Jia
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2023, 21 (02)
  • [8] An efficient framework for zero-shot sketch-based image retrieval
    Tursun, Osman
    Denman, Simon
    Sridharan, Sridha
    Goan, Ethan
    Fookes, Clinton
    [J]. PATTERN RECOGNITION, 2022, 126
  • [9] Generative Model for Zero-Shot Sketch-Based Image Retrieval
    Verma, Vinay Kumar
    Mishra, Aakansha
    Mishra, Ashish
    Rai, Piyush
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 704 - 713
  • [10] A Simplified Framework for Zero-shot Cross-Modal Sketch Data Retrieval
    Chaudhuri, Ushasi
    Banerjee, Biplab
    Bhattacharya, Avik
    Datcu, Mihai
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 699 - 706