Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval

被引:7
|
作者
Jiao, Shichao [1 ]
Han, Xie [1 ]
Xiong, Fengguang [1 ]
Yang, Xiaowen [1 ]
Han, Huiyan [1 ]
He, Ligang [2 ]
Kuang, Liqun [1 ]
机构
[1] North Univ China, Sch Data Sci & Technol, Taiyuan, Peoples R China
[2] Univ Warwick, Dept Comp, Warwick, England
来源
NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 16期
基金
国家重点研发计划;
关键词
Cross-modal retrieval; Sketch-based image retrieval; Zero-shot learning; Correlation learning; CONVOLUTIONAL NEURAL-NETWORKS; SCENE; SHAPE;
D O I
10.1007/s00521-022-07169-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot sketch-based image retrieval (ZS-SBIR) is an extension of sketch-based image retrieval (SBIR) that aims to search relevant images with query sketches of the unseen categories. Most previous methods focus more on preserving semantic knowledge and improving domain alignment performance, but neglect to capture the correlation between inter-modal features, resulting in unsatisfactory performance. Hence, a sketch-image cross-modal retrieval framework is proposed to maximize the sketch-image correlation. For this framework, we develop a discriminant adversarial learning method that incorporates intra-modal discrimination, inter-modal consistency, and inter-modal correlation into a deep learning network for common feature representation learning. Specifically, sketch and image features are first projected into a shared feature subspace to achieve modality-invariance. Subsequently, we adopt a category label predictor to achieve intra-modal discrimination, use adversarial learning to confuse modal information for inter-modal consistency, and introduce correlation learning to maximize inter-modal correlation. Finally, the trained deep learning model is used to test unseen categories. Extensive experiments conducted on three zero-shot datasets show that this method outperforms state-of-the-art methods. For retrieval accuracy of unseen categories, this method exceeds the state-of-the-art methods by approximately 0.6% on the RSketch dataset, 5% on the Sketchy dataset, and 7% on the TU-Berlin dataset. We also conduct experiments on the dataset of image-based 3D model scene retrieval, the proposed method significantly outperforms the state-of-the-art approaches in all standard metrics.
引用
收藏
页码:13469 / 13483
页数:15
相关论文
共 50 条
  • [21] Generalized Zero-Shot Cross-Modal Retrieval
    Dutta, Titir
    Biswas, Soma
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (12) : 5953 - 5962
  • [22] Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval
    Pandey, Anubha
    Mishra, Ashish
    Verma, Vinay Kumar
    Mittal, Anurag
    Murthy, Hema A.
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2529 - 2538
  • [23] Deep cascaded cross-modal correlation learning for fine-grained sketch-based image retrieval
    Wang, Yanfei
    Huang, Fei
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    [J]. PATTERN RECOGNITION, 2020, 100
  • [24] Zero-shot Cross-modal Retrieval by Assembling AutoEncoder and Generative Adversarial Network
    Xu, Xing
    Tian, Jialin
    Lin, Kaiyi
    Lu, Huimin
    Shao, Jie
    Shen, Heng Tao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [25] Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval
    Yang, Fan
    Wang, Zheng
    Xiao, Jing
    Satoh, Shin'chi
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12589 - 12596
  • [26] Zero-shot Sketch-based Image Retrieval with Adaptive Balanced Discriminability and Generalizability
    Tian, Jialin
    Xu, Xing
    Cao, Zuo
    Zhang, Gong
    Shen, Fumin
    Yang, Yang
    [J]. PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 407 - 415
  • [27] Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval
    Wang, Zhipeng
    Wang, Hao
    Yan, Jiexi
    Wu, Aming
    Deng, Cheng
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1143 - 1149
  • [28] Cross-modal subspace learning for fine-grained sketch-based image retrieval
    Xu, Peng
    Yin, Qiyue
    Huang, Yongye
    Song, Yi-Zhe
    Ma, Zhanyu
    Wang, Liang
    Xiang, Tao
    Kleijn, W. Bastiaan
    Guo, Jun
    [J]. NEUROCOMPUTING, 2018, 278 : 75 - 86
  • [29] Zero-Shot Sketch-Based Image Retrieval via Graph Convolution Network
    Zhang, Zhaolong
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12943 - 12950
  • [30] Asymmetric Mutual Alignment for Unsupervised Zero-Shot Sketch-Based Image Retrieval
    Yin, Zhihui
    Yan, Jiexi
    Xu, Chenghao
    Deng, Cheng
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16504 - 16512