XPNet: Cross-Domain Prototypical Network for Zero-Shot Sketch-Based Image Retrieval

Cited by: 1
Authors
Li, Mingkang [1]
Qi, Yonggang [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
Keywords
Cross-domain prototype; Zero-shot; SBIR
DOI
10.1007/978-3-031-18907-4_31
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Zero-shot retrieval is a topical problem for sketch-based image search. It is largely necessitated by the fact that human sketch data is scarce in nature: in most cases retrieval will have to be conducted at zero-shot level. The problem of zero-shot sketch-based image retrieval (ZS-SBIR) is, however, a much harder task than its photo-only counterpart. In addition to addressing the zero-shot transfer problem, it also needs to tackle the inherent domain gap between sketch and photo. Most existing works on ZS-SBIR address these two problems separately: a triplet-like network to address the domain gap, and external semantic information (such as word embeddings) to assist category transfer. In this paper, we take a different stance and ask a more difficult question: can we devise a consolidated solution to accommodate both problems simultaneously, especially without the need for additional semantic information? For that, we propose a cross-domain prototype learning framework to narrow the domain gap by encouraging a confirmation of prototypes between two domains. The intuition is that there exists an embedding in which points, regardless of which domain they come from, cluster around a single shared prototype representation for a given class. We first show that performance comparable with the state of the art can already be achieved by doing this alone. We then further propose two means of tackling data efficiency during training: (i) an episode training protocol that enables data feeding on demand, and (ii) a hard triplet generation algorithm to address data scarcity. Extensive experiments on TU-Berlin-Extended, Sketchy-Extended and QuickDraw-Extended validate the usefulness of our approach.
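The shared-prototype intuition in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy 2-D embeddings, and the loss form (a softmax over negative squared Euclidean distances, borrowed from standard prototypical networks) are all assumptions made for illustration only.

```python
import math

def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def shared_prototypes(sketch_emb, photo_emb):
    """One prototype per class, pooled over BOTH domains (assumed pooling rule).

    sketch_emb / photo_emb: dict mapping class label -> list of embeddings.
    """
    return {c: mean_vector(sketch_emb[c] + photo_emb[c]) for c in sketch_emb}

def prototype_loss(point, label, prototypes):
    """Negative log-probability of `label` under a softmax over
    negative squared distances to every class prototype."""
    logits = {c: -sq_dist(point, p) for c, p in prototypes.items()}
    m = max(logits.values())  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(v - m) for v in logits.values()))
    return log_z - logits[label]

def rank_photos(query, gallery):
    """Indices of gallery embeddings, nearest to the sketch query first."""
    return sorted(range(len(gallery)), key=lambda i: sq_dist(query, gallery[i]))

# Hypothetical toy embeddings (2-D) for two classes in each domain.
sketch_emb = {"cat": [[0.0, 0.0], [0.2, 0.0]], "dog": [[1.0, 1.0]]}
photo_emb = {"cat": [[0.0, 0.2], [0.2, 0.2]], "dog": [[1.0, 0.8]]}

protos = shared_prototypes(sketch_emb, photo_emb)
loss_correct = prototype_loss([0.1, 0.1], "cat", protos)
loss_wrong = prototype_loss([0.1, 0.1], "dog", protos)
ranking = rank_photos([0.0, 0.1], [[1.0, 0.9], [0.1, 0.1]])
```

In this toy setting, a point lying near the pooled "cat" prototype incurs a lower loss when labelled "cat" than when labelled "dog", whichever domain it came from. Minimizing such a loss over both sketches and photos is one way to pull the two domains toward a single prototype per class.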
Pages: 394-410
Number of pages: 17
Related papers
50 in total
  • [41] Zero-shot sketch-based image retrieval with structure-aware asymmetric disentanglement
    Li, Jiangtong
    Ling, Zhixin
    Niu, Li
    Zhang, Liqing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 218
  • [42] A Zero-Shot Framework for Sketch Based Image Retrieval
    Yelamarthi, Sasi Kiran
    Reddy, Shiva Krishna
    Mishra, Ashish
    Mittal, Anurag
    COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 316 - 333
  • [43] Domain Disentangled Generative Adversarial Network for Zero-Shot Sketch-Based 3D Shape Retrieval
    Xu, Rui
    Han, Zongyan
    Hui, Le
    Qian, Jianjun
    Xie, Jin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2902 - 2910
  • [44] StyleGuide: Zero-Shot Sketch-Based Image Retrieval Using Style-Guided Image Generation
    Dutta, Titir
    Singh, Anurag
    Biswas, Soma
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2833 - 2842
  • [45] Zero-Shot Sketch-Based Image Retrieval with Hybrid Information Fusion and Sample Relationship Modeling
    Wu, Weijie
    Li, Jun
    Wu, Zhijian
    Xu, Jianhua
    MULTIMEDIA MODELING, MMM 2025, PT IV, 2025, 15523 : 337 - 350
  • [46] Indicative Vision Transformer for end-to-end zero-shot sketch-based image retrieval
    Zhang, Haoxiang
    Cheng, Deqiang
    Kou, Qiqi
    Asad, Mujtaba
    Jiang, He
    ADVANCED ENGINEERING INFORMATICS, 2024, 60
  • [47] CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Chowdhury, Pinaki Nath
    Koley, Subhadeep
    Xiang, Tao
    Song, Yi-Zhe
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2765 - 2775
  • [48] Zero-Shot Sketch-Based Image Retrieval Using StyleGen and Stacked Siamese Neural Networks
    Gopu, Venkata Rama Muni Kumar
    Dunna, Madhavi
    JOURNAL OF IMAGING, 2024, 10 (04)
  • [49] Task-like training paradigm in CLIP for zero-shot sketch-based image retrieval
    Zhang, Haoxiang
    Cheng, Deqiang
    Jiang, He
    Liu, Jingjing
    Kou, Qiqi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (19) : 57811 - 57828
  • [50] Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning
    Singha, Mainak
    Jha, Ankit
    Gupta, Divyam
    Singla, Pranav
    Banerjee, Biplab
    COMPUTER VISION - ECCV 2024, PT XXIV, 2025, 15082 : 1 - 19