XPNet: Cross-Domain Prototypical Network for Zero-Shot Sketch-Based Image Retrieval

被引:1
|
作者
Li, Mingkang [1 ]
Qi, Yonggang [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
关键词
Cross-domain prototype; Zero-shot; SBIR;
D O I
10.1007/978-3-031-18907-4_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot retrieval is a topical problem for sketch-based image search. It is largely necessitated by the fact that human sketch data is scarce in nature - in most cases retrieval will have to be conducted at zero-shot level. The problem of zero-shot sketch-based image retrieval (ZS-SBIR) is however a much harder task when compared with its photoonly counterpart. In addition to addressing the zero-shot transfer problem, it will also need to tackle the inherent domain gap between sketch and photo. Most existing works on ZS-SBIR typically address these two problems separately: a triplet-like network to address the domain gap, and employing external semantic information (such as word embeddings) to assist category transfer. In this paper, we take a different stance and ask a more difficult question - can we devise a consolidated solution to accommodate both problems simultaneously, especially without the need for additional semantic information. For that, we propose a cross-domain prototype learning framework to narrow the domain gap by encouraging a confirmation of prototypes between two domains. The intuition is there exists an embedding in which points regardless of which domain it comes from, would cluster around a single and shared prototype representation for a given class. We first show that performance comparable with that of state-of-the-art can already be achieved just by doing this alone. We then further propose two means of tackling data efficiency during training: (i) an episode training protocol that enables data feeding by demand, and (ii) a hard triplet generation algorithm to address data scarcity. Extensive experiments on TU-Berlin-Extended, Sketchy-Extended and QuickDraw-Extended validate the usefulness of our approach.
引用
收藏
页码:394 / 410
页数:17
相关论文
共 50 条
  • [21] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
    Ming Zhu
    Chen Zhao
    Nian Wang
    Feiyang Gu
    Yu Liu
    Xin Li
    The Visual Computer, 2024, 40 : 3091 - 3101
  • [22] Domain-aware double attention network for zero-shot sketch-based image retrieval with similarity loss
    Zhu, Ming
    Zhao, Chen
    Wang, Nian
    Gu, Feiyang
    Liu, Yu
    Li, Xin
    VISUAL COMPUTER, 2024, 40 (05): : 3091 - 3101
  • [23] Cross-modal Self-distillation for Zero-shot Sketch-based Image Retrieval
    Tian J.-L.
    Xu X.
    Shen F.-M.
    Shen H.-T.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (09):
  • [24] Domain disentanglement and fusion based on hyperbolic neural networks for zero-shot sketch-based image retrieval
    Zhang, Qing
    Zhang, Jing
    Su, Xiangdong
    Wang, Yonghe
    Bao, Feilong
    Gao, Guanglai
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (01)
  • [25] Zero-shot Sketch-based Image Retrieval with Adaptive Balanced Discriminability and Generalizability
    Tian, Jialin
    Xu, Xing
    Cao, Zuo
    Zhang, Gong
    Shen, Fumin
    Yang, Yang
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 407 - 415
  • [26] Asymmetric Mutual Alignment for Unsupervised Zero-Shot Sketch-Based Image Retrieval
    Yin, Zhihui
    Yan, Jiexi
    Xu, Chenghao
    Deng, Cheng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16504 - 16512
  • [27] Deep supervision network with contrastive learning for zero-shot sketch-based retrieval
    Shu, Zhenqiu
    Zhuo, Guangyao
    Yu, Jun
    Yu, Zhengtao
    APPLIED SOFT COMPUTING, 2024, 167
  • [28] Cross-Modal Attention Alignment Network with Auxiliary Text Description for Zero-Shot Sketch-Based Image Retrieval
    Su, Hanwen
    Song, Ge
    Huang, Kai
    Wang, Jiyan
    Yang, Ming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VI, 2024, 15021 : 52 - 65
  • [29] Cross-Domain Spatial Co-Attention Network for Sketch-Based Image Retrieval
    Yu Lingzhi
    Zhang Xifan
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (22)
  • [30] SKETCH-BASED IMAGE RETRIEVAL VIA A SEMI-HETEROGENEOUS CROSS-DOMAIN NETWORK
    Li, Chuo
    Zhou, Yuan
    Yang, Jianxing
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 216 - 221