WAD-CMSN: Wasserstein distance-based cross-modal semantic network for zero-shot sketch-based image retrieval

被引:1
|
作者
Xu, Guanglong [1 ]
Hu, Zhensheng [2 ]
Cai, Jia [3 ]
机构
[1] South China Univ Technol, Sch Econ & Finance, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[3] Guangdong Univ Finance & Econ, Sch Digital Econ, Guangzhou 510320, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; sketch-based image retrieval; Wasserstein distance; identity matching loss;
D O I
10.1142/S0219691322500540
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Zero-shot sketch-based image retrieval (ZSSBIR) aims at retrieving natural images given free hand-drawn sketches that may not appear during training. Previous approaches used semantic aligned sketch-image pairs or utilized memory expensive fusion layer for projecting the visual information to a low-dimensional subspace, which ignores the significant heterogeneous cross-domain discrepancy between highly abstract sketch and relevant image. This may yield poor performance in the training phase. To tackle this issue and overcome this drawback, we propose a Wasserstein distance-based cross-modal semantic network (WAD-CMSN) for ZSSBIR. Specifically, it first projects the visual information of each branch (sketch, image) to a common low-dimensional semantic subspace via Wasserstein distance in an adversarial training manner. Furthermore, a novel identity matching loss is employed to select useful features, which can not only capture complete semantic knowledge, but also alleviate the over-fitting phenomenon caused by the WAD-CMSN model. Experimental results on the challenging Sketchy (Extended) and TU-Berlin (Extended) datasets indicate the effectiveness of the proposed WAD-CMSN model over several competitors.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Progressive Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval
    Deng, Cheng
    Xu, Xinxun
    Wang, Hao
    Yang, Muli
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8892 - 8902
  • [2] Cross-modal Self-distillation for Zero-shot Sketch-based Image Retrieval
    Tian, Jia-Lin
    Xu, Xing
    Shen, Fu-Min
    Shen, Heng-Tao
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (09):
  • [3] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Jiao, Shichao
    Han, Xie
    Xiong, Fengguang
    Yang, Xiaowen
    Han, Huiyan
    He, Ligang
    Kuang, Liqun
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16): : 13469 - 13483
  • [4] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Shichao Jiao
    Xie Han
    Fengguang Xiong
    Xiaowen Yang
    Huiyan Han
    Ligang He
    Liqun Kuang
    [J]. Neural Computing and Applications, 2022, 34 : 13469 - 13483
  • [5] Cross-Modal Visual Correspondences Learning Without External Semantic Information for Zero-Shot Sketch-Based Image Retrieval
    Gao, Zhijie
    Wang, Kai
    [J]. ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 342 - 353
  • [6] Transferable Coupled Network for Zero-Shot Sketch-Based Image Retrieval
    Wang, Hao
    Deng, Cheng
    Liu, Tongliang
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9181 - 9194
  • [7] Contour detection network for zero-shot sketch-based image retrieval
    Zhang, Qing
    Zhang, Jing
    Su, Xiangdong
    Bao, Feilong
    Gao, Guanglai
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 6781 - 6795
  • [8] Contour detection network for zero-shot sketch-based image retrieval
    Qing Zhang
    Jing Zhang
    Xiangdong Su
    Feilong Bao
    Guanglai Gao
    [J]. Complex & Intelligent Systems, 2023, 9 : 6781 - 6795
  • [9] Cross-Domain Alignment for Zero-Shot Sketch-Based Image Retrieval
    Wang, Xu
    Peng, Dezhong
    Hu, Peng
    Gong, Yunhong
    Chen, Yong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 7024 - 7035
  • [10] Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval
    Wang, Zhipeng
    Wang, Hao
    Yan, Jiexi
    Wu, Aming
    Deng, Cheng
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1143 - 1149