SceneSketcher-v2: Fine-Grained Scene-Level Sketch-Based Image Retrieval Using Adaptive GCNs

被引:10
|
作者
Liu, Fang [1 ,2 ]
Deng, Xiaoming [3 ,4 ,5 ]
Zou, Changqing
Lai, Yu-Kun [8 ]
Chen, Keqi [3 ,4 ,5 ]
Zuo, Ran [3 ,4 ,5 ,6 ,7 ]
Ma, Cuixia [3 ,4 ,5 ]
Liu, Yong-Jin [9 ]
Wang, Hongan [3 ,4 ,5 ]
机构
[1] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[3] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing 100190, Peoples R China
[5] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing Key Lab Human Comp Interact, Beijing 100190, Peoples R China
[6] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China
[7] Zhejiang Lab, Res Ctr Artificial Intelligence & Fine Arts, Hangzhou 310058, Peoples R China
[8] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales
[9] Tsinghua Univ, Dept Comp Sci & Technol, MOE Key Lab Pervas Comp, BNRist, Beijing 100084, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Image retrieval; Semantics; Visualization; Layout; Task analysis; Electronic mail; Adaptation models; Sketch-based image retrieval; graph convolutional network; scene sketch; fine-grained image retrieval; DESCRIPTOR; ALIGNMENT;
D O I
10.1109/TIP.2022.3175403
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sketch-based image retrieval (SBIR) is a long-standing research topic in computer vision. Existing methods mainly focus on category-level or instance-level image retrieval. This paper investigates the fine-grained scene-level SBIR problem where a free-hand sketch depicting a scene is used to retrieve desired images. This problem is useful yet challenging mainly because of two entangled facts: 1) achieving an effective representation of the input query data and scene-level images is difficult as it requires to model the information across multiple modalities such as object layout, relative size and visual appearances, and 2) there is a great domain gap between the query sketch input and target images. We present SceneSketcher-v2, a Graph Convolutional Network (GCN) based architecture to address these challenges. SceneSketcher-v2 employs a carefully designed graph convolution network to fuse the multi-modality information in the query sketch and target images and uses a triplet training process and end-to-end training manner to alleviate the domain gap. Extensive experiments demonstrate SceneSketcher-v2 outperforms state-of-the-art scene-level SBIR models with a significant margin.
引用
收藏
页码:3737 / 3751
页数:15
相关论文
共 40 条
  • [1] Adaptive Fine-Grained Sketch-Based Image Retrieval
    Bhunia, Ayan Kumar
    Sain, Aneeshan
    Shah, Parth Hiren
    Gupta, Animesh
    Chowdhury, Pinaki Nath
    Xiang, Tao
    Song, Yi-Zhe
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 163 - 181
  • [2] Fine-Grained Instance-Level Sketch-Based Image Retrieval
    Yu, Qian
    Song, Jifei
    Song, Yi-Zhe
    Xiang, Tao
    Hospedales, Timothy M.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 484 - 500
  • [3] Fine-Grained Instance-Level Sketch-Based Image Retrieval
    Qian Yu
    Jifei Song
    Yi-Zhe Song
    Tao Xiang
    Timothy M. Hospedales
    International Journal of Computer Vision, 2021, 129 : 484 - 500
  • [4] Fine-Grained Color Sketch-Based Image Retrieval
    Xia, Yu
    Wang, Shuangbu
    Li, Yanran
    You, Lihua
    Yang, Xiaosong
    Zhang, Jian Jun
    ADVANCES IN COMPUTER GRAPHICS, CGI 2019, 2019, 11542 : 424 - 430
  • [5] Generalising Fine-Grained Sketch-Based Image Retrieval
    Pang, Kaiyue
    Li, Ke
    Yang, Yongxin
    Zhang, Honggang
    Hospedales, Timothy M.
    Xiang, Tao
    Song, Yi-Zhe
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 677 - 686
  • [6] Scene-Level Sketch-Based Image Retrieval with Minimal Pairwise Supervision
    Ge, Ce
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    Xu, Tong
    Liao, Jianxin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 650 - 657
  • [7] Gradually focused fine-grained sketch-based image retrieval
    Zhu, Ming
    Chen, Chun
    Wang, Nian
    Tang, Jun
    Bao, Wenxia
    PLOS ONE, 2019, 14 (05):
  • [8] Multi-Level Region Matching for Fine-Grained Sketch-Based Image Retrieval
    Ling, Zhixin
    Xing, Zhen
    Li, Jiangtong
    Niu, Li
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [9] Conditional Stroke Recovery for Fine-Grained Sketch-Based Image Retrieval
    Ling, Zhixin
    Xing, Zhen
    Zhou, Jian
    Zhou, Xiangdong
    COMPUTER VISION, ECCV 2022, PT XXVI, 2022, 13686 : 722 - 738
  • [10] Fine-Grained Instance-Level Sketch-Based Video Retrieval
    Xu, Peng
    Liu, Kun
    Xiang, Tao
    Hospedales, Timothy M.
    Ma, Zhanyu
    Guo, Jun
    Song, Yi-Zhe
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1995 - 2007