SceneSketcher-v2: Fine-Grained Scene-Level Sketch-Based Image Retrieval Using Adaptive GCNs

被引:10
|
作者
Liu, Fang [1 ,2 ]
Deng, Xiaoming [3 ,4 ,5 ]
Zou, Changqing
Lai, Yu-Kun [8 ]
Chen, Keqi [3 ,4 ,5 ]
Zuo, Ran [3 ,4 ,5 ,6 ,7 ]
Ma, Cuixia [3 ,4 ,5 ]
Liu, Yong-Jin [9 ]
Wang, Hongan [3 ,4 ,5 ]
机构
[1] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[3] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing 100190, Peoples R China
[5] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing Key Lab Human Comp Interact, Beijing 100190, Peoples R China
[6] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China
[7] Zhejiang Lab, Res Ctr Artificial Intelligence & Fine Arts, Hangzhou 310058, Peoples R China
[8] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales
[9] Tsinghua Univ, Dept Comp Sci & Technol, MOE Key Lab Pervas Comp, BNRist, Beijing 100084, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Image retrieval; Semantics; Visualization; Layout; Task analysis; Electronic mail; Adaptation models; Sketch-based image retrieval; graph convolutional network; scene sketch; fine-grained image retrieval; DESCRIPTOR; ALIGNMENT;
D O I
10.1109/TIP.2022.3175403
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sketch-based image retrieval (SBIR) is a long-standing research topic in computer vision. Existing methods mainly focus on category-level or instance-level image retrieval. This paper investigates the fine-grained scene-level SBIR problem where a free-hand sketch depicting a scene is used to retrieve desired images. This problem is useful yet challenging mainly because of two entangled facts: 1) achieving an effective representation of the input query data and scene-level images is difficult as it requires to model the information across multiple modalities such as object layout, relative size and visual appearances, and 2) there is a great domain gap between the query sketch input and target images. We present SceneSketcher-v2, a Graph Convolutional Network (GCN) based architecture to address these challenges. SceneSketcher-v2 employs a carefully designed graph convolution network to fuse the multi-modality information in the query sketch and target images and uses a triplet training process and end-to-end training manner to alleviate the domain gap. Extensive experiments demonstrate SceneSketcher-v2 outperforms state-of-the-art scene-level SBIR models with a significant margin.
引用
收藏
页码:3737 / 3751
页数:15
相关论文
共 40 条
  • [21] A deformable CNN-based triplet model for fine-grained sketch-based image retrieval
    Zhang, Xianlin
    Shen, Mengling
    Li, Xueming
    Feng, Fangxiang
    PATTERN RECOGNITION, 2022, 125
  • [22] CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Chowdhury, Pinaki Nath
    Koley, Subhadeep
    Xiang, Tao
    Song, Yi-Zhe
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2765 - 2775
  • [23] Expansion window local alignment weighted network for fine-grained sketch-based image retrieval
    Zhang, Zi-Chao
    Xie, Zhen-Yu
    Chen, Zhen-Duo
    Zhan, Yu-Wei
    Luo, Xin
    Xu, Xin-Shun
    PATTERN RECOGNITION, 2023, 144
  • [24] Bi-LSTM Sequence Modeling for On-The-Fly Fine-Grained Sketch-Based Image Retrieval
    Liu Y.
    Dai D.
    Tang X.
    Xia S.
    Wang G.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1178 - 1185
  • [25] SketchANIMAR: Sketch-based 3D animal fine-grained retrieval
    Le, Trung-Nghia
    Nguyen, Tam, V
    Le, Minh-Quan
    Nguyen, Trong-Thuan
    Huynh, Viet-Tham
    Do, Trong-Le
    Le, Khanh-Duy
    Tran, Mai-Khiem
    Hoang-Xuan, Nhat
    Nguyen-Ho, Thang-Long
    Nguyen, Vinh-Tiep
    Le-Pham, Nhat-Quynh
    Pham, Huu-Phuc
    Hoang, Trong-Vu
    Nguyen, Quang-Binh
    Nguyen-Mau, Trong-Hieu
    Huynh, Tuan-Luc
    Le, Thanh-Danh
    Nguyen-Ha, Ngoc-Linh
    Truong-Thuy, Tuong-Vy
    Phong, Truong Hoai
    Diep, Tuong-Nghiem
    Ho, Khanh-Duy
    Nguyen, Xuan-Hieu
    Tran, Thien-Phuc
    Yang, Tuan-Anh
    Tran, Kim-Phat
    Hoang, Nhu-Vinh
    Nguyen, Minh-Quang
    Vo, Hoai-Danh
    Doan, Minh-Hoa
    Nguyen, Hai-Dang
    Sugimoto, Akihiro
    Tran, Minh-Triet
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 150 - 161
  • [26] Toward Fine-Grained Sketch-Based 3D Shape Retrieval
    Qi, Anran
    Gryaditskaya, Yulia
    Song, Jifei
    Yang, Yongxin
    Qi, Yonggang
    Hospedales, Timothy M.
    Xiang, Tao
    Song, Yi-Zhe
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 8595 - 8606
  • [27] Deep cascaded cross-modal correlation learning for fine-grained sketch-based image retrieval
    Wang, Yanfei
    Huang, Fei
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    PATTERN RECOGNITION, 2020, 100 (100)
  • [28] DLI-Net: Dual Local Interaction Network for Fine-Grained Sketch-Based Image Retrieval
    Sun, Haifeng
    Xu, Jiaqing
    Wang, Jingyu
    Qi, Qi
    Ge, Ce
    Liao, Jianxin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7177 - 7189
  • [29] Multi-granularity Association Learning for On-the-fly Fine-grained Sketch-based Image Retrieval
    Dai, Dawei
    Tang, Xiaoyu
    Liu, Yingge
    Xia, Shuyin
    Wang, Guoyin
    KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [30] AE-Net: Fine-grained sketch-based image retrieval via attention-enhanced network
    Chen, Yangdong
    Zhang, Zhaolong
    Wang, Yanfei
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    PATTERN RECOGNITION, 2022, 122