Global-Local Query-Support Cross-Attention for Few-Shot Semantic Segmentation

被引:1
|
作者
Xie, Fengxi [1 ]
Liang, Guozhen [1 ]
Chien, Ying-Ren [2 ]
机构
[1] Tech Univ Berlin, Dept Elect Engn & Comp Sci, D-10623 Berlin, Germany
[2] Natl Ilan Univ, Dept Elect Engn, Yilan, Taiwan
关键词
few-shot semantic segmentation; global-local query-support cross-attention; multi-head attention; transformer;
D O I
10.3390/math12182936
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Few-shot semantic segmentation (FSS) models aim to segment unseen target objects in a query image with scarce annotated support samples. This challenging task requires an effective utilization of support information contained in the limited support set. However, the majority of existing FSS methods either compressed support features into several prototype vectors or constructed pixel-wise support-query correlations to guide the segmentation, which failed in effectively utilizing the support information from the global-local perspective. In this paper, we propose Global-Local Query-Support Cross-Attention (GLQSCA), where both global semantics and local details are exploited. Implemented with multi-head attention in a transformer architecture, GLQSCA treats every query pixel as a token, aggregates the segmentation label from the support mask values (weighted by the similarities with all foreground prototypes (global information)), and supports pixels (local information). Experiments show that our GLQSCA significantly surpasses state-of-the-art methods on the standard FSS benchmarks PASCAL-5i and COCO-20i.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Few-Shot Semantic Segmentation via Mask Aggregation
    Ao, Wei
    Zheng, Shunyi
    Meng, Yan
    Yang, Yang
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [42] Deep Reasoning Network for Few-shot Semantic Segmentation
    Zhuge, Yunzhi
    Shen, Chunhua
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5344 - 5352
  • [43] Incorporating Depth Information into Few-Shot Semantic Segmentation
    Zhang, Yifei
    Sidibe, Desire
    Morel, Olivier
    Meriaudeau, Fabrice
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3582 - 3588
  • [44] Dynamic Extension Nets for Few-shot Semantic Segmentation
    Liu, Lizhao
    Cao, Junyi
    Liu, Minqian
    Guo, Yong
    Chen, Qi
    Tan, Mingkui
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1441 - 1449
  • [45] Few-shot semantic segmentation: a review on recent approaches
    Zhaobin Chang
    Yonggang Lu
    Xingcheng Ran
    Xiong Gao
    Xiangwen Wang
    Neural Computing and Applications, 2023, 35 : 18251 - 18275
  • [46] Few-Shot Semantic Segmentation for Complex Driving Scenes
    Zhou, Jingxing
    Chen, Ruei-Bo
    Beyerer, Juergen
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 695 - 702
  • [47] Prediction Calibration for Generalized Few-Shot Semantic Segmentation
    Lu, Zhihe
    He, Sen
    Li, Da
    Song, Yi-Zhe
    Xiang, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3311 - 3323
  • [48] Few-shot semantic segmentation: a review on recent approaches
    Chang, Zhaobin
    Lu, Yonggang
    Ran, Xingcheng
    Gao, Xiong
    Wang, Xiangwen
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (25): : 18251 - 18275
  • [49] A lightweight siamese transformer for few-shot semantic segmentation
    Zhu, Hegui
    Zhou, Yange
    Jiang, Cong
    Yang, Lianping
    Jiang, Wuming
    Wang, Zhimu
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7455 - 7469
  • [50] Research Status and Analysis of Few-Shot Semantic Segmentation
    Chen, Shan-Juan
    Yu, Yun-Long
    Li, Ying-Ming
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (10): : 2417 - 2451