Global-Local Query-Support Cross-Attention for Few-Shot Semantic Segmentation

被引:1
|
作者
Xie, Fengxi [1 ]
Liang, Guozhen [1 ]
Chien, Ying-Ren [2 ]
机构
[1] Tech Univ Berlin, Dept Elect Engn & Comp Sci, D-10623 Berlin, Germany
[2] Natl Ilan Univ, Dept Elect Engn, Yilan, Taiwan
关键词
few-shot semantic segmentation; global-local query-support cross-attention; multi-head attention; transformer;
D O I
10.3390/math12182936
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Few-shot semantic segmentation (FSS) models aim to segment unseen target objects in a query image with scarce annotated support samples. This challenging task requires an effective utilization of support information contained in the limited support set. However, the majority of existing FSS methods either compressed support features into several prototype vectors or constructed pixel-wise support-query correlations to guide the segmentation, which failed in effectively utilizing the support information from the global-local perspective. In this paper, we propose Global-Local Query-Support Cross-Attention (GLQSCA), where both global semantics and local details are exploited. Implemented with multi-head attention in a transformer architecture, GLQSCA treats every query pixel as a token, aggregates the segmentation label from the support mask values (weighted by the similarities with all foreground prototypes (global information)), and supports pixels (local information). Experiments show that our GLQSCA significantly surpasses state-of-the-art methods on the standard FSS benchmarks PASCAL-5i and COCO-20i.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Query-support semantic correlation mining for few-shot segmentation
    Shao, Ji
    Gong, Bo
    Dai, Kanyuan
    Li, Daoliang
    Jing, Ling
    Chen, Yingyi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [2] Dual cross-attention Transformer network for few-shot image semantic segmentation
    Liu, Yu
    Guo, Yingchun
    Zhu, Ye
    Yu, Ming
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (11) : 1494 - 1505
  • [3] Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification
    Wang, Xixi
    Wang, Xiao
    Jiang, Bo
    Luo, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7789 - 7802
  • [4] Global-Local Interplay in Semantic Alignment for Few-Shot Learning
    Hao, Fusheng
    He, Fengxiang
    Cheng, Jun
    Tao, Dacheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4351 - 4363
  • [5] Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
    Shi, Xinyu
    Wei, Dong
    Zhang, Yu
    Lu, Donghuan
    Ning, Munan
    Chen, Jiashun
    Ma, Kai
    Zheng, Yefeng
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 151 - 168
  • [6] Few-Shot Segmentation based on Global-cross Attention
    Wang, Cailing
    Xu, Yinpeng
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4905 - 4910
  • [7] Query semantic reconstruction for background in few-shot segmentation
    Guan, Haoyan
    Spratling, Michael
    VISUAL COMPUTER, 2024, 40 (02): : 799 - 810
  • [8] Query semantic reconstruction for background in few-shot segmentation
    Haoyan Guan
    Michael Spratling
    The Visual Computer, 2024, 40 (2) : 799 - 810
  • [9] Class-Aware Self- and Cross-Attention Network for Few-Shot Semantic Segmentation of Remote Sensing Images
    Liang, Guozhen
    Xie, Fengxi
    Chien, Ying-Ren
    MATHEMATICS, 2024, 12 (17)
  • [10] CROSS-SCALE QUERY-SUPPORT ALIGNMENT APPROACH FOR SMALL OBJECT DETECTION IN THE FEW-SHOT REGIME
    Le Jeune, Pierre
    Mokraoui, Anissa
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 555 - 559