Adaptive similarity-guided self-merging network for few-shot semantic segmentation

Cited by: 1
Authors
Liu, Yu [1]
Guo, Yingchun [2]
Zhu, Ye [2]
Yu, Ming [1]
Affiliations
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Few-shot semantic segmentation; Style differences; Adaptive weight; Bi-aggregation; Prototype merging;
DOI
10.1016/j.compeleceng.2024.109527
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Few-shot Semantic Segmentation (FSS) attempts to segment the new category with only a few labeled samples, presenting a significant challenge. Existing approaches primarily focus on leveraging category information from the support set to identify objects of the new category in the query image. However, these models often struggle when confronted with substantial differences between paired images. To address issues stemming from scenario differences and intra-class diversity, this paper proposes an adaptive similarity-guided self-merging network. Firstly, style differences of multi-level features are introduced to alleviate the network's sensitivity to scenario variations and learn an adaptive weight for the K-shot scheme. Secondly, a feature-mask bi-aggregation module is designed to learn an enhanced feature and an initial mask for the query image. Within this module, dynamic correlations cover all the spatial locations, providing global information crucial for feature and mask aggregation. Subsequently, a self-merging module is proposed to alleviate prototype bias. It merges a self-prototype derived from the initial mask with an adaptive weighted support prototype obtained from K support images. Finally, the target object is segmented using the enhanced feature and merging prototype, and segmentation results are further refined by predictions of base categories and an adjustment factor derived from multilevel style differences. The proposed method achieves 69.1% (1-shot) and 72.3% (5-shot) mIoU on the PASCAL-5i dataset, and 47.4% (1-shot) and 52.1% (5-shot) mIoU on the COCO-20i dataset. These results demonstrate state-of-the-art segmentation performance compared to mainstream methods. (c) 2017 Elsevier Inc. All rights reserved.
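The prototype-merging step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the formulas, so the masked average pooling, the softmax over per-shot similarity scores, the mixing factor `alpha`, and all function names here are assumptions chosen only to show the general idea of combining a weighted K-shot support prototype with a self-prototype taken from the query's initial mask.

```python
import numpy as np


def masked_average_pool(feat, mask, eps=1e-6):
    """Average the (C, H, W) feature map over the region selected by an (H, W) mask."""
    weighted = feat * mask[None, :, :]
    return weighted.sum(axis=(1, 2)) / (mask.sum() + eps)


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    z = np.exp(x - np.max(x))
    return z / z.sum()


def merged_prototype(support_feats, support_masks, similarity_scores,
                     query_feat, initial_query_mask, alpha=0.5):
    """Hypothetical merge of a weighted support prototype and a query self-prototype.

    Assumptions (not from the paper): per-shot weights are a softmax of
    `similarity_scores`, and the two prototypes are mixed linearly by `alpha`.
    """
    # One prototype per support image, stacked to shape (K, C).
    support_protos = np.stack([
        masked_average_pool(f, m) for f, m in zip(support_feats, support_masks)
    ])
    # Adaptive weights over the K shots, summing to 1.
    weights = softmax(np.asarray(similarity_scores, dtype=float))
    support_proto = weights @ support_protos  # shape (C,)
    # Self-prototype from the query feature and its initial predicted mask.
    self_proto = masked_average_pool(query_feat, initial_query_mask)
    return alpha * support_proto + (1.0 - alpha) * self_proto
```

In this sketch a support image with a higher similarity score contributes more to the support prototype, and `alpha` controls how much the query's own initial prediction is trusted relative to the support set.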
Pages: 19
Related papers
50 entries in total
  • [41] CLIP-Driven Prototype Network for Few-Shot Semantic Segmentation
    Guo, Shi-Cheng
    Liu, Shang-Kun
    Wang, Jing-Yu
    Zheng, Wei-Min
    Jiang, Cheng-Yu
    ENTROPY, 2023, 25 (09)
  • [42] MGNet: Mutual-guidance network for few-shot semantic segmentation
    Chang, Zhaobin
    Lu, Yonggang
    Wang, Xiangwen
    Ran, Xingcheng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [43] Word vector embedding and self-supplementing network for Generalized Few-shot Semantic Segmentation
    Wang, Xiaowei
    Chen, Qiong
    Yang, Yong
    NEUROCOMPUTING, 2025, 613
  • [44] Prototype-Guided Prior Enhancement and Rectification in Few-shot Semantic Segmentation
    Tang, Yiming
    Yu, Yi
    Chen, Yan Qiu
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024
  • [45] Query-Guided Prototype Evolution Network for Few-Shot Segmentation
    Cong, Runmin
    Xiong, Hang
    Chen, Jinpeng
    Zhang, Wei
    Huang, Qingming
    Zhao, Yao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6501 - 6512
  • [46] Task-aware adaptive attention learning for few-shot semantic segmentation
    Mao, Binjie
    Wang, Lingfeng
    Xiang, Shiming
    Pan, Chunhong
    NEUROCOMPUTING, 2022, 494 : 104 - 115
  • [47] Adaptive Agent Transformer for Few-Shot Segmentation
    Wang, Yuan
    Sun, Rui
    Zhang, Zhe
    Zhang, Tianzhu
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 36 - 52
  • [48] MASK-GUIDED ATTENTION AND EPISODE ADAPTIVE WEIGHTS FOR FEW-SHOT SEGMENTATION
    Kwon, Hyeongjun
    Song, Taeyong
    Kim, Sunok
    Sohn, Kwanghoon
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022 : 2611 - 2615
  • [49] A few-shot semantic segmentation method based on adaptively mining correlation network
    Huang, Zhifu
    Jiang, Bin
    Liu, Yu
    ROBOTICA, 2023, 41 (06) : 1828 - 1836
  • [50] Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation
    Zhang, Yifei
    Sidibe, Desire
    Morel, Olivier
    Meriaudeau, Fabrice
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021 : 7372 - 7378