Adaptive similarity-guided self-merging network for few-shot semantic segmentation

被引:1
|
作者
Liu, Yu [1 ]
Guo, Yingchun [2 ]
Zhu, Ye [2 ]
Yu, Ming [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot semantic segmentation; Style differences; Adaptive weight; Bi-aggregation; Prototype merging;
D O I
10.1016/j.compeleceng.2024.109527
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot Semantic Segmentation (FSS) attempts to segment the new category with only a few labeled samples, presenting a significant challenge. Existing approaches primarily focus on leveraging category information from the support set to identify objects of the new category in the query image. However, these models often struggle when confronted with substantial differences between paired images. To address issues stemming from scenario differences and intra-class diversity, this paper proposes an adaptive similarity-guided self-merging network. Firstly, style differences of multi-level features are introduced to alleviate the network's sensitivity to scenario variations and learn an adaptive weight for the K-shot scheme. Secondly, a feature-mask bi-aggregation module is designed to learn an enhanced feature and an initial mask for the query image. Within this module, dynamic correlations cover all the spatial locations, providing global information crucial for feature and mask aggregation. Subsequently, a self-merging module is proposed to alleviate prototype bias. It merges a self-prototype derived from the initial mask with an adaptive weighted support prototype obtained from K support images. Finally, the target object is segmented using the enhanced feature and merging prototype, and segmentation results are further refined by predictions of base categories and an adjustment factor derived from multilevel style differences. The proposed method achieves 69.1% (1-shot) and 72.3% (5-shot) mIoU on the PASCAL-5i dataset, and 47.4% (1-shot) and 52.1% (5-shot) mIoU on the COCO-20i dataset. These results demonstrate state-of-the-art segmentation performance compared to mainstream methods. (c) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Bi-aggregation-aggregation and self-merging network for few-shot image semantic segmentation
    Liu, Yu
    Yu, Ming
    Zhu, Ye
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (10) : 1421 - 1430
  • [2] A SIMILARITY DISTILLATION GUIDED FEATURE REFINEMENT NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Lyu, Shuchang
    Liu, Binghao
    Chen, Lijiang
    Zhao, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 666 - 670
  • [3] Few-Shot Semantic Segmentation via Frequency Guided Neural Network
    Rao, Xiya
    Lu, Tao
    Wang, Zhongyuan
    Zhang, Yanduo
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1092 - 1096
  • [4] APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation
    Chen, Jiacheng
    Gao, Bin-Bin
    Lu, Zongqing
    Xue, Jing-Hao
    Wang, Chengjie
    Liao, Qingmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4361 - 4373
  • [5] Self-regularized prototypical network for few-shot semantic segmentation
    Ding, Henghui
    Zhang, Hui
    Jiang, Xudong
    PATTERN RECOGNITION, 2023, 133
  • [6] SANet: similarity aggregation and semantic fusion for few-shot semantic segmentation
    Ye, Minrui
    Zhang, Tao
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [7] Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation
    Wen, Chunlin
    Huang, Hui
    Ma, Yan
    Yuan, Feiniu
    Zhu, Hongqing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8874 - 8888
  • [8] PRIOR SEMANTIC HARMONIZATION NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Yang, Xinhao
    Ma, Liyan
    Zhou, Yang
    Peng, Yan
    Xie, Shaorong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1126 - 1130
  • [9] Few-Shot Semantic Segmentation with Cyclic Memory Network
    Xie, Guo-Sen
    Xiong, Huan
    Liu, Jie
    Yao, Yazhou
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7273 - 7282
  • [10] Deep Reasoning Network for Few-shot Semantic Segmentation
    Zhuge, Yunzhi
    Shen, Chunhua
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5344 - 5352