SAM-RSP: A new few-shot segmentation method based on segment anything model and rough segmentation prompts

被引:0
|
作者
Li, Jiaguang [1 ]
Wei, Ying [1 ]
Zhang, Wei [1 ]
Shi, Zhenrui [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
关键词
Few-shot segmentation; Prompt learning; Prototype learning; Segment anything model (SAM); Semantic segmentation;
D O I
10.1016/j.imavis.2024.105214
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot segmentation (FSS) aims to segment novel classes with a few labeled images. The backbones used in existing methods are pre-trained through classification tasks on the ImageNet dataset. Although these backbones can effectively perceive the semantic categories of images, they cannot accurately perceive the regional boundaries within one image, which limits the model performance. Recently, Segment Anything Model (SAM) has achieved precise image segmentation based on point or box prompts, thanks to its excellent perception of region boundaries within one image. However, it cannot effectively provide semantic information of images. This paper proposes a new few-shot segmentation method that can effectively perceive both semantic categories and regional boundaries. This method first utilizes the SAM encoder to perceive regions and obtain the query embedding. Then the support and query images are input into a backbone pre-trained on ImageNet to perceive semantics and generate a rough segmentation prompt (RSP). This query embedding is combined with the prompt to generate a pixel-level query prototype, which can better match the query embedding. Finally, the query embedding, prompt, and prototype are combined and input into the designed multi-layer prompt transformer decoder, which is more efficient and lightweight, and can provide a more accurate segmentation result. In addition, other methods can be easily combined with our framework to improve their performance. Plenty of experiments on PASCAL-5i and COCO-20i under 1-shot and 5-shot settings prove the effectiveness of our method. Our method also achieves new state-of-the-art. Codes are available at https://github.com/Jiaguang-NE U/SAM-RSP.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Differentiable Meta-Learning Model for Few-Shot Semantic Segmentation
    Tian, Pinzhuo
    Wu, Zhangkai
    Qi, Lei
    Wang, Lei
    Shi, Yinghuan
    Gao, Yang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12087 - 12094
  • [22] Part-Based Semantic Transform for Few-Shot Semantic Segmentation
    Yang, Boyu
    Wan, Fang
    Liu, Chang
    Li, Bohao
    Ji, Xiangyang
    Ye, Qixiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7141 - 7152
  • [23] Multi-similarity based hyperrelation network for few-shot segmentation
    Shi, Xiangwen
    Cui, Zhe
    Zhang, Shaobing
    Cheng, Miao
    He, Lian
    Tang, Xianghong
    IET IMAGE PROCESSING, 2023, 17 (01) : 204 - 214
  • [24] G-SAM: GMM-based segment anything model for medical image classification and segmentation
    Liu, Xiaoxiao
    Zhao, Yan
    Wang, Shigang
    Wei, Jian
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (10): : 14231 - 14245
  • [25] A pseudo-labeling based weakly supervised segmentation method for few-shot texture images
    Han, Yuexing
    Li, Ruiqi
    Wang, Bing
    Ruan, Liheng
    Chen, Qiaochuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [26] CDSG-SAM: A cross-domain self-generating prompt few-shot brain tumor segmentation pipeline based on SAM
    Yang, Yang
    Fang, Xianjin
    Li, Xiang
    Han, Yuxi
    Yu, Zekuan
    Biomedical Signal Processing and Control, 2025, 100
  • [27] Few-shot Tumor Bud Segmentation Using Generative Model in Colorectal Carcinoma
    Su, Ziyu
    Chen, Wei
    Leigh, Preston J.
    Sajjad, Usama
    Niu, Shuo
    Rezapour, Mostafa
    Frankel, Wendy L.
    Gurcan, Metin N.
    Niazi, M. Khalid Khan
    DIGITAL AND COMPUTATIONAL PATHOLOGY, MEDICAL IMAGING 2024, 2024, 12933
  • [28] MW-SAM:Mangrove wetland remote sensing image segmentation network based on segment anything model
    Zhang, Yu
    Wang, Xin
    Cai, Jingye
    Yang, Qun
    IET Image Processing, 2024, 18 (14) : 4503 - 4513
  • [29] Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation
    Zhang, Yifei
    Sidibe, Desire
    Morel, Olivier
    Meriaudeau, Fabrice
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7372 - 7378
  • [30] Kernel-based similarity sorting and allocation for few-shot semantic segmentation
    Ze-yu Liu
    Jian-wei Liu
    Neural Computing and Applications, 2022, 34 : 21939 - 21960