Learning prototypes from background and latent objects for few-shot semantic segmentation

被引:0
|
作者
Wang, Yicong [1 ]
Huang, Rong [1 ,3 ]
Zhou, Shubo [1 ,3 ]
Jiang, Xueqin [1 ,3 ]
Fang, Zhijun [2 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Donghua Univ, Sch Comp Sci & Technol, Shanghai 201620, Peoples R China
[3] Donghua Univ, Engn Res Ctr Digitized Text & Apparel Technol, Minist Educ, Shanghai 201620, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Few-shot semantic segmentation; Prototype learning; Self-attention mechanism; NETWORK;
D O I
10.1016/j.knosys.2025.113218
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot semantic segmentation (FSS) aims to segment target object within a given image supported by few samples with pixel-level annotations. Existing FSS framework primarily focuses on target area for learning a target-object prototype while directly neglecting non-target clues. As such, the target-object prototype has not only to segment the target object but also to filter out non-target area simultaneously, resulting in numerous false positives. In this paper, we propose a background and latent-object prototype learning network (BLPLNet), which learns prototypes from not only the target area but also the non-target counterpart. From our perspective, the non-target area is delineated into background full of repeated textures and salient objects, refer to as latent objects in this paper. Specifically, a background mining module (BMM) is developed to specially learn a background prototype by episodic learning. The learned background prototype replaces the target-object one for background filtering, reducing the false positives. Moreover, a latent object mining module (LOMM), based on self-attention mechanism, works together with the BMM for learning multiple soft-orthogonal prototypes from latent objects. Then, the learned latent-object prototypes, which condense the general knowledge of objects, are used in a target object enhancement module (TOEM) to enhance the target-object prototype with the guidance of affinity-based scores. Extensive experiments on PASCAL-5i and COCO-20i datasets demonstrate the superiority of the BLPLNet, which outperforms state-of-the-art methods by an average of 0.60% on PASCAL5i. Ablation studies validate the effectiveness of each component, and visualization results indicate that the learned latent-object prototypes indeed convey the general knowledge of objects.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Few-shot semantic segmentation: a review on recent approaches
    Zhaobin Chang
    Yonggang Lu
    Xingcheng Ran
    Xiong Gao
    Xiangwen Wang
    Neural Computing and Applications, 2023, 35 : 18251 - 18275
  • [32] Few-Shot Semantic Segmentation for Complex Driving Scenes
    Zhou, Jingxing
    Chen, Ruei-Bo
    Beyerer, Juergen
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 695 - 702
  • [33] Prediction Calibration for Generalized Few-Shot Semantic Segmentation
    Lu, Zhihe
    He, Sen
    Li, Da
    Song, Yi-Zhe
    Xiang, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3311 - 3323
  • [34] Cross-Domain Few-Shot Semantic Segmentation
    Lei, Shuo
    Zhang, Xuchao
    He, Jianfeng
    Chen, Fanglan
    Du, Bowen
    Lu, Chang-Tien
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 73 - 90
  • [35] Few-shot semantic segmentation: a review on recent approaches
    Chang, Zhaobin
    Lu, Yonggang
    Ran, Xingcheng
    Gao, Xiong
    Wang, Xiangwen
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (25): : 18251 - 18275
  • [36] A lightweight siamese transformer for few-shot semantic segmentation
    Zhu, Hegui
    Zhou, Yange
    Jiang, Cong
    Yang, Lianping
    Jiang, Wuming
    Wang, Zhimu
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7455 - 7469
  • [37] Research Status and Analysis of Few-Shot Semantic Segmentation
    Chen, Shan-Juan
    Yu, Yun-Long
    Li, Ying-Ming
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (10): : 2417 - 2451
  • [38] Few-shot semantic segmentation in complex industrial components
    Xu C.
    Wang B.
    Gan J.
    Jiang J.
    Wang Y.
    Tu M.
    Zhou W.
    Multimedia Tools and Applications, 2025, 84 (2) : 1013 - 1030
  • [39] Survey on Image Semantic Segmentation in Dilemma of Few-Shot
    Wei, Ting
    Li, Xinlei
    Liu, Hui
    Computer Engineering and Applications, 2024, 59 (02) : 1 - 11
  • [40] Few-shot semantic segmentation for industrial defect recognition
    Shi, Xiangwen
    Zhang, Shaobing
    Cheng, Miao
    He, Lian
    Tang, Xianghong
    Cui, Zhe
    COMPUTERS IN INDUSTRY, 2023, 148