Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression

被引:0
|
作者
Liu, Xinyi [1 ]
Huang, Guoheng [1 ]
Yuan, Xiaochen [2 ]
Zheng, Zewen [1 ]
Zhong, Guo [3 ]
Chen, Xuhang [4 ]
Pun, Chi-Man [5 ]
机构
[1] Guangdong Univ Technol, Guangzhou, Peoples R China
[2] Macao Polytech Univ, Macau, Peoples R China
[3] Guangdong Univ Foreign Studies, Guangzhou, Peoples R China
[4] Huizhou Univ, Huizhou, Peoples R China
[5] Univ Macau, Macau, Peoples R China
来源
关键词
Weakly Supervised Semantic Segmentation; Class Activation Mapping; Uncertainty estimation; Attention mechanism;
D O I
10.1007/s00371-024-03574-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Weakly Supervised Semantic Segmentation (WSSS) has become increasingly popular for achieving remarkable segmentation with only image-level labels. Current WSSS approaches extract Class Activation Mapping (CAM) from classification models to produce pseudo-masks for segmentation supervision. However, due to the gap between image-level supervised classification loss and pixel-level CAM generation tasks, the model tends to activate discriminative regions at the image level rather than pursuing pixel-level classification results. Moreover, insufficient supervision leads to unrestricted attention diffusion in the model, further introducing inter-class recognition noise. In this paper, we introduce a framework that employs Saliency Perception and Uncertainty, which includes a Saliency Perception Module (SPM) with Pixel-wise Transfer Loss (SP-PT), and an Uncertainty-guided Noise Suppression method. Specifically, within the SPM, we employ a hybrid attention mechanism to expand the receptive field of the module and enhance its ability to perceive salient object features. Meanwhile, a Pixel-wise Transfer Loss is designed to guide the attention diffusion of the classification model to non-discriminative regions at the pixel-level, thereby mitigating the bias of the model. To further enhance the robustness of CAM for obtaining more accurate pseudo-masks, we propose a noise suppression method based on uncertainty estimation, which applies a confidence matrix to the loss function to suppress the propagation of erroneous information and correct it, thus making the model more robust to noise. We conducted experiments on the PASCAL VOC 2012 and MS COCO 2014, and the experimental results demonstrate the effectiveness of our proposed framework. Code is available at https://github.com/pur-suit/SPU.
引用
收藏
页码:2891 / 2906
页数:16
相关论文
共 50 条
  • [31] Clustering-Guided Class Activation for Weakly Supervised Semantic Segmentation
    Kim, Yeong Woo
    Kim, Wonjun
    IEEE ACCESS, 2024, 12 : 4871 - 4880
  • [32] Weakly Supervised Semantic Segmentation Via Progressive Patch Learning
    Li, Jinlong
    Jie, Zequn
    Wang, Xu
    Zhou, Yu
    Wei, Xiaolin
    Ma, Lin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1686 - 1699
  • [33] Saliency as Pseudo-Pixel Supervision for Weakly and Semi-Supervised Semantic Segmentation
    Lee, Minhyun
    Lee, Seungho
    Lee, Jongwuk
    Shim, Hyunjung
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12341 - 12357
  • [34] Railroad is not a Train: Saliency as Pseudo-pixel Supervision for Weakly Supervised Semantic Segmentation
    Lee, Seungho
    Lee, Minhyun
    Lee, Jongwuk
    Shim, Hyunjung
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5491 - 5501
  • [35] Comprehensive mining of information in Weakly Supervised Semantic Segmentation: Saliency semantics and edge semantics
    Wang, Shaohui
    Shao, Youjia
    Tian, Na
    Zhao, Wencang
    NEURAL NETWORKS, 2024, 169 : 75 - 82
  • [36] Weakly supervised semantic segmentation via self-supervised destruction learning
    Li, Jinlong
    Jie, Zequn
    Wang, Xu
    Zhou, Yu
    Ma, Lin
    Jiang, Jianmin
    NEUROCOMPUTING, 2023, 561
  • [37] UTFNet: Uncertainty-Guided Trustworthy Fusion Network for RGB-Thermal Semantic Segmentation
    Wang, Qingwang
    Yin, Cheng
    Song, Haochen
    Shen, Tao
    Gu, Yanfeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [38] Uncertainty-Guided Voxel-Level Supervised Contrastive Learning for Semi-Supervised Medical Image Segmentation
    Hua, Yu
    Shu, Xin
    Wang, Zizhou
    Zhang, Lei
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (04)
  • [39] UNCERTAINTY-GUIDED ROBUST TRAINING FOR MEDICAL IMAGE SEGMENTATION
    Li, Yan
    Chen, Xiaoyi
    Quan, Li
    Zhang, Ni
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 1471 - 1475
  • [40] Weakly supervised fine-grained semantic segmentation via spatial correlation-guided learning
    Dong, Zihao
    Fang, Tiyu
    Li, Jinping
    Shao, Xiuli
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236