Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression

被引:0
|
作者
Liu, Xinyi [1 ]
Huang, Guoheng [1 ]
Yuan, Xiaochen [2 ]
Zheng, Zewen [1 ]
Zhong, Guo [3 ]
Chen, Xuhang [4 ]
Pun, Chi-Man [5 ]
机构
[1] Guangdong Univ Technol, Guangzhou, Peoples R China
[2] Macao Polytech Univ, Macau, Peoples R China
[3] Guangdong Univ Foreign Studies, Guangzhou, Peoples R China
[4] Huizhou Univ, Huizhou, Peoples R China
[5] Univ Macau, Macau, Peoples R China
来源
关键词
Weakly Supervised Semantic Segmentation; Class Activation Mapping; Uncertainty estimation; Attention mechanism;
D O I
10.1007/s00371-024-03574-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Weakly Supervised Semantic Segmentation (WSSS) has become increasingly popular for achieving remarkable segmentation with only image-level labels. Current WSSS approaches extract Class Activation Mapping (CAM) from classification models to produce pseudo-masks for segmentation supervision. However, due to the gap between image-level supervised classification loss and pixel-level CAM generation tasks, the model tends to activate discriminative regions at the image level rather than pursuing pixel-level classification results. Moreover, insufficient supervision leads to unrestricted attention diffusion in the model, further introducing inter-class recognition noise. In this paper, we introduce a framework that employs Saliency Perception and Uncertainty, which includes a Saliency Perception Module (SPM) with Pixel-wise Transfer Loss (SP-PT), and an Uncertainty-guided Noise Suppression method. Specifically, within the SPM, we employ a hybrid attention mechanism to expand the receptive field of the module and enhance its ability to perceive salient object features. Meanwhile, a Pixel-wise Transfer Loss is designed to guide the attention diffusion of the classification model to non-discriminative regions at the pixel-level, thereby mitigating the bias of the model. To further enhance the robustness of CAM for obtaining more accurate pseudo-masks, we propose a noise suppression method based on uncertainty estimation, which applies a confidence matrix to the loss function to suppress the propagation of erroneous information and correct it, thus making the model more robust to noise. We conducted experiments on the PASCAL VOC 2012 and MS COCO 2014, and the experimental results demonstrate the effectiveness of our proposed framework. Code is available at https://github.com/pur-suit/SPU.
引用
收藏
页码:2891 / 2906
页数:16
相关论文
共 50 条
  • [21] Attention Guided Enhancement Network for Weakly Supervised Semantic Segmentation
    Zhang Zhe
    Wang Bilin
    Yu Zhezhou
    Zhao Fengzhi
    CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (04) : 896 - 907
  • [22] Distinct Class-Specific Saliency Maps for Weakly Supervised Semantic Segmentation
    Shimoda, Wataru
    Yanai, Keiji
    COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 218 - 234
  • [23] Weakly supervised semantic segmentation using distinct class specific saliency maps
    Shimoda, Wataru
    Yanai, Keiji
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 191
  • [24] Uncertainty-guided cross learning via CNN and transformer for semi-supervised honeycomb lung lesion segmentation
    Zhao, Zi-an
    Feng, Xiu-fang
    Ren, Xiao-qiang
    Dong, Yun-yun
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (24):
  • [25] Uncertainty-Guided Segmentation Network for Geospatial Object Segmentation
    Jia, Hongyu
    Yang, Wenwu
    Wang, Lin
    Li, Haolin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5824 - 5833
  • [26] Uncertainty-guided mutual consistency learning for semi-supervised medical image segmentation
    Zhang, Yichi
    Jiao, Rushi
    Liao, Qingcheng
    Li, Dongyang
    Zhang, Jicong
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 138
  • [27] Uncertainty-guided transformer for brain tumor segmentation
    Chen, Zan
    Peng, Chenxu
    Guo, Wenlong
    Xie, Lei
    Wang, Shanshan
    Zhuge, Qichuan
    Wen, Caiyun
    Feng, Yuanjing
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (12) : 3289 - 3301
  • [28] Uncertainty-guided transformer for brain tumor segmentation
    Zan Chen
    Chenxu Peng
    Wenlong Guo
    Lei Xie
    Shanshan Wang
    Qichuan Zhuge
    Caiyun Wen
    Yuanjing Feng
    Medical & Biological Engineering & Computing, 2023, 61 : 3289 - 3301
  • [29] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Wei Zhai
    Pingyu Wu
    Kai Zhu
    Yang Cao
    Feng Wu
    Zheng-Jun Zha
    International Journal of Computer Vision, 2024, 132 (3) : 750 - 775
  • [30] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Zhai, Wei
    Wu, Pingyu
    Zhu, Kai
    Cao, Yang
    Wu, Feng
    Zha, Zheng-Jun
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 750 - 775