Segmentation from localization: a weakly supervised semantic segmentation method for resegmenting CAM

Authors
Jiang, Jingjing [1]
Wang, Hongxia [1]
Wu, Jiali [1]
Liu, Chun [1]
Affiliation
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Hubei, Peoples R China
Keywords
Image segmentation; Weakly supervised semantic segmentation; Class activation map; Class-agnostic segmentation
DOI
10.1007/s11042-023-17779-4
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Semantic segmentation has wide applications in computer vision. Because pixel-level annotation is labor-intensive, weakly supervised semantic segmentation (WSSS) methods based on image-level labels have become an important research topic. However, existing WSSS methods based on image-level labels suffer from sparse segmentation results and inaccurate object boundaries. To overcome these problems, we propose a novel locate-then-segment framework that separates the localization process from the segmentation process of WSSS. During localization, we use the class activation map (CAM) to find the rough position of the object, as most WSSS methods do. During segmentation, we focus on designing an object segmenter that refines the CAM to obtain the pseudo mask. The object segmenter consists of a dual localization feature fusion module and a boundary enhancement decoder. The former extracts the semantic features of the object and finds the whole object; the latter examines long-range pixels to search for the exact object boundary. Additionally, we use extra pixel-level labels to train the object segmenter and add constraints to optimize its training process. Finally, we apply the trained object segmenter to the weakly supervised segmentation data to improve the prediction results of CAM. Experimental results show that the proposed method significantly improves the quality of pseudo masks and obtains competitive segmentation results. Compared with existing methods, our method achieves the best result on the PASCAL VOC 2012 validation set, with 68.8% mIoU, and a competitive result on the test set, with 67.9% mIoU. On the MS COCO 2014 validation set, our method achieves 36.5% mIoU, outperforming all CNN-based methods and ranking second only to transformer-based methods. Code is available at https://github.com/wjlbnw/SegmentationFromLocalization.
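The localization step mentioned in the abstract can be illustrated with a minimal sketch of the standard class activation map formulation: a weighted sum of the final convolutional feature maps, using the classifier weights of one class. This is not taken from the authors' repository; the function names `class_activation_map` and `rough_mask`, the toy inputs, and the 0.4 threshold are all assumptions chosen for illustration.

```python
from typing import List

Map2D = List[List[float]]


def class_activation_map(features: List[Map2D], class_weights: List[float]) -> Map2D:
    """Weighted sum of K feature maps using one class's classifier weights.

    Hypothetical helper: `features` holds K maps of size H x W and
    `class_weights` holds the K weights of the target class.
    """
    height, width = len(features[0]), len(features[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for fmap, weight in zip(features, class_weights):
        for y in range(height):
            for x in range(width):
                cam[y][x] += weight * fmap[y][x]
    # Keep positive evidence only, then scale to [0, 1].
    cam = [[max(value, 0.0) for value in row] for row in cam]
    peak = max(max(row) for row in cam)
    if peak > 0:
        cam = [[value / peak for value in row] for row in cam]
    return cam


def rough_mask(cam: Map2D, threshold: float = 0.5) -> List[List[int]]:
    """Binarize a normalized CAM into a coarse localization mask (assumed threshold)."""
    return [[1 if value >= threshold else 0 for value in row] for row in cam]


# Toy example: two 2x2 feature maps and their (made-up) class weights.
features = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 2.0], [0.0, 1.0]],
]
weights = [0.5, 1.0]
cam = class_activation_map(features, weights)
mask = rough_mask(cam, threshold=0.4)
```

A mask obtained this way is only a coarse localization cue; in the framework described above, refining it into a pseudo mask is the job of the separately trained object segmenter.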
Pages: 57785-57810
Number of pages: 26