Enhancing Weakly Supervised Semantic Segmentation through Patch-Based Refinement

被引:0
|
作者
Tajrishi, Narges Javid [1 ]
Afshar, Sepehr Amini [1 ]
Kasaei, Shohreh [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
Weakly-supervised Semantic Segmentation; Image Classification; Deep Learning; Vision Transformer;
D O I
10.1109/MVIP62238.2024.10491171
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly-Supervised Semantic Segmentation (WSSS) with image-level labels, commonly uses Class Activation Maps (CAM) to generate pseudo-labels. However, Convolutional Neural Networks (CNNs), with their limited local receptive field, often struggle to identify entire object regions. Recently, the Vision Transformer (ViT) architecture has been employed instead of CNNs to capture long-range feature dependencies, by using the self-attention mechanism. Despite its advantages, ViT tends to overlook local feature details, leading to attention maps with low quality and unclear object details. This paper introduces a novel method to enhance the local details in attention maps by leveraging local patches. These local patches are selected from regions that are more likely to contain the desired objects. By effectively utilizing these local patches during the training and generation stages, the model yields more detailed attention maps. Extensive experiments were conducted on the PASCAL VOC 2012 benchmark dataset to demonstrate the efficacy of the proposed approach. The results show significant improvements (+2.6% mIoU) with minimal computational overhead, underscoring the potential of the proposed method in the field of Weakly-Supervised Semantic Segmentation.
引用
收藏
页码:70 / 75
页数:6
相关论文
共 50 条
  • [1] Patch-based weakly supervised semantic segmentation network for crack detection
    Dong, Zhiming
    Wang, Jiajun
    Cui, Bo
    Wang, Dong
    Wang, Xiaoling
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2020, 258
  • [2] Weakly Supervised Semantic Segmentation with Patch-Based Metric Learning Enhancement
    Chan, Patrick P. K.
    Chen, Keke
    Xu, Linyi
    Hu, Xiaoman
    Yeung, Daniel S.
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 : 471 - 482
  • [3] Complementary Patch for Weakly Supervised Semantic Segmentation
    Zhang, Fei
    Gu, Chaochen
    Zhang, Chenyue
    Dai, Yuchao
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7222 - 7231
  • [5] Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement
    Yuan, Zheng
    Zhang, Jie
    Wang, Yude
    Shan, Shiguang
    Chen, Xilin
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 5270 - 5292
  • [6] EnNuSegNet: Enhancing Weakly Supervised Nucleus Segmentation through Feature Preservation and Edge Refinement
    Chen, Xiaohui
    Ruan, Qisheng
    Chen, Lingjun
    Sheng, Guanqun
    Chen, Peng
    [J]. ELECTRONICS, 2024, 13 (03)
  • [7] Weakly Supervised Semantic Segmentation Via Progressive Patch Learning
    Li, Jinlong
    Jie, Zequn
    Wang, Xu
    Zhou, Yu
    Wei, Xiaolin
    Ma, Lin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1686 - 1699
  • [8] TransCAM: Transformer attention-based CAM refinement for Weakly supervised semantic segmentation
    Li, Ruiwen
    Mai, Zheda
    Zhang, Zhibo
    Jang, Jongseong
    Sanner, Scott
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 92
  • [9] Weakly Supervised Building Semantic Segmentation Based on Spot-Seeds and Refinement Process
    Moghalles, Khaled
    Li, Heng-Chao
    Alazeb, Abdulwahab
    [J]. ENTROPY, 2022, 24 (05)
  • [10] Cross-Patch Relation Enhanced for Weakly Supervised Semantic Segmentation
    [J]. Lu, Zongqing (luzq@sz.tsinghua.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc.