Weakly-supervised Semantic Segmentation with Image-level Labels: From Traditional Models to Foundation Models

被引:0
|
作者
Chen, Zhaozheng [1 ]
Sun, Qianru [1 ]
机构
[1] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
关键词
Weakly supervised; semantic segmentation; segment anything model;
D O I
10.1145/3707447
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The rapid development of deep learning has driven significant progress in image semantic segmentation-a fundamental task in computer vision. Semantic segmentation algorithms often depend on the availability of pixel-level labels (i.e., masks of objects), which are expensive, time consuming, and labor intensive. Weakly supervised semantic segmentation (WSSS) is an effective solution to avoid such labeling. It utilizes only partial or incomplete annotations and provides a cost-effective alternative to fully supervised semantic segmentation. In this article, our focus is on the WSSS with image-level labels, which is the most challenging form of WSSS. Our work has two parts. First, we conduct a comprehensive survey on traditional methods, primarily focusing on those presented at premier research conferences. We categorize them into four groups based on where their methods operate: pixel-wise, image-wise, cross-image, and external data. Second, we investigate the applicability of visual foundation models, such as the Segment Anything Model (SAM), in the context of WSSS. We scrutinize SAM in two intriguing scenarios: text prompting and zero-shot learning. We provide insights into the potential and challenges of deploying visual foundational models for WSSS, facilitating future developments in this exciting research area. Our code is provided at this link: https://github.com/ zhaozhengChen/SAM_WSSS.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] An One-step Triple Enhanced weakly supervised semantic segmentation using image-level labels
    Quan, Longjie
    Huang, Dandan
    Liu, Zhi
    Gao, Kai
    Mi, Xiaohong
    PLOS ONE, 2024, 19 (10):
  • [2] WAILS: Watershed Algorithm With Image-Level Supervision for Weakly Supervised Semantic Segmentation
    Zhou, Hongming
    Song, Kang
    Zhang, Xianglei
    Gui, Wenyong
    Qian, Qiusuo
    IEEE ACCESS, 2019, 7 : 42745 - 42756
  • [3] Exploiting Confident Information for Weakly Supervised Prostate Segmentation based on Image-level Labels
    Chen, Zhang
    Tian, Zhiqiang
    Li, Xiaojian
    Zhang, Yinshu
    MEDICAL IMAGING 2020: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2021, 11315
  • [4] Weakly-Supervised Dual Clustering for Image Semantic Segmentation
    Liu, Yang
    Liu, Jing
    Li, Zechao
    Tang, Jinhui
    Lu, Hanqing
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2075 - 2082
  • [5] Coarse-to-Fine Semantic Segmentation From Image-Level Labels
    Jing, Longlong
    Chen, Yucheng
    Tian, Yingli
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 225 - 236
  • [6] WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models
    Wu, Xin-Jian
    Zhang, Ruisong
    Qin, Jie
    Ma, Shijie
    Liu, Cheng-Lin
    COMPUTER VISION-ECCV 2024, PT XLIV, 2025, 15102 : 314 - 333
  • [7] Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition
    Zhu, Peipei
    Wang, Xiao
    Luo, Yong
    Sun, Zhenglong
    Zheng, Wei-Shi
    Wang, Yaowei
    Chen, Changwen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6702 - 6716
  • [8] Boosted MIML method for weakly-supervised image semantic segmentation
    Liu, Yang
    Li, Zechao
    Liu, Jing
    Lu, Hanqing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 543 - 559
  • [9] Weakly-supervised Semantic Segmentation in Cityscape via Hyperspectral Image
    Huang, Yuxing
    Shen, Qiu
    Fu, Ying
    You, Shaodi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1117 - 1126
  • [10] A Weakly-Supervised Approach for Semantic Segmentation
    Feng, Yanqing
    Wang, Lunwen
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 2311 - 2314