Boosting Weakly-Supervised Image Segmentation via Representation, Transform, and Compensator

被引:0
|
作者
Wang, Chunyan [1 ]
Zhang, Dong [2 ]
Yan, Rui [3 ]
机构
[1] Nanjing Univ Sci & Technol, Dept Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[3] Nanjing Univ, Dept Comp Sci & Technol, Nanjing 210023, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Weakly-supervised learning; single-stage semantic segmentation; contrastive learning;
D O I
10.1109/TCSVT.2024.3413778
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Weakly-supervised image segmentation (WSIS) is a fundamental task in the domain of computer vision that relies on image-level class labels. While multi-stage training procedures have been widely used in existing WSIS methods to obtain high-quality pseudo-masks as ground-truth, resulting in significant progress, single-stage WSIS methods have recently gained attention due to their potential for simplifying the training procedure. However, single-stage methods suffer from low-quality pseudo-masks that limit their practical applications. To address this problem, this paper proposes a novel single-stage WSIS method that utilizes a siamese network with contrastive learning to improve the quality of class activation maps (CAMs) and achieve a self-refinement result. The proposed method employs a cross-representation refinement method that expands reliable object regions by utilizing different feature representations from the backbone. Besides, a cross-transform regularization module is introduced that learns robust class prototypes for contrastive learning and captures global context information to feed back rough CAMs, thereby improving the quality of CAMs. The final high-quality CAMs are used as pseudo-masks to supervise the segmentation result. Experimental results on the PASCAL VOC 2012 and COCO datasets demonstrate that the proposed method significantly outperforms other state-of-the-art methods, achieving 72.38% and 72.95% mIoU on PASCAL VOC 2012 val set and test set, 42.51% mIoU on COCO val set, respectively. Furthermore, the proposed method has been extended to weakly supervised object localization, and experimental results demonstrate that it continues to achieve very competitive results. The source codes have been released at https://github.com/ChunyanWang1/RTC.
引用
收藏
页码:11013 / 11025
页数:13
相关论文
共 50 条
  • [21] Weakly-Supervised Image Semantic Segmentation Based on Superpixel Region Merging
    Jiang, Quanchun
    Tawose, Olamide Timothy
    Pei, Songwen
    Chen, Xiaodong
    Jiang, Linhua
    Wang, Jiayao
    Zhao, Dongfang
    BIG DATA AND COGNITIVE COMPUTING, 2019, 3 (02) : 1 - 20
  • [22] Partial Image Texture Translation Using Weakly-Supervised Semantic Segmentation
    Benitez-Garcia, Gibran
    Shimoda, Wataru
    Matsuo, Shin
    Yanai, Keiji
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2019, 2020, 12331 : 387 - 401
  • [23] GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic Segmentation
    Pu, Mengyang
    Huang, Yaping
    Guan, Qingji
    Zou, Qi
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 483 - 491
  • [24] A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains
    Chan, Lyndon
    Hosseini, Mahdi S.
    Plataniotis, Konstantinos N.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 361 - 384
  • [25] Seed, Expand and Constrain: Three Principles for Weakly-Supervised Image Segmentation
    Kolesnikov, Alexander
    Lampert, Christoph H.
    COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 695 - 711
  • [26] A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains
    Lyndon Chan
    Mahdi S. Hosseini
    Konstantinos N. Plataniotis
    International Journal of Computer Vision, 2021, 129 : 361 - 384
  • [27] Weakly-supervised Object Representation Learning for Few-shot Semantic Segmentation
    Ying, Xiaowen
    Li, Xin
    Chuah, Mooi Choo
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1496 - 1505
  • [28] Scribble2D5: Weakly-Supervised Volumetric Image Segmentation via Scribble Annotations
    Chen, Qiuhui
    Hong, Yi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VIII, 2022, 13438 : 234 - 243
  • [29] On Regularized Losses for Weakly-supervised CNN Segmentation
    Tang, Meng
    Perazzi, Federico
    Djelouah, Abdelaziz
    Ben Ayed, Ismail
    Schroers, Christopher
    Boykov, Yuri
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 524 - 540
  • [30] Weakly-Supervised Audio-Visual Segmentation
    Mo, Shentong
    Raj, Bhiksha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,