Boosting Weakly-Supervised Image Segmentation via Representation, Transform, and Compensator

被引:0
|
作者
Wang, Chunyan [1 ]
Zhang, Dong [2 ]
Yan, Rui [3 ]
机构
[1] Nanjing Univ Sci & Technol, Dept Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[3] Nanjing Univ, Dept Comp Sci & Technol, Nanjing 210023, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Weakly-supervised learning; single-stage semantic segmentation; contrastive learning;
D O I
10.1109/TCSVT.2024.3413778
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Weakly-supervised image segmentation (WSIS) is a fundamental task in the domain of computer vision that relies on image-level class labels. While multi-stage training procedures have been widely used in existing WSIS methods to obtain high-quality pseudo-masks as ground-truth, resulting in significant progress, single-stage WSIS methods have recently gained attention due to their potential for simplifying the training procedure. However, single-stage methods suffer from low-quality pseudo-masks that limit their practical applications. To address this problem, this paper proposes a novel single-stage WSIS method that utilizes a siamese network with contrastive learning to improve the quality of class activation maps (CAMs) and achieve a self-refinement result. The proposed method employs a cross-representation refinement method that expands reliable object regions by utilizing different feature representations from the backbone. Besides, a cross-transform regularization module is introduced that learns robust class prototypes for contrastive learning and captures global context information to feed back rough CAMs, thereby improving the quality of CAMs. The final high-quality CAMs are used as pseudo-masks to supervise the segmentation result. Experimental results on the PASCAL VOC 2012 and COCO datasets demonstrate that the proposed method significantly outperforms other state-of-the-art methods, achieving 72.38% and 72.95% mIoU on PASCAL VOC 2012 val set and test set, 42.51% mIoU on COCO val set, respectively. Furthermore, the proposed method has been extended to weakly supervised object localization, and experimental results demonstrate that it continues to achieve very competitive results. The source codes have been released at https://github.com/ChunyanWang1/RTC.
引用
收藏
页码:11013 / 11025
页数:13
相关论文
共 50 条
  • [31] Weakly-Supervised RGBD Video Object Segmentation
    Yang, Jinyu
    Gao, Mingqi
    Zheng, Feng
    Zhen, Xiantong
    Ji, Rongrong
    Shao, Ling
    Leonardis, Ales
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2158 - 2170
  • [32] IMPORTANCE SAMPLING CAMS FOR WEAKLY-SUPERVISED SEGMENTATION
    Jonnarth, Arvi
    Felsberg, Michael
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2639 - 2643
  • [33] Token Contrast for Weakly-Supervised Semantic Segmentation
    Ru, Lixiang
    Zheng, Hehang
    Zhan, Yibing
    Du, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3093 - 3102
  • [34] Rethinking CAM in Weakly-Supervised Semantic Segmentation
    Song, Yuqi
    Li, Xiaojie
    Shi, Canghong
    Feng, Shihao
    Wang, Xin
    Luo, Yong
    Xi, Wu
    IEEE ACCESS, 2022, 10 : 126440 - 126450
  • [35] WEAKLY-SUPERVISED PLATE AND FOOD REGION SEGMENTATION
    Shimoda, Wataru
    Yanai, Keiji
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [36] Figure-Ground Image Segmentation Helps Weakly-Supervised Learning of Objects
    Fragkiadaki, Katerina
    Shi, Jianbo
    COMPUTER VISION - ECCV 2010, PT VI, 2010, 6316 : 561 - 574
  • [37] Doppler Image-Based Weakly-Supervised Vascular Ultrasound Segmentation with Transformer
    Ning, Guochen
    Liang, Hanying
    Chen, Fang
    Zhang, Xinran
    Liao, Hongen
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [38] Weakly-Supervised Medical Image Segmentation Based on Multi-task Learning
    Xie, Xuanhua
    Fan, Huijie
    Yu, Zhencheng
    Bai, Haijun
    Tang, Yandong
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT II, 2022, 13456 : 395 - 404
  • [39] Weakly-Supervised Image Parsing via Constructing Semantic Graphs and Hypergraphs
    Xie, Wenxuan
    Peng, Yuxin
    Xiao, Jianguo
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 277 - 286
  • [40] Predicting Segmentation "Easiness" from the Consistency for Weakly-Supervised Segmentation
    Shimoda, Wataru
    Yanai, Keiji
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 292 - 297