MECPformer: multi-estimations complementary patch with CNN-transformers for weakly supervised semantic segmentation

Cited by: 3
Authors
Liu, Chunmeng [1]
Li, Guangyao [1]
Shen, Yao [1]
Wang, Ruiqi [1]
Affiliations
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
Source
NEURAL COMPUTING & APPLICATIONS | 2023 / Vol. 35 / Issue 31
Keywords
Weakly supervised learning; Semantic segmentation; Transformer; CNN; Computer vision;
DOI
10.1007/s00521-023-08816-2
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The initial seed based on the convolutional neural network (CNN) for weakly supervised semantic segmentation always highlights the most discriminative regions but fails to identify the global target information. Transformer-based methods have since been proposed, benefiting from their ability to capture long-range feature representations. However, we observe a flaw despite these advantages: given a class, the initial seeds generated by the transformer may invade regions belonging to other classes. Motivated by these issues, we devise a simple yet effective method with a multi-estimations complementary patch (MECP) strategy and an adaptive conflict module (ACM), dubbed MECPformer. Given an image, we manipulate it with the MECP strategy at different epochs, and the network mines and deeply fuses the semantic information at different levels. In addition, ACM adaptively removes conflicting pixels and exploits the network's self-training capability to mine potential target information. Without bells and whistles, MECPformer reaches a new state-of-the-art 72.0% mIoU on PASCAL VOC 2012 and 42.4% on the MS COCO 2014 dataset. The code is available at https://github.com/ChunmengLiu1/MECPformer.
Pages: 23249-23264
Page count: 16
Related papers
50 in total
  • [41] JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
    Guo, Rongxin
    Sun, Xian
    Chen, Kaiqiang
    Zhou, Xiao
    Yan, Zhiyuan
    Diao, Wenhui
    Yan, Menglong
    REMOTE SENSING, 2020, 12 (19) : 1 - 18
  • [42] Multi-scale feature correspondence and pseudo label retraining strategy for weakly supervised semantic segmentation
    Wang, Weizheng
    Zhou, Lei
    Wang, Haonan
    IMAGE AND VISION COMPUTING, 2024, 150
  • [43] RSS-Net: Weakly-Supervised Multi-Class Semantic Segmentation with FMCW Radar
    Kaul, Prannay
    De Martini, Daniele
    Gadd, Matthew
    Newman, Paul
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 431 - 436
  • [44] A Weakly Supervised Semantic Segmentation Network by Aggregating Seed Cues: The Multi-Object Proposal Generation Perspective
    Xiao, Junsheng
    Xu, Huahu
    Gao, Honghao
    Bian, Minjie
    Li, Yang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [45] A Self-Training Framework Based on Multi-Scale Attention Fusion for Weakly Supervised Semantic Segmentation
    Yang, Guoqing
    Zhu, Chuang
    Zhang, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 876 - 881
  • [46] Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point Clouds
    Wei, Jiacheng
    Lin, Guosheng
    Yap, Kim-Hui
    Hung, Tzu-Yi
    Xie, Lihua
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4383 - 4392
  • [47] Multi-class Token-Guided End-to-End Weakly Supervised Image Semantic Segmentation Method
    Cao, Yifan
    He, Lijun
    Ma, Ting
    Li, Fan
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 93 - 106
  • [48] Weakly supervised pavement crack semantic segmentation based on multi-scale object localization and incremental annotation refinement
    Al-Huda, Zaid
    Peng, Bo
    Algburi, Riyadh Nazar Ali
    Alfasly, Saghir
    Li, Tianrui
    APPLIED INTELLIGENCE, 2023, 53 (11) : 14527 - 14546
  • [49] Weakly supervised pavement crack semantic segmentation based on multi-scale object localization and incremental annotation refinement
    Zaid Al-Huda
    Bo Peng
    Riyadh Nazar Ali Algburi
    Saghir Alfasly
    Tianrui Li
    Applied Intelligence, 2023, 53 : 14527 - 14546
  • [50] Beyond Pixels: Semi-supervised Semantic Segmentation with a Multi-scale Patch-Based Multi-label Classifier
    Howlader, Prantik
    Das, Srijan
    Le, Hieu
    Samaras, Dimitris
    COMPUTER VISION - ECCV 2024, PT LXXV, 2025, 15133 : 342 - 360