A Weakly Supervised Semantic Segmentation Method Based on Improved Conformer

Citations: 0
Authors
Shen, Xueli [1 ]
Wang, Meng [1 ]
Affiliation
[1] Liaoning Tech Univ, Sch Software, Huludao 125105, Peoples R China
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2025 / Vol. 82 / Issue 03
Keywords
WSSS; CAM; transformer; CNN; multi-scale feature extraction; lightweight;
DOI
10.32604/cmc.2025.05914
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
In Weakly Supervised Semantic Segmentation (WSSS), methods based on image-level annotation struggle to capture objects of varying sizes accurately, lack sensitivity to image details, and incur high computational costs. To address these issues, we improve the dual-branch architecture of the Conformer and use it as the backbone network for generating class activation maps (CAMs), proposing a multi-scale, efficient weakly supervised semantic segmentation method. In the Convolutional Neural Network (CNN) branch, a cross-scale feature integration convolution module is designed, incorporating multi-receptive-field convolution layers to enhance the model's ability to capture long-range dependencies and improve its sensitivity to multi-scale objects. In the Vision Transformer (ViT) branch, an efficient multi-head self-attention module is developed that reduces unnecessary computation through spatial compression and feature partitioning, thereby improving overall network efficiency. Finally, a multi-feature coupling module is introduced so that the features generated by the two branches complement each other. This design retains the strength of the CNN in extracting local details while harnessing the strength of the ViT in capturing comprehensive global features. Experimental results show that, on the validation and test sets of the PASCAL VOC 2012 dataset, the mean Intersection over Union (mIoU) of the proposed method improves by 2.9% and 3.6%, respectively, over the TransCAM algorithm. The improved model also shows a 1.3% increase in mIoU on the COCO 2014 dataset. Additionally, the number of parameters and the floating-point operations are reduced by 16.2% and 12.9%, respectively. However, the proposed method still performs poorly in complex scenarios, and further work is needed to address this limitation.
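The abstract reports its results as mean Intersection over Union. As a rough illustration of that metric only (not the authors' evaluation code), the sketch below computes it for a single pair of label maps. The function name `mean_iou`, the nested-list input format, and the choice to average only over classes that appear in the prediction or the ground truth are assumptions made for this example; benchmark protocols such as PASCAL VOC accumulate counts over the whole dataset and ignore void pixels, which is omitted here.

```python
def mean_iou(pred, target, num_classes):
    """Simplified mean IoU for one pair of 2-D label maps (lists of lists of ints).

    Hypothetical helper for illustration; it averages per-class IoU over
    the classes that occur in either map.
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == t:
                # Pixel counted once in both intersection and union of its class.
                intersection[p] += 1
                union[p] += 1
            else:
                # Mismatch: the pixel enlarges the union of both classes involved.
                union[p] += 1
                union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x3 example with two classes (0 = background, 1 = object).
pred = [[0, 0, 1],
        [0, 1, 1]]
target = [[0, 1, 1],
          [0, 1, 1]]
print(mean_iou(pred, target, num_classes=2))
```

In this toy case class 0 has IoU 2/3 and class 1 has IoU 3/4, so the mean is 17/24. An improvement such as the reported 2.9% corresponds to a shift in this averaged value measured over the full benchmark.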
Pages: 4631 - 4647
Page count: 17
Related papers
50 in total
  • [31] Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation
    Zhou, Tianfei
    Zhang, Meijie
    Zhao, Fang
    Li, Jianwu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4289 - 4299
  • [32] Weakly supervised semantic segmentation based on EM algorithm with localization clues
    Li, Yang
    Liu, Yang
    Liu, Guojun
    Zhai, Deming
    Guo, Maozu
    NEUROCOMPUTING, 2018, 275 : 2574 - 2587
  • [33] Weakly Supervised Semantic Segmentation Based on Superpixel Sampling Clustering Networks
    Xiao, Jun-sheng
    Xu, Hua-hu
    Ma, Xiao-jin
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSSE 2019), 2019,
  • [34] Weakly supervised point cloud semantic segmentation based on scene consistency
    Niu, Yingchun
    Yin, Jianqin
    Qi, Chao
    Geng, Liang
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12439 - 12452
  • [35] A variant of WSL Framework for Weakly Supervised Semantic Segmentation
    Ma, Ling-Yun
    2018 3RD INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE), 2018, : 520 - 523
  • [36] Weakly supervised semantic segmentation by knowledge graph inference
    Zhang, Jia
    Peng, Bo
    Wu, Xi
    Hu, Jie
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [37] AAR: Attention Remodulation for Weakly Supervised Semantic Segmentation
    Lin, Yu-e
    Li, Houguo
    Liang, Xingzhu
    Li, Mengfan
    Liu, Huilin
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (07) : 9096 - 9114
  • [38] Token Contrast for Weakly-Supervised Semantic Segmentation
    Ru, Lixiang
    Zheng, Hehang
    Zhan, Yibing
    Du, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3093 - 3102
  • [39] A Weakly Supervised Deep Learning Semantic Segmentation Framework
    Zhang, Jizhi
    Zhang, Guoying
    Wang, Qiangyu
    Bai, Shuang
    2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, : 182 - 185
  • [40] Image Piece Learning for Weakly Supervised Semantic Segmentation
    Li, Yi
    Guo, Yanqing
    Kao, Yueying
    He, Ran
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (04) : 648 - 659