Learning graph structures with transformer for weakly supervised semantic segmentation

被引:0
|
作者
Sun, Wanchun [1 ]
Feng, Xin [1 ,2 ]
Ma, Hui [3 ]
Liu, Jingyao [1 ,4 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China
[2] Changchun Univ Sci & Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China
[3] Anhui Vocat Coll Police Officers, Comp Basic Teaching & Res Dept, Hefei 232001, Peoples R China
[4] Chuzhou Univ, Sch Comp & Informat Engn, Chuzhou 239000, Peoples R China
关键词
Weakly supervised; Transformer; Graph convolutional network; Semantic segmentation;
D O I
10.1007/s40747-023-01152-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised semantic segmentation (WSSS) is a challenging task of computer vision. The state-of-the-art semantic segmentation methods are usually based on the convolutional neural network (CNN), which mainly have the drawbacks of inability to explore the global information correctly and failure to activate potential object regions. To avoid such drawbacks, the transformer approach is explored in the WSSS task, but no effective semantic association between different patch tokens can be determined in the transformer. To address this issue, inspired by the graph convolutional network (GCN), this paper proposes a graph structure to learn the semantic category relationships between different blocks in the vector sequence. To verify the effectiveness of the proposed method in this paper, a large number of experiments were conducted on the publicly available PASCAL VOC2012 dataset. The experimental results show that our proposed method achieves significant performance improvement in the WSSS task and outperforms other state-of-the-art transformer-based methods.
引用
收藏
页码:7511 / 7521
页数:11
相关论文
共 50 条
  • [1] Learning graph structures with transformer for weakly supervised semantic segmentation
    Wanchun Sun
    Xin Feng
    Hui Ma
    Jingyao Liu
    Complex & Intelligent Systems, 2023, 9 : 7511 - 7521
  • [2] Weakly supervised semantic segmentation by knowledge graph inference
    Zhang, Jia
    Peng, Bo
    Wu, Xi
    Hu, Jie
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [3] Transformer Based Prototype Learning for Weakly-Supervised Histopathology Tissue Semantic Segmentation
    She, Jinwen
    Hu, Yanxu
    Ma, Andy J.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 203 - 215
  • [4] Weakly supervised graph based semantic segmentation by learning communities of image-parts
    Pourian, Niloufar
    Karthikeyan, S.
    Manjunath, B. S.
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1359 - 1367
  • [5] Image Piece Learning for Weakly Supervised Semantic Segmentation
    Li, Yi
    Guo, Yanqing
    Kao, Yueying
    He, Ran
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (04): : 648 - 659
  • [6] A Weakly Supervised Deep Learning Semantic Segmentation Framework
    Zhang, Jizhi
    Zhang, Guoying
    Wang, Qiangyu
    Bai, Shuang
    2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, : 182 - 185
  • [7] Weakly Supervised Structured Output Learning for Semantic Segmentation
    Vezhnevets, Alexander
    Ferrari, Vittorio
    Buhmann, Joachim M.
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 845 - 852
  • [8] Weakly Supervised Semantic Segmentation Based on Deep Learning
    Liang, Binxiu
    Liu, Yan
    He, Linxi
    Li, Jiangyun
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC2019), 2020, 582 : 455 - 464
  • [9] Weakly Supervised Learning of Dense Semantic Correspondences and Segmentation
    Ufer, Nikolai
    Lui, Kam To
    Schwarz, Katja
    Warkentin, Paul
    Ommer, Bjoern
    PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 456 - 470
  • [10] Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Ouyang, Wanli
    Bennamoun, Mohammed
    Boussaid, Farid
    Xu, Dan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4300 - 4309