Token Contrast for Weakly-Supervised Semantic Segmentation

Cited by: 29
Authors
Ru, Lixiang [1, 2, 3]
Zheng, Hehang [3]
Zhan, Yibing [3]
Du, Bo [1, 2]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Inst Artificial Intelligence, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] JD Explore Acad, Beijing, Peoples R China
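Funding
National Natural Science Foundation of China
DOI
10.1109/CVPR52729.2023.00302
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract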
Weakly-Supervised Semantic Segmentation (WSSS) using image-level labels typically utilizes Class Activation Map (CAM) to generate the pseudo labels. Limited by the local structure perception of CNN, CAM usually cannot identify the integral object regions. Though the recent Vision Transformer (ViT) can remedy this flaw, we observe it also brings the over-smoothing issue, i.e., the final patch tokens incline to be uniform. In this work, we propose Token Contrast (ToCo) to address this issue and further explore the virtue of ViT for WSSS. Firstly, motivated by the observation that intermediate layers in ViT can still retain semantic diversity, we designed a Patch Token Contrast module (PTC). PTC supervises the final patch tokens with the pseudo token relations derived from intermediate layers, allowing them to align the semantic regions and thus yield more accurate CAM. Secondly, to further differentiate the low-confidence regions in CAM, we devised a Class Token Contrast module (CTC) inspired by the fact that class tokens in ViT can capture high-level semantics. CTC facilitates the representation consistency between uncertain local regions and global objects by contrasting their class tokens. Experiments on the PASCAL VOC and MS COCO datasets show the proposed ToCo can remarkably surpass other single-stage competitors and achieve comparable performance with state-of-the-art multi-stage methods. Code is available at https://github.com/rulixiang/ToCo.
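The two ideas in the abstract, over-smoothed patch tokens and contrast driven by pseudo token relations, can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). The function names, the toy 2-D token vectors, the `ignore` label for low-confidence tokens, and the exact loss form (penalize `1 - cos` for same-label pairs and positive `cos` for different-label pairs) are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def mean_pairwise_similarity(tokens):
    """Average cosine similarity over all distinct token pairs.

    Values close to 1 mean the tokens are nearly uniform, which is one
    simple way to quantify the over-smoothing the abstract describes.
    """
    n = len(tokens)
    if n < 2:
        raise ValueError("need at least two tokens")
    sims = [cosine(tokens[i], tokens[j]) for i in range(n) for j in range(i + 1, n)]
    return sum(sims) / len(sims)


def token_contrast_loss(tokens, pseudo_labels, ignore=-1):
    """Hypothetical contrast loss on final-layer patch tokens.

    pseudo_labels stands in for the pseudo token relations that the abstract
    says are derived from intermediate layers (assumed here to be one integer
    label per token). Pairs involving the `ignore` label are skipped.
    """
    pos, neg = [], []
    n = len(tokens)
    for i in range(n):
        for j in range(i + 1, n):
            if pseudo_labels[i] == ignore or pseudo_labels[j] == ignore:
                continue
            sim = cosine(tokens[i], tokens[j])
            if pseudo_labels[i] == pseudo_labels[j]:
                pos.append(1.0 - sim)      # pull same-label tokens together
            else:
                neg.append(max(sim, 0.0))  # push different-label tokens apart
    pos_term = sum(pos) / len(pos) if pos else 0.0
    neg_term = sum(neg) / len(neg) if neg else 0.0
    return pos_term + neg_term


if __name__ == "__main__":
    uniform = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]    # over-smoothed tokens
    separated = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]  # semantically distinct tokens
    labels = [0, 0, 1]
    print(mean_pairwise_similarity(uniform), token_contrast_loss(uniform, labels))
    print(mean_pairwise_similarity(separated), token_contrast_loss(separated, labels))
```

In this toy setting, uniform tokens give a mean pairwise similarity near 1 and a high loss, while tokens that already follow the pseudo labels give a loss near 0.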
Pages: 3093-3102
Page count: 10