Learning Visual Words for Weakly-Supervised Semantic Segmentation

被引:0
|
作者
Ru, Lixiang [1 ,2 ]
Du, Bo [1 ,2 ]
Wu, Chen [3 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Inst Artificial Intelligence, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] Wuhan Univ, LIESMARS, Wuhan, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current weakly-supervised semantic segmentation (WSSS) methods with image-level labels mainly adopt class activation maps (CAM) to generate the initial pseudo labels. However, CAM usually only identifies the most discriminative object extents, which is attributed to the fact that the network doesn't need to discover the integral object to recognize image-level labels. In this work, to tackle this problem, we proposed to simultaneously learn the image-level labels and local visual word labels. Specifically, in each forward propagation, the feature maps of the input image will be encoded to visual words with a learnable codebook. By enforcing the network to classify the encoded fine-grained visual words, the generated CAM could cover more semantic regions. Besides, we also proposed a hybrid spatial pyramid pooling module that could preserve local maximum and global average values of feature maps, so that more object details and less background were considered. Based on the proposed methods, we conducted experiments on the PASCAL VOC 2012 dataset. Our proposed method achieved 67.2% mIoU on the val set and 67.3% mIoU on the test set, which outperformed recent state-of-the-art methods.
引用
收藏
页码:982 / 988
页数:7
相关论文
共 50 条
  • [41] Weakly-supervised Semantic Segmentation in Cityscape via Hyperspectral Image
    Huang, Yuxing
    Shen, Qiu
    Fu, Ying
    You, Shaodi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1117 - 1126
  • [42] Global Consistency Enhancement Network for Weakly-Supervised Semantic Segmentation
    Jiang, Le
    Yang, Xinhao
    Ma, Liyan
    Li, Zhenglin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 53 - 65
  • [43] IMAGE AUGMENTATION WITH CONTROLLED DIFFUSION FOR WEAKLY-SUPERVISED SEMANTIC SEGMENTATION
    Wu, Wangyu
    Dai, Tianhong
    Huang, Xiaowei
    Ma, Fei
    Xiao, Jimin
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6175 - 6179
  • [44] Pseudo-mask Matters in Weakly-supervised Semantic Segmentation
    Li, Yi
    Kuang, Zhanghui
    Liu, Liyang
    Chen, Yimin
    Zhang, Wayne
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6944 - 6953
  • [45] Weakly-Supervised Semantic Segmentation via Self-training
    Cheng, Hao
    Gu, Chaochen
    Wu, Kaijie
    2020 4TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND ARTIFICIAL INTELLIGENCE (CCEAI 2020), 2020, 1487
  • [46] Deep graph cut network for weakly-supervised semantic segmentation
    Feng, Jiapei
    Wang, Xinggang
    Liu, Wenyu
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (03)
  • [47] Deep graph cut network for weakly-supervised semantic segmentation
    Jiapei FENG
    Xinggang WANG
    Wenyu LIU
    ScienceChina(InformationSciences), 2021, 64 (03) : 57 - 68
  • [48] STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation
    Wei, Yunchao
    Liang, Xiaodan
    Chen, Yunpeng
    Shen, Xiaohui
    Cheng, Ming-Ming
    Feng, Jiashi
    Zhao, Yao
    Yan, Shuicheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (11) : 2314 - 2320
  • [49] Deep graph cut network for weakly-supervised semantic segmentation
    Jiapei Feng
    Xinggang Wang
    Wenyu Liu
    Science China Information Sciences, 2021, 64
  • [50] Boosted MIML method for weakly-supervised image semantic segmentation
    Yang Liu
    Zechao Li
    Jing Liu
    Hanqing Lu
    Multimedia Tools and Applications, 2015, 74 : 543 - 559