Learning Visual Words for Weakly-Supervised Semantic Segmentation

被引:0
|
作者
Ru, Lixiang [1 ,2 ]
Du, Bo [1 ,2 ]
Wu, Chen [3 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Inst Artificial Intelligence, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] Wuhan Univ, LIESMARS, Wuhan, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current weakly-supervised semantic segmentation (WSSS) methods with image-level labels mainly adopt class activation maps (CAM) to generate the initial pseudo labels. However, CAM usually only identifies the most discriminative object extents, which is attributed to the fact that the network doesn't need to discover the integral object to recognize image-level labels. In this work, to tackle this problem, we proposed to simultaneously learn the image-level labels and local visual word labels. Specifically, in each forward propagation, the feature maps of the input image will be encoded to visual words with a learnable codebook. By enforcing the network to classify the encoded fine-grained visual words, the generated CAM could cover more semantic regions. Besides, we also proposed a hybrid spatial pyramid pooling module that could preserve local maximum and global average values of feature maps, so that more object details and less background were considered. Based on the proposed methods, we conducted experiments on the PASCAL VOC 2012 dataset. Our proposed method achieved 67.2% mIoU on the val set and 67.3% mIoU on the test set, which outperformed recent state-of-the-art methods.
引用
收藏
页码:982 / 988
页数:7
相关论文
共 50 条
  • [31] CyCSNet: Learning Cycle-Consistency of Semantics for Weakly-Supervised Semantic Segmentation
    Duan, Zhikui
    Yu, Xinmei
    Ding, Yi
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2024, E107A (08) : 1328 - 1337
  • [32] Transformer Based Prototype Learning for Weakly-Supervised Histopathology Tissue Semantic Segmentation
    She, Jinwen
    Hu, Yanxu
    Ma, Andy J.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 203 - 215
  • [33] Weakly-supervised Object Representation Learning for Few-shot Semantic Segmentation
    Ying, Xiaowen
    Li, Xin
    Chuah, Mooi Choo
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1496 - 1505
  • [34] CSENet: Cascade semantic erasing network for weakly-supervised semantic segmentation
    Liu, Jiahui
    Yu, Changqian
    Yang, Beibei
    Gao, Changxin
    Sang, Nong
    NEUROCOMPUTING, 2021, 453 : 885 - 895
  • [35] Boosted MIML method for weakly-supervised image semantic segmentation
    Liu, Yang
    Li, Zechao
    Liu, Jing
    Lu, Hanqing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 543 - 559
  • [36] Semantic-Transferable Weakly-Supervised Endoscopic Lesions Segmentation
    Dong, Jiahua
    Cong, Yang
    Sun, Gan
    Hou, Dongdong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10711 - 10720
  • [37] Saliency Background Guided Network for Weakly-Supervised Semantic Segmentation
    Bai X.
    Li W.
    Wang W.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (09): : 824 - 835
  • [38] Weakly-supervised semantic segmentation with saliency and incremental supervision updating
    Luo, Wenfeng
    Yang, Meng
    Zheng, Weishi
    PATTERN RECOGNITION, 2021, 115
  • [39] Class agnostic and specific consistency learning for weakly-supervised point cloud semantic segmentation
    Wu, Junwei
    Sun, Mingjie
    Xu, Haotian
    Jiang, Chenru
    Ma, Wuwei
    Zhang, Quan
    PATTERN RECOGNITION, 2025, 158
  • [40] WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations
    Liu, Peidong
    He, Zibin
    Yan, Xiyu
    Jiang, Yong
    Xia, Shu-Tao
    Zheng, Feng
    Hu, Maowei
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2995 - 3004