Group-Wise Learning for Weakly Supervised Semantic Segmentation

被引:73
|
作者
Zhou, Tianfei [1 ]
Li, Liulei [2 ]
Li, Xueyi [2 ]
Feng, Chun-Mei [3 ]
Li, Jianwu [2 ]
Shao, Ling [4 ]
机构
[1] Swiss Fed Inst Technol, Comp Vis Lab, CH-8092 Zurich, Switzerland
[2] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100811, Peoples R China
[3] Harbin Inst Technol Shenzhen, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China
[4] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
基金
北京市自然科学基金;
关键词
Semantics; Image segmentation; Training; Location awareness; Cognition; Task analysis; Graph neural networks; Semantic segmentation; weakly supervised learning; group-wise learning; graph neural networks; object localization; neural attention; NEURAL-NETWORK;
D O I
10.1109/TIP.2021.3132834
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acquiring sufficient ground-truth supervision to train deep visual models has been a bottleneck over the years due to the data-hungry nature of deep learning. This is exacerbated in some structured prediction tasks, such as semantic segmentation, which require pixel-level annotations. This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation. To achieve this, we propose, for the first time, a novel group-wise learning framework for WSSS. The framework explicitly encodes semantic dependencies in a group of images to discover rich semantic context for estimating more reliable pseudo ground-truths, which are subsequently employed to train more effective segmentation models. In particular, we solve the group-wise learning within a graph neural network (GNN), wherein input images are represented as graph nodes, and the underlying relations between a pair of images are characterized by graph edges. We then formulate semantic mining as an iterative reasoning process which propagates the common semantics shared by a group of images to enrich node representations. Moreover, in order to prevent the model from paying excessive attention to common semantics, we further propose a graph dropout layer to encourage the graph model to capture more accurate and complete object responses. With the above efforts, our model lays the foundation for more sophisticated and flexible group-wise semantic mining. We conduct comprehensive experiments on the popular PASCAL VOC 2012 and COCO benchmarks, and our model yields state-of-the-art performance. In addition, our model shows promising performance in weakly supervised object localization (WSOL) on the CUB-200-2011 dataset, demonstrating strong generalizability. Our code is available at: https://github.com/Lixy1997/Group-WSSS.
引用
收藏
页码:799 / 811
页数:13
相关论文
共 50 条
  • [1] Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation
    Li, Xueyi
    Zhou, Tianfei
    Li, Jianwu
    Zhou, Yi
    Zhang, Zhaoxiang
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1984 - 1992
  • [2] Weakly Supervised Group-Wise Model Learning Based on Discrete Optimization
    Donner, Rene
    Wildenauer, Horst
    Bischof, Horst
    Langs, Georg
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2009, PT II, PROCEEDINGS, 2009, 5762 : 860 - +
  • [3] GROUP-WISE FEATURE SELECTION FOR SUPERVISED LEARNING
    Xiao, Qi
    Li, Hebi
    Tian, Jin
    Wang, Zhengdao
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3149 - 3153
  • [4] Weakly Supervised Semantic Segmentation by Multiple Group Cosegmentation
    Luo, Kunming
    Meng, Fanman
    Wu, Qingbo
    Li, Hongliang
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [5] Image Piece Learning for Weakly Supervised Semantic Segmentation
    Li, Yi
    Guo, Yanqing
    Kao, Yueying
    He, Ran
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (04): : 648 - 659
  • [6] A Weakly Supervised Deep Learning Semantic Segmentation Framework
    Zhang, Jizhi
    Zhang, Guoying
    Wang, Qiangyu
    Bai, Shuang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, : 182 - 185
  • [7] Weakly Supervised Structured Output Learning for Semantic Segmentation
    Vezhnevets, Alexander
    Ferrari, Vittorio
    Buhmann, Joachim M.
    [J]. 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 845 - 852
  • [8] Weakly Supervised Semantic Segmentation Based on Deep Learning
    Liang, Binxiu
    Liu, Yan
    He, Linxi
    Li, Jiangyun
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC2019), 2020, 582 : 455 - 464
  • [9] Weakly Supervised Learning of Dense Semantic Correspondences and Segmentation
    Ufer, Nikolai
    Lui, Kam To
    Schwarz, Katja
    Warkentin, Paul
    Ommer, Bjoern
    [J]. PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 456 - 470
  • [10] Spatial Group-Wise Enhance: Enhancing Semantic Feature Learning in CNN
    Li, Yuxuan
    Li, Xiang
    Yang, Jian
    [J]. COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 316 - 332