Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation

Cited by: 70
Authors
Roy, Anirban [1]
Todorovic, Sinisa [1]
Affiliations
[1] Oregon State Univ, Corvallis, OR 97330 USA
DOI
10.1109/CVPR.2017.770
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the problem of weakly supervised semantic image segmentation. Our goal is to label every pixel in a new image, given only image-level object labels associated with training images. Our problem statement differs from common semantic segmentation, where pixel-wise annotations are typically assumed to be available in training. We specify a novel deep architecture which fuses three distinct computation processes toward semantic segmentation, namely: (i) the bottom-up computation of neural activations in a CNN for the image-level prediction of object classes; (ii) the top-down estimation of conditional likelihoods of the CNN's activations given the predicted objects, resulting in probabilistic attention maps per object class; and (iii) the lateral attention-message passing from neighboring neurons at the same CNN layer. The fusion of (i)-(iii) is realized via a conditional random field, formulated as a recurrent network, aimed at generating a smooth and boundary-preserving segmentation. Unlike existing work, we formulate a unified end-to-end learning of all components of our deep architecture. Evaluation on the benchmark PASCAL VOC 2012 dataset demonstrates that we outperform reasonable weakly supervised baselines and state-of-the-art approaches.
Pages: 7282-7291
Page count: 10
Related papers
50 in total
  • [1] Unsupervised Tattoo Segmentation Combining Bottom-Up and Top-Down Cues
    Allen, Josef D.
    Zhao, Nan
    Yuan, Jiangbo
    Liu, Xiuwen
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2011, 2011, 8063
  • [2] Combining bottom-up and top-down
    Boehringer, Christoph
    Rutherford, Thomas F.
    ENERGY ECONOMICS, 2008, 30 (02) : 574 - 596
  • [3] OBJCUT: Efficient Segmentation Using Top-Down and Bottom-Up Cues
    Kumar, M. Pawan
    Torr, P. H. S.
    Zisserman, A.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (03) : 530 - 545
  • [4] Bottom-up Segmentation for Top-down Detection
    Fidler, Sanja
    Mottaghi, Roozbeh
    Yuille, Alan
    Urtasun, Raquel
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3294 - 3301
  • [5] Combined Top-Down/Bottom-Up Segmentation
    Borenstein, Eran
    Ullman, Shimon
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (12) : 2109 - 2125
  • [6] Top-down and bottom-up image processing
    Stark, LW
    Privitera, C
    1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 2294 - 2299
  • [7] On Combining Top-down and Bottom-up Strategies in Reading
    张荣
    读与写(教育教学刊), 2010, 7 (09) : 5 - 7
  • [8] Combining bottom-up and top-down attentional influences
    Navalpakkam, Vidhya
    Itti, Laurent
    HUMAN VISION AND ELECTRONIC IMAGING XI, 2006, 6057
  • [9] Learning to Combine Bottom-Up and Top-Down Segmentation
    Levin, Anat
    Weiss, Yair
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 81 (01) : 105 - 118
  • [10] Learning to combine bottom-up and top-down segmentation
    Levin, Anat
    Weiss, Yair
    COMPUTER VISION - ECCV 2006, PT 4, PROCEEDINGS, 2006, 3954 : 581 - 594