Image Parsing with a Wide Range of Classes and Scene-Level Context

Cited by: 0
Authors
George, Marian [1]
Affiliations
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
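Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract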
This paper presents a nonparametric scene parsing approach that improves the overall accuracy, as well as the coverage of foreground classes in scene images. We first improve the label likelihood estimates at superpixels by merging likelihood scores from different probabilistic classifiers. This boosts the classification performance and enriches the representation of less-represented classes. Our second contribution consists of incorporating semantic context in the parsing process through global label costs. Our method does not rely on image retrieval sets but rather assigns a global likelihood estimate to each label, which is plugged into the overall energy function. We evaluate our system on two large-scale datasets, SIFTflow and LMSun. We achieve state-of-the-art performance on the SIFTflow dataset and near-record results on LMSun.
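As a rough illustration of the two ideas named in the abstract, the sketch below fuses per-superpixel label likelihoods from two classifiers and then scores a labeling with an energy that adds a global cost for each label used. Everything specific here is an assumption made for illustration and is not taken from the paper: the weighted-average fusion rule, the function names `fuse_likelihoods` and `energy`, the toy scores, and the label cost values. The paper's actual classifiers, fusion scheme, and energy terms may differ.

```python
import math


def fuse_likelihoods(scores_a, scores_b, weight=0.5):
    """Fuse two score tables (one row per superpixel, one column per label).

    Assumption: a weighted average followed by renormalization; the
    paper's actual merging rule is not given in the abstract.
    """
    fused = []
    for row_a, row_b in zip(scores_a, scores_b):
        mixed = [weight * a + (1.0 - weight) * b for a, b in zip(row_a, row_b)]
        total = sum(mixed)
        fused.append([m / total for m in mixed])
    return fused


def energy(labeling, fused, label_costs, eps=1e-12):
    """Energy of a labeling: data term plus global label costs.

    Data term: negative log fused likelihood of each superpixel's label.
    Global term: each distinct label that appears pays its cost once
    (hypothetical costs; lower cost means the label suits the scene).
    """
    data_term = sum(-math.log(fused[i][lab] + eps) for i, lab in enumerate(labeling))
    global_term = sum(label_costs[lab] for lab in set(labeling))
    return data_term + global_term


# Toy example: 2 superpixels, 3 labels (all numbers are made up).
scores_a = [[0.7, 0.2, 0.1], [0.4, 0.5, 0.1]]
scores_b = [[0.5, 0.3, 0.2], [0.2, 0.2, 0.6]]
fused = fuse_likelihoods(scores_a, scores_b)
label_costs = [0.1, 0.1, 2.0]  # label 2 is treated as unlikely for this scene

plausible = energy([0, 1], fused, label_costs)
penalized = energy([0, 2], fused, label_costs)
print(plausible < penalized)
```

In this toy setup, labels 1 and 2 are equally likely for the second superpixel after fusion, so the global label cost is what separates the two labelings. A real system would minimize such an energy over all labelings, typically with additional smoothness terms, rather than compare two candidates by hand.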
Pages: 3622 / 3630
Page count: 9
Related papers
50 in total
  • [1] Context Driven Scene Parsing with Attention to Rare Classes
    Yang, Jimei
    Price, Brian
    Cohen, Scott
    Yang, Ming-Hsuan
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3294 - 3301
  • [2] Scene-Level Sketch-Based Image Retrieval with Minimal Pairwise Supervision
    Ge, Ce
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    Xu, Tong
    Liao, Jianxin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 650 - 657
  • [3] Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships
    Liu, Yong
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6985 - 6994
  • [4] MegaScenes: Scene-Level View Synthesis at Scale
    Tung, Joseph
    Chou, Gene
    Cai, Ruojin
    Yang, Guandao
    Zhang, Kai
    Wetzstein, Gordon
    Hariharan, Bharath
    Snavely, Noah
    COMPUTER VISION - ECCV 2024, PT XXIX, 2025, 15087 : 197 - 214
  • [5] Multi-Subject Image Retrieval by Fusing Object and Scene-Level Feature Embeddings
    Ban, Chung-Gi
    Hwang, Youngbae
    Park, Dayoung
    Lee, Ryong
    Jang, Rae-Young
    Choi, Myung-Seok
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [6] Adaptive Context Network for Scene Parsing
    Fu, Jun
    Liu, Jing
    Wang, Yuhang
    Li, Yong
    Bao, Yongjun
    Tang, Jinhui
    Lu, Hanqing
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6747 - 6756
  • [7] Scene Parsing with Global Context Embedding
    Hung, Wei-Chih
    Tsai, Yi-Hsuan
    Shen, Xiaohui
    Lin, Zhe
    Sunkavalli, Kalyan
    Lu, Xin
    Yang, Ming-Hsuan
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2650 - 2658
  • [8] Scene-level Tracking and Reconstruction without Object Priors
    Chang, Haonan
    Boularias, Abdeslam
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 3785 - 3792
  • [9] An L1 Image Transform for Edge-Preserving Smoothing and Scene-Level Intrinsic Decomposition
    Bi, Sai
    Han, Xiaoguang
    Yu, Yizhou
    ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04):
  • [10] Variations in clubbers' substance use by individual and scene-level factors
    Anderson, Tammy L.
    Kavanaugh, Philip R.
    Rapp, Laura
    Daly, Kevin
    ADICCIONES, 2009, 21 (04) : 289 - 308