Image Parsing with a Wide Range of Classes and Scene-Level Context

被引:0
|
作者
George, Marian [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a nonparametric scene parsing approach that improves the overall accuracy, as well as the coverage of foreground classes in scene images. We first improve the label likelihood estimates at superpixels by merging likelihood scores from different probabilistic classifiers. This boosts the classification performance and enriches the representation of less-represented classes. Our second contribution consists of incorporating semantic context in the parsing process through global label costs. Our method does not rely on image retrieval sets but rather assigns a global likelihood estimate to each label, which is plugged into the overall energy function. We evaluate our system on two large-scale datasets, SIFTflow and LMSun. We achieve state-of-the-art performance on the SIFTflow dataset and near-record results on LMSun.
引用
收藏
页码:3622 / 3630
页数:9
相关论文
共 50 条
  • [21] Language-Guided Traffic Simulation via Scene-Level Diffusion
    Zhong, Ziyuan
    Rempe, Davis
    Chen, Yuxiao
    Ivanovic, Boris
    Cao, Yulong
    Xu, Danfei
    Pavone, Marco
    Ray, Baishakhi
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [22] Sequential Scene Parsing using Range and Intensity Information
    Brucker, Manuel
    Leonard, Simon
    Bodenmueller, Tim
    Hager, Gregory D.
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 5417 - 5424
  • [23] Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction
    Chang, Haonan
    Ramesh, Dhruv Metha
    Geng, Shijie
    Gan, Yuqiu
    Boularias, Abdeslam
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 820 - 826
  • [24] Scene-level Point Cloud Colorization with Semantics-and-geometry-aware Networks
    Gao, Rongrong
    Xiang, Tian-Zhu
    Lei, Chenyang
    Park, Jaesik
    Chen, Qifeng
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2818 - 2824
  • [25] Deep Structured Scene Parsing by Learning with Image Descriptions
    Lin, Liang
    Wang, Guangrun
    Zhang, Rui
    Zhang, Ruimao
    Liang, Xiaodan
    Zuo, Wangmeng
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2276 - 2284
  • [26] Adaptive Template for Parsing Object of Indoor Scene Image
    Xia, Changqun
    Xu, Jie
    Li, Qing
    Zhang, Yu
    Li, Jia
    Chen, Xiaowu
    2015 5TH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2015), 2015, : 16 - 23
  • [27] Exploiting Large Image Sets for Road Scene Parsing
    Alvarez, Jose M.
    Salzmann, Mathieu
    Barnes, Nick
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (09) : 2456 - 2465
  • [28] Partially Does It: Towards Scene-Level FG-SBIR with Partial Input
    Chowdhury, Pinaki Nath
    Bhunia, Ayan Kumar
    Gajjala, Viswanatha Reddy
    Sain, Aneeshan
    Xiang, Tao
    Song, Yi-Zhe
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2385 - 2395
  • [29] Partially Does It: Towards Scene-Level FG-SBIR with Partial Input
    Chowdhury, Pinaki Nath
    Bhunia, Ayan Kumar
    Gajjala, Viswanatha Reddy
    Sain, Aneeshan
    Xiang, Tao
    Song, Yi-Zhe
    arXiv, 2022,
  • [30] Scene-level buildings damage recognition based on Cross Conv-Transformer
    Shi, Lingfei
    Zhang, Feng
    Xia, Junshi
    Xie, Jibo
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (02) : 3987 - 4007