Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images

被引:365
|
作者
Gupta, Saurabh [1 ]
Arbelaez, Pablo [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CVPR.2013.79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problems of contour detection, bottom-up grouping and semantic segmentation using RGB-D data. We focus on the challenging setting of cluttered indoor scenes, and evaluate our approach on the recently introduced NYU-Depth V2 (NYUD2) dataset [27]. We propose algorithms for object boundary detection and hierarchical segmentation that generalize the gPb - ucm approach of [2] by making effective use of depth information. We show that our system can label each contour with its type (depth, normal or albedo). We also propose a generic method for long-range amodal completion of surfaces and show its effectiveness in grouping. We then turn to the problem of semantic segmentation and propose a simple approach that classifies superpixels into the 40 dominant object categories in NYUD2. We use both generic and class-specific features to encode the appearance and geometry of objects. We also show how our approach can be used for scene classification, and how this contextual information in turn improves object recognition. In all of these tasks, we report significant improvements over the state-of-the-art.
引用
收藏
页码:564 / 571
页数:8
相关论文
共 50 条
  • [1] Object Segmentation of Indoor Scenes Using Perceptual Organization on RGB-D Images
    Wang, Chaonan
    Xue, Yanbing
    Zhang, Hua
    Xu, Guangping
    Gao, Zan
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [2] Semantic Labeling of Indoor Scenes from RGB-D Images with Discriminative Learning
    Liu, Bo
    Fan, Haoqi
    SIXTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2013), 2013, 9067
  • [3] RGB-D IBR: Rendering Indoor Scenes Using Sparse RGB-D Images with Local Alignments
    Jeong, Yeongyu
    Kim, Haejoon
    Seo, Hyewon
    Cordier, Frederic
    Lee, Seungyong
    PROCEEDINGS I3D 2016: 20TH ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES, 2016, : 205 - 206
  • [4] Dense 3D Semantic Mapping of Indoor Scenes from RGB-D Images
    Hermans, Alexander
    Floros, Georgios
    Leibe, Bastian
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 2631 - 2638
  • [5] Online Reconstruction of Indoor Scenes from RGB-D Streams
    Wang, Hao
    Wang, Jun
    Wang, Liang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3271 - 3279
  • [6] Indoor Scene Recognition from RGB-D Images by Learning Scene Bases
    Wan, Shaohua
    Hu, Changbo
    Aggarwal, J. K.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3416 - 3421
  • [7] RGB-D DSO: Direct Sparse Odometry With RGB-D Cameras for Indoor Scenes
    Yuan, Zikang
    Cheng, Ken
    Tang, Jinhui
    Yang, Xin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4092 - 4101
  • [8] 21/2D Scene Reconstruction of Indoor Scenes from Single RGB-D Images
    Neverova, Natalia
    Muselet, Damien
    Tremeau, Alain
    COMPUTATIONAL COLOR IMAGING, CCIW 2013, 2013, 7786 : 281 - 295
  • [9] SparseGNV: Generating Novel Views of Indoor Scenes with Sparse RGB-D Images
    Cheng, Weihao
    Cao, Yan-Pei
    Shan, Ying
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1308 - 1316
  • [10] RGB-D Images for Object Segmentation, Localization and Recognition in Indoor Scenes using Feature Descriptor and Hough Voting
    Ahmed, Abrar
    Jalal, Ahmad
    Kim, Kibum
    PROCEEDINGS OF 2020 17TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2020, : 290 - 295