Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images

被引:16
|
作者
Khan, Salman H. [1 ]
Bennamoun, Mohammed [1 ]
Sohel, Ferdous [1 ]
Togneri, Roberto [2 ]
Naseem, Imran [3 ]
机构
[1] Univ Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
[2] Univ Western Australia, Sch EECE, 35 Stirling Highway, Crawley, WA 6009, Australia
[3] Karachi Inst Econ & Technol, Dept Engn, Karachi 75190, Pakistan
基金
澳大利亚研究理事会;
关键词
Scene parsing; Graphical models; Geometric reasoning; Structured learning; OBJECT RECOGNITION; FEATURES; SCALE; TEXTURE;
D O I
10.1007/s11263-015-0843-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inexpensive structured light sensors can capture rich information from indoor scenes, and scene labeling problems provide a compelling opportunity to make use of this information. In this paper we present a novel conditional random field (CRF) model to effectively utilize depth information for semantic labeling of indoor scenes. At the core of the model, we propose a novel and efficient plane detection algorithm which is robust to erroneous depthmaps. Our CRF formulation defines local, pairwise and higher order interactions between image pixels. At the local level, we propose a novel scheme to combine energies derived from appearance, depth and geometry-based cues. The proposed local energy also encodes the location of each object class by considering the approximate geometry of a scene. For the pairwise interactions, we learn a boundary measure which defines the spatial discontinuity of object classes across an image. To model higher-order interactions, the proposed energy treats smooth surfaces as cliques and encourages all the pixels on a surface to take the same label. We show that the proposed higher-order energies can be decomposed into pairwise submodular energies and efficient inference can be made using the graph-cuts algorithm. We follow a systematic approach which uses structured learning to fine-tune the model parameters. We rigorously test our approach on SUN3D and both versions of the NYU-Depth database. Experimental results show that our work achieves superior performance to state-of-the-art scene labeling techniques.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 44 条
  • [21] Semantic labeling of high-resolution aerial images using an ensemble of fully convolutional networks
    Sun, Xiaofeng
    Shen, Shuhan
    Lin, Xiangguo
    Hu, Zhanyi
    JOURNAL OF APPLIED REMOTE SENSING, 2017, 11
  • [22] Integrating semantic edges and segmentation information for building extraction from aerial images using UNet
    Abdollahi, Abolfazl
    Pradhan, Biswajeet
    MACHINE LEARNING WITH APPLICATIONS, 2021, 6
  • [23] EFASPP U-Net for semantic segmentation of night traffic scenes using fusion of visible and thermal images
    Shojaiee, Faegheh
    Baleghi, Yasser
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [24] Semantic Context and Visual Feature Effects in Object Naming: An fMRI Study using Arterial Spin Labeling
    Hocking, Julia
    McMahon, Katie L.
    de Zubicaray, Greig I.
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2009, 21 (08) : 1571 - 1583
  • [25] Joint Cranial Bone Labeling and Landmark Detection in Pediatric CT Images Using Context Encoding
    Liu, Jiawei
    Xing, Fuyong
    Shaikh, Abbas
    French, Brooke
    Linguraru, Marius George
    Porras, Antonio R.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (10) : 3117 - 3126
  • [26] Semantic and Context Based Image Retrieval Method Using a Single Image Sensor for Visual Indoor Positioning
    Jia, Shuang
    Ma, Lin
    Yang, Songxiang
    Qin, Danyang
    IEEE SENSORS JOURNAL, 2021, 21 (16) : 18020 - 18032
  • [27] Semantic Path Planning for Indoor Navigation Tasks Using Multi-View Context and Prior Knowledge
    Wu, Jianbing
    Huang, Weibo
    Hua, Guoliang
    Zhang, Wanruo
    Kang, Risheng
    Liu, Hong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (05) : 756 - 764
  • [28] Customer Context Analysis in Shopping Malls: A Method Combining Semantic Behavior and Indoor Positioning Using a Smartphone
    Tian, Ye
    Gu, Yanlei
    Lu, Qianwen
    Kamijo, Shunsuke
    SENSORS, 2025, 25 (03)
  • [29] Automatic Semantic Modeling of Indoor Scenes from Low-quality RGB-D Data using Contextual Information
    Chen, Kang
    Lai, Yu-Kun
    Wu, Yu-Xin
    Martin, Ralph
    Hu, Shi-Min
    ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06):
  • [30] RGB-D IBR: Rendering Indoor Scenes Using Sparse RGB-D Images with Local Alignments
    Jeong, Yeongyu
    Kim, Haejoon
    Seo, Hyewon
    Cordier, Frederic
    Lee, Seungyong
    PROCEEDINGS I3D 2016: 20TH ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES, 2016, : 205 - 206