Automated segmentation of RGB-D images into a comprehensive set of building components using deep learning

被引:36
|
作者
Czerniawski, Thomas [1 ]
Leite, Fernanda [1 ]
机构
[1] Univ Texas Austin, Dept Civil Architectural & Environm Engn, 301 E Dean Keeton St Stop C1752, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
Building information modeling; Semantic segmentation; Deep learning; Class balancing; RGB-D; 3DFacilities; POINT CLOUDS; RECONSTRUCTION; MODELS; BIM; INTERIORS; ELEMENTS; OBJECT;
D O I
10.1016/j.aei.2020.101131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building information modeling (BIM) has a semantic scope that encompasses all building systems, e.g. architectural, structural, mechanical, electrical, and plumbing. Automated, comprehensive digital modeling of buildings will require methods for semantic segmentation of images and 3D reconstructions capable of recognizing all building component classes. However, prior building component recognition methods have had limited semantic coverage and are not easily combined or scaled. Here we show that a deep neural network can semantically segment RGB-D (i.e. color and depth) images into 13 building component classes simultaneously despite the use of a small training dataset with only 1490 object instances. For this task, the method achieves an average intersection over union (IoU) of 0.5. The dataset was designed using a common building taxonomy to ensure comprehensive semantic coverage and was collected from a diversity of buildings to ensure infra-class diversity. As a consequence of its semantic scope, it was necessary to perform pre-segmentation and 3D to 2D projection as leverage for dataset annotation. In creating our deep learning pipeline, we found that transfer learning, class balancing, and prevention of overfitting effectively overcame the dataset's borderline adequate class representation. Our results demonstrate how the semantic coverage of a building component recognition method can be scaled to include a larger diversity of building systems. We anticipate our method to be a starting point for broadening the scope of the semantic segmentation methods involved in digital modeling of buildings.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A brief survey on RGB-D semantic segmentation using deep learning*
    Wang, Changshuo
    Wang, Chen
    Li, Weijun
    Wang, Haining
    [J]. DISPLAYS, 2021, 70
  • [2] Unsupervised Segmentation of RGB-D Images
    Deng, Zhuo
    Latecki, Longin Jan
    [J]. COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 423 - 435
  • [3] RGB-D Semantic Segmentation for Indoor Modeling Using Deep Learning: A Review
    Rached, Ishraq
    Hajji, Rafika
    Landes, Tania
    [J]. RECENT ADVANCES IN 3D GEOINFORMATION SCIENCE, 3D GEOINFO 2023, 2024, : 587 - 604
  • [4] Deep Convolutional Neural Networks for Automated Convulsion Scoring using RGB-D Images
    Wang, Zheyuan
    Ray, Azizi
    Murnane, Kevin S.
    Ghovanloo, Maysam
    [J]. 2018 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS): ADVANCED SYSTEMS FOR ENHANCING HUMAN HEALTH, 2018, : 121 - 124
  • [5] 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data
    Yasir, Siddiqui Muhammad
    Sadiq, Amin Muhammad
    Ahn, Hyunsik
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5777 - 5791
  • [6] Segmentation of Shipping Bags in RGB-D Images
    Vasileva, Elena
    Ivanovski, Zoran
    [J]. 2022 IEEE 5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING APPLICATIONS AND SYSTEMS, IPAS, 2022,
  • [7] Efficient Image Segmentation of RGB-D Images
    Fouad, Islam I.
    Rady, Sherine
    Mostafa, G. M. Mostafa
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2017, : 353 - 358
  • [8] Traffic Scene Segmentation Based on RGB-D Image and Deep Learning
    Li, Linhui
    Qian, Bo
    Lian, Jing
    Zheng, Weina
    Zhou, Yafu
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (05) : 1664 - 1669
  • [9] Region Merging Driven by Deep Learning for RGB-D Segmentation and Labeling
    Michieli, Umberto
    Camporese, Maria
    Agiollo, Andrea
    Pagnutti, Giampaolo
    Zanuttigh, Pietro
    [J]. ICDSC 2019: 13TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS, 2019,
  • [10] Parallel RCNN: A Deep Learning Method for People Detection Using RGB-D Images
    Ren, Xiaodong
    Du, Sanping
    Zheng, Yi
    [J]. 2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,