3D Semantic Scene Segmentation with Multi-View RGB-D Images in Indoor Environments

Authors
Bae H.-L. [1]
Kim I. [1]
Affiliations
[1] Department of Computer Science, Kyonggi University
Keywords
3D Semantic Scene Segmentation; Indoor Environment; Multi-View RGB-D Images; Point Cloud
DOI
10.5302/J.ICROS.2023.22.0234
Abstract
This paper proposes a novel model for 3D semantic scene segmentation in indoor environments. Existing models for 3D semantic scene segmentation use either only the 3D geometric features of the scene point cloud or only the 2D visual features of RGB color images. To overcome this limitation and improve segmentation performance, we propose a multimodal 3D semantic scene segmentation model that uses both the 3D geometric features of the scene point cloud and the rich 2D visual features of multi-view color images. The proposed model addresses the point sparsity problem by using a dense point cloud obtained from multi-view depth images, and it uses an adaptive point feature extractor to extract 3D geometric features that represent the local structural characteristics of points. Moreover, the model adopts a unique early fusion strategy to fuse the 2D and 3D features. Experiments on the ScanNet benchmark dataset demonstrate the effectiveness and superiority of the proposed model. © ICROS 2023.
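The abstract does not specify how the dense point cloud is built or how the 2D and 3D features are fused. As a rough illustration only, the sketch below back-projects each depth image through a pinhole camera model and attaches the per-pixel 2D feature vector to the resulting 3D point by plain concatenation, which is one common form of early fusion. The function names `backproject_depth` and `early_fuse`, the convention that zero depth means "no measurement", and the concatenation itself are assumptions made for this example, not the authors' method.

```python
import numpy as np

def backproject_depth(depth, K, cam_to_world):
    """Lift a depth image (H, W) to world-space points with a pinhole model.

    Returns (points, pixel_index): points has shape (N, 3); pixel_index holds
    the flat indices of the valid pixels so 2D features can be looked up.
    """
    h, w = depth.shape
    v, u = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    z = depth.reshape(-1)
    valid = z > 0  # assumption: zero depth marks a missing measurement
    u = u.reshape(-1)[valid]
    v = v.reshape(-1)[valid]
    z = z[valid]
    fx, fy, cx, cy = K[0, 0], K[1, 1], K[0, 2], K[1, 2]
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    cam = np.stack([x, y, z, np.ones_like(z)], axis=1)  # homogeneous, (N, 4)
    world = (cam_to_world @ cam.T).T[:, :3]
    return world, np.flatnonzero(valid)

def early_fuse(views, K):
    """Concatenate xyz with the matching per-pixel 2D features for all views.

    views is a list of (depth, feat2d, cam_to_world) tuples, where feat2d has
    shape (H, W, C). The result has shape (total_points, 3 + C).
    """
    fused = []
    for depth, feat2d, pose in views:
        pts, idx = backproject_depth(depth, K, pose)
        feats = feat2d.reshape(-1, feat2d.shape[-1])[idx]
        fused.append(np.concatenate([pts, feats], axis=1))
    return np.concatenate(fused, axis=0)
```

Merging the points from several views in this way yields a denser cloud than any single frame provides, and each point already carries image features before any 3D network processes it.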
Pages: 235-244
Number of pages: 9
Related papers
50 in total
  • [1] RGB-D Multi-View System Calibration for Full 3D Scene Reconstruction
    Afzal, Hassan
    Aouada, Djamila
    Fofi, David
    Mirbach, Bruno
    Ottersten, Bjoern
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2459 - 2464
  • [2] Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
    Seichter, Daniel
    Koehler, Mona
    Lewandowski, Benjamin
    Wengefeld, Tim
    Gross, Horst-Michael
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13525 - 13531
  • [3] Indoor Scene Understanding by Fusing Multi-View RGB-D Image Frames
    Li X.
    Zhang B.
    Sun F.
    Liu J.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (06): : 1218 - 1226
  • [4] Accurate semantic segmentation of RGB-D images for indoor navigation
    Sharan, Sudeep
    Nauth, Peter
    Dominguez-Jimenez, Juan-Jose
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [5] Temporally Consistent Semantic Segmentation using Spatially Aware Multi-view Semantic Fusion for Indoor RGB-D videos
    Sun, Fengyuan
    Karaoglu, Sezer
    Gevers, Theo
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4250 - 4259
  • [6] 3D Background Modeling in Multi-view RGB-D Video
    Huang, Yung-Lin
    Wei, Ku-Chu
    Chien, Shao-Yi
    [J]. MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1051 - 1054
  • [7] Semantic Segmentation of Indoor-Scene RGB-D Images Based on Iterative Contraction and Merging
    Syu, Jia-Hao
    Cho, Shih-Hsuan
    Wang, Sheng-Jyh
    Wang, Li-Chun
    [J]. IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 252 - 261
  • [8] Semantic Segmentation Networks of 3D Point Clouds for RGB-D Indoor Scenes
    Wang, Ya
    Zell, Andreas
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [9] Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation
    Saurabh Gupta
    Pablo Arbeláez
    Ross Girshick
    Jitendra Malik
    [J]. International Journal of Computer Vision, 2015, 112 : 133 - 149