Voxel- and Bird's-Eye-View-Based Semantic Scene Completion for LiDAR Point Clouds

被引:0
|
作者
Liang, Li [1 ]
Akhtar, Naveed [2 ]
Vice, Jordan [1 ]
Mian, Ajmal [1 ]
机构
[1] Univ Western Australia, Dept Comp Sci & Software Engn, 35 Stirling Hwy, Crawley, WA 6009, Australia
[2] Univ Melbourne, Sch Comp & Informat Syst, Parkville, Vic 3052, Australia
基金
澳大利亚研究理事会;
关键词
LiDAR; 3D point cloud; 3D semantic scene completion; convolution; FUSION NETWORK; SEGMENTATION; SHAPE;
D O I
10.3390/rs16132266
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Semantic scene completion is a crucial outdoor scene understanding task that has direct implications for technologies like autonomous driving and robotics. It compensates for unavoidable occlusions and partial measurements in LiDAR scans, which may otherwise cause catastrophic failures. Due to the inherent complexity of this task, existing methods generally rely on complex and computationally demanding scene completion models, which limits their practicality in downstream applications. Addressing this, we propose a novel integrated network that combines the strengths of 3D and 2D semantic scene completion techniques for efficient LiDAR point cloud scene completion. Our network leverages a newly devised lightweight multi-scale convolutional block (MSB) to efficiently aggregate multi-scale features, thereby improving the identification of small and distant objects. It further utilizes a layout-aware semantic block (LSB), developed to grasp the overall layout of the scene to precisely guide the reconstruction and recognition of features. Moreover, we also develop a feature fusion module (FFM) for effective interaction between the data derived from two disparate streams in our network, ensuring a robust and cohesive scene completion process. Extensive experiments with the popular SemanticKITTI dataset demonstrate that our method achieves highly competitive performance, with an mIoU of 35.7 and an IoU of 51.4. Notably, the proposed method achieves an mIoU improvement of 2.6 % compared to previous methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Scene Representation in Bird's-Eye View from Surrounding Cameras with Transformers
    Zhao, Yun
    Zhang, Yu
    Gong, Zhan
    Zhu, Hong
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4510 - 4518
  • [32] Semantic Scene Completion with Point Cloud Representation and Transformer-based feature fusion
    Fu, Ruochong
    Wu, Hang
    Hao, Mengxiang
    Miao, Yubin
    [J]. Proceedings - International Conference on Image Processing, ICIP, 2023, : 3369 - 3373
  • [33] SEMANTIC SCENE COMPLETION WITH POINT CLOUD REPRESENTATION AND TRANSFORMER-BASED FEATURE FUSION
    Fu, Ruochong
    Wu, Hang
    Hao, Mengxiang
    Miao, Yubin
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3369 - 3373
  • [34] Cycle and Semantic Consistent Adversarial Domain Adaptation for Reducing Simulation-to-Real Domain Shift in LiDAR Bird's Eye View
    Barrera, Alejandro
    Beltran, Jorge
    Guindel, Carlos
    Iglesias, Jose Antonio
    Garcia, Fernando
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 3081 - 3086
  • [35] High-precision Real-time Object Detection Based on Bird's Eye View from 3D Point Clouds
    Zhang Y.
    Xiang Z.
    Qiao C.
    Chen S.
    [J]. Jiqiren/Robot, 2020, 42 (02): : 148 - 156
  • [36] BEVDetNet: Bird's Eye View LiDAR Point Cloud based Real-time 3D Object Detection for Autonomous Driving
    Mohapatra, Sambit
    Yogamani, Senthil
    Gotzig, Heinrich
    Milz, Stefan
    Maeder, Patrick
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2809 - 2815
  • [37] Voxel-Based Neighborhood for Spatial Shape Pattern Classification of Lidar Point Clouds with Supervised Learning
    Plaza-Leiva, Victoria
    Antonio Gomez-Ruiz, Jose
    Mandow, Anthony
    Garcia-Cerezo, Alfonso
    [J]. SENSORS, 2017, 17 (03)
  • [38] BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
    Peng, Lang
    Chen, Zhirong
    Fu, Zhangjie
    Liang, Pengpeng
    Cheng, Erkang
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5924 - 5932
  • [39] A Voxel-Based 3D Building Detection Algorithm for Airborne LIDAR Point Clouds
    Liying Wang
    Yan Xu
    Yu Li
    [J]. Journal of the Indian Society of Remote Sensing, 2019, 47 : 349 - 358
  • [40] Automatic parking based on bird's eye view cameras
    [J]. Wang, C.-X. (wangcx@sjtu.edu.cn), 2013, Shanghai Jiaotong University (47):