RGAM: A novel network architecture for 3D point cloud semantic segmentation in indoor scenes

被引:21
|
作者
Chen, Xue-Tao [1 ,2 ]
Li, Ying [1 ,2 ]
Fan, Jia-Hao [1 ,2 ]
Wang, Rui [3 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[3] Space Technol Jilin Ltd Co, Jilin 132013, Jilin, Peoples R China
关键词
3D Point cloud; Semantic segmentation; Deep neural network; Attention mechanism; CLASSIFICATION;
D O I
10.1016/j.ins.2021.04.069
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional (3D) point cloud semantic segmentation is an essential part of computer vision for scene comprehension. Nevertheless, due to their loss of detail, existing networks lack the ability to recognize complex scenes. This paper proposes a novel network architecture, called the ring grouping neural network with attention module (RGAM), which presents four improvements over the existing networks. First, novel multi-scale ring grouping learning is designed to extract the multi-scale neighborhood features without overlapped sampling, allowing the network to adapt to objects of different scales. Second, neighborhood information fusion is defined as the weighted sum of multiple neighborhood features, enabling the representation of each point to be considered in different neighborhoods. Third, in the global view, a spatial attention module is introduced among the neighborhoods, allowing long-range contextual information to be exploited for 3D point cloud semantic segmentation. Finally, a channel attention module is appended to the RGAM: the correlation of each channel with key information enhances the complex scene recognition ability of the RGAM. Experimental results on the challenging S3DIS, ScanNet, and NYU-V2 datasets demonstrate that the RGAM has stronger recognition ability than the existing networks based on several state-of-the-art algorithms for 3D point cloud semantic segmentation. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:87 / 103
页数:17
相关论文
共 50 条
  • [1] RGAM: A novel network architecture for 3D point cloud semantic segmentation in indoor scenes
    Chen, Xue-Tao
    Li, Ying
    Fan, Jia-Hao
    Wang, Rui
    Information Sciences, 2021, 571 : 87 - 103
  • [2] A review of point cloud segmentation for understanding 3D indoor scenes
    Yuliang Sun
    Xudong Zhang
    Yongwei Miao
    Visual Intelligence, 2 (1):
  • [3] SHREC 2020: 3D point cloud semantic segmentation for street scenes
    Ku, Tao
    Veltkamp, Remco C.
    Boom, Bas
    Duque-Arias, David
    Velasco-Forero, Santiago
    Deschaud, Jean-Emmanuel
    Goulette, Francois
    Marcotegui, Beatriz
    Ortega, Sebastian
    Trujillo, Agustin
    Pablo Suarez, Jose
    Miguel Santana, Jose
    Ramirez, Cristian
    Akadas, Kiran
    Gangisetty, Shankar
    COMPUTERS & GRAPHICS-UK, 2020, 93 : 13 - 24
  • [4] Semantic Segmentation Networks of 3D Point Clouds for RGB-D Indoor Scenes
    Wang, Ya
    Zell, Andreas
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [5] 3d indoor point cloud semantic segmentation using image and voxel
    Yeom S.-S.
    Ha J.-E.
    Ha, Jong-Eun (jeha@seoultech.ac.kr), 1600, Institute of Control, Robotics and Systems (27): : 1000 - 1007
  • [6] Local Transformer Network on 3D Point Cloud Semantic Segmentation
    Wang, Zijun
    Wang, Yun
    An, Lifeng
    Liu, Jian
    Liu, Haiyang
    INFORMATION, 2022, 13 (04)
  • [7] Novel Class Discovery for 3D Point Cloud Semantic Segmentation
    Riz, Luigi
    Saltori, Cristiano
    Ricci, Elisa
    Poiesi, Fabio
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9393 - 9402
  • [8] Semantic Segmentation of Indoor 3D Point Cloud Model Based on 2D-3D Semantic Transfer
    Xiong H.
    Zheng X.
    Ding Y.
    Zhang Y.
    Wu X.
    Zhou Y.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2018, 43 (12): : 2303 - 2309
  • [9] Understanding the Imperfection of 3D point Cloud and Semantic Segmentation algorithms for 3D Models of Indoor Environment
    Cai, Guoray
    Pan, Yimu
    25TH AGILE CONFERENCE ON GEOGRAPHIC INFORMATION SCIENCE ARTIFICIAL INTELLIGENCE IN THE SERVICE OF GEOSPATIAL TECHNOLOGIES, 2022, 3
  • [10] LONet: Local Optimization Network for 3D point cloud semantic segmentation
    Su, Shengbin
    Lu, Jian
    Chen, Xiaogai
    Zhang, Kaibing
    Zhou, Jian
    DIGITAL SIGNAL PROCESSING, 2024, 154