Improving performance of deep learning models for 3D point cloud semantic segmentation via attention mechanisms

被引:20
|
作者
Vanian V. [1 ]
Zamanakos G. [1 ]
Pratikakis I. [1 ]
机构
[1] Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi
来源
关键词
3D semantic segmentation; Attention mechanisms; Autonomous driving; Deep learning;
D O I
10.1016/j.cag.2022.06.010
中图分类号
学科分类号
摘要
3D Semantic segmentation is a key element for a variety of applications in robotics and autonomous vehicles. For such applications, 3D data are usually acquired by LiDAR sensors resulting in a point cloud, which is a set of points characterized by its unstructured form and inherent sparsity. For the task of 3D semantic segmentation where the corresponding point clouds should be labeled with semantics, the current tendency is the use of deep learning neural network architectures for effective representation learning. On the other hand, various 2D and 3D computer vision tasks have used attention mechanisms which result in an effective re-weighting of the already learned features. In this work, we aim to investigate the role of attention mechanisms for the task of 3D semantic segmentation for autonomous driving, by identifying the significance of different attention mechanisms when adopted in existing deep learning networks. Our study is further supported by an extensive experimentation on two standard datasets for autonomous driving, namely Street3D and SemanticKITTI, that permit to draw conclusions at both a quantitative and qualitative level. Our experimental findings show that there is a clear advantage when attention mechanisms have been adopted, resulting in a superior performance. In particular, we show that the adoption of a Point Transformer in a SPVCNN network, results in an architecture which outperforms the state of the art on the Street3D dataset. © 2022 Elsevier Ltd
引用
收藏
页码:277 / 287
页数:10
相关论文
共 50 条
  • [1] DEEP LEARNING FOR SEMANTIC SEGMENTATION OF 3D POINT CLOUD
    Malinverni, E. S.
    Pierdicca, R.
    Paolanti, M.
    Martini, M.
    Morbidoni, C.
    Matrone, F.
    Lingua, A.
    27TH CIPA INTERNATIONAL SYMPOSIUM: DOCUMENTING THE PAST FOR A BETTER FUTURE, 2019, 42-2 (W15): : 735 - 742
  • [2] Improving Point Cloud Semantic Segmentation by Learning 3D Object Detection
    Unal, Ozan
    Van Gool, Luc
    Dai, Dengxin
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2949 - 2958
  • [3] Bottleneck Identification to Semantic Segmentation of Industrial 3D Point Cloud Scene via Deep Learning
    Cazorla, Romain
    Poinel, Line
    Papadakis, Panagiotis
    Buche, Cedric
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4877 - 4878
  • [4] Research of Deep Learning-Based Semantic Segmentation for 3D Point Cloud
    Wang, Tao
    Wang, Wenju
    Cai, Yu
    Computer Engineering and Applications, 2024, 57 (23) : 18 - 26
  • [5] A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation
    Sarker, Sushmita
    Sarker, Prithul
    Stone, Gunner
    Gorman, Ryan
    Tavakkoli, Alireza
    Bebis, George
    Sattarvand, Javad
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [6] A 3D Semantic Segmentation Method for Large-Scale Point Cloud on Deep Learning
    Liu, Sihan
    Zhang, Wenyu
    Zhang, Yujun
    Wang, Zhijian
    Gao, Dongxiang
    ENGINEERING LETTERS, 2023, 31 (04) : 1667 - 1674
  • [7] AttAN: Attention Adversarial Networks for 3D Point Cloud Semantic Segmentation
    Zhang, Gege
    Ma, Qinghua
    Jiao, Licheng
    Liu, Fang
    Sun, Qigong
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 789 - 796
  • [8] Semantic segmentation of 3D point cloud based on contextual attention CNN
    Yang J.
    Dang J.
    Tongxin Xuebao/Journal on Communications, 2020, 41 (07): : 195 - 203
  • [9] A GLOBAL POINT-SIFT ATTENTION NETWORK FOR 3D POINT CLOUD SEMANTIC SEGMENTATION
    Jia, Meixia
    Li, Aijin
    Wu, Zhaoyang
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5065 - 5068
  • [10] 3D semantic segmentation using deep learning for large-scale indoor point cloud
    Chen Hui
    Xu Peng
    Zuo Yipeng
    Wang Weina
    PROCEEDINGS OF 2019 14TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS (ICEMI), 2019, : 1650 - 1655