DFAMNet: dual fusion attention multi-modal network for semantic segmentation on LiDAR point clouds

被引:1
|
作者
Li, Mingjie [1 ]
Wang, Gaihua [2 ]
Zhu, Minghao [1 ]
Li, Chunzheng [1 ]
Liu, Hong [1 ]
Pan, Xuran [2 ]
Long, Qian [2 ]
机构
[1] Hubei Univ Technol, Sch Elect & Elctron Engn, Wuhan 430068, Peoples R China
[2] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin 300457, Peoples R China
关键词
Semantic segmentation; Multi-modal; Pseudo point cloud; Point cloud; PRIORS;
D O I
10.1007/s10489-024-05302-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation of outdoor point clouds is an important task in the field of computer vision, aiming to classify outdoor point cloud data into different semantic categories. The methods based on pure point cloud have some shortcomings, such as incomplete information and difficulty in processing incomplete data. In the paper, it proposes pseudo point cloud method to align image with point cloud. The image features are extracted through a 2D network, and then the point cloud is mapped onto the image to obtain the corresponding pixel features, forming the pseudo point cloud. Then the dual fusion attention mechanism is designed to fuse the features of point cloud and pseudo point cloud. It improves the efficiency of the fusion network. The experimental results show that this method outperforms existing methods on the large-scale SemanticKITTI benchmark and achieves third place performance on the NuScenes benchmark. Code is available at https://github.com/Pdsn5/DFAMNet.
引用
收藏
页码:3169 / 3180
页数:12
相关论文
共 50 条
  • [1] DFAMNet: dual fusion attention multi-modal network for semantic segmentation on LiDAR point clouds
    Mingjie Li
    Gaihua Wang
    Minghao Zhu
    Chunzheng Li
    Hong Liu
    Xuran Pan
    Qian Long
    [J]. Applied Intelligence, 2024, 54 : 3169 - 3180
  • [2] Dual fusion network for semantic segmentation of point clouds *
    Lu, Jian
    Guo, Huihui
    Jia, Xurui
    Wu, Jiatong
    Chen, Xiaogai
    [J]. OPTICS AND LASERS IN ENGINEERING, 2024, 177
  • [3] Application of Multi-modal Fusion Attention Mechanism in Semantic Segmentation
    Liu, Yunlong
    Yoshie, Osamu
    Watanabe, Hiroshi
    [J]. COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 378 - 397
  • [4] Dual-Attention Deep Fusion Network for Multi-modal Medical Image Segmentation
    Zheng, Shenhai
    Ye, Xin
    Tan, Jiaxin
    Yang, Yifei
    Li, Laquan
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [6] Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
    Kim, Kyungmin
    [J]. Sensors, 2024, 24 (23)
  • [7] EISNet: A Multi-Modal Fusion Network for Semantic Segmentation With Events and Images
    Xie, Bochen
    Deng, Yongjian
    Shao, Zhanpeng
    Li, Youfu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8639 - 8650
  • [8] A Tri-Attention fusion guided multi-modal segmentation network
    Zhou, Tongxue
    Ruan, Su
    Vera, Pierre
    Canu, Stephane
    [J]. PATTERN RECOGNITION, 2022, 124
  • [9] Imbalance knowledge-driven multi-modal network for land-cover semantic segmentation using aerial images and LiDAR point clouds
    Wang, Yameng
    Wan, Yi
    Zhang, Yongjun
    Zhang, Bin
    Gao, Zhi
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 202 : 385 - 404
  • [10] Local Fusion Attention Network for Semantic Segmentation of Building Facade Point Clouds
    Su, Yanfei
    Liu, Weiquan
    Cheng, Ming
    Yuan, Zhimin
    Wang, Cheng
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19