MFSA-Net: Semantic Segmentation With Camera-LiDAR Cross-Attention Fusion Based on Fast Neighbor Feature Aggregation

被引:0
|
作者
Duan, Yijian [1 ]
Meng, Liwen [1 ]
Meng, Yanmei [1 ]
Zhu, Jihong [2 ]
Zhang, Jiacheng [1 ]
Zhang, Jinlai [3 ]
Liu, Xin [1 ]
机构
[1] Guangxi Univ, Coll Mech Engn, Nanning 530004, Peoples R China
[2] Tsinghua Univ, Dept Precis Instrument, Beijing 100000, Peoples R China
[3] Changsha Univ Sci & Technol, Coll Automot & Mech Engn, Changsha 410114, Peoples R China
关键词
Cross-attention; LiDAR point clouds; multimodal; semantic segmentation; NETWORK;
D O I
10.1109/JSTARS.2024.3472751
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Given the inherent limitations of camera-only and LiDAR-only methods in performing semantic segmentation tasks in large-scale complex environments, multimodal information fusion for semantic segmentation has become a focal point of contemporary research. However, significant modal disparities often result in existing fusion-based methods struggling with low segmentation accuracy and limited efficiency in large-scale complex environments. To address these challenges,we propose a semantic segmentation network with camera-LiDAR cross-attention fusion based on fast neighbor feature aggregation (MFSA-Net), which is better suited for large-scale semantic segmentation in complex environments. Initially, we propose a dual-distance attention feature aggregation module based on rapid 3-D nearest neighbor search. This module employs a sliding window method in point cloud perspective projections for swift proximity search, and efficiently combines feature distance and Euclidean distance information to learn more distinctive local features. This improves segmentation accuracy while ensuring computational efficiency. Furthermore, we propose a cross-attention fusion two-stream network based on residual, which allows for more effective integration of camera information into the LiDAR data stream, enhancing both accuracy and robustness. Extensive experimental results on the large-scale point cloud datasets SemanticKITTI and Nuscenes demonstrate that our proposed algorithm outperforms similar algorithms in semantic segmentation performance in large-scale complex environments.
引用
收藏
页码:19627 / 19639
页数:13
相关论文
共 50 条
  • [1] A spatially enhanced network with camera-lidar fusion for 3D semantic segmentation
    Ye, Chao
    Pan, Huihui
    Yu, Xinghu
    Gao, Huijun
    NEUROCOMPUTING, 2022, 484 : 59 - 66
  • [2] Camera-LiDAR Cross-Modality Fusion Water Segmentation for Unmanned Surface Vehicles
    Gao, Jiantao
    Zhang, Jingting
    Liu, Chang
    Li, Xiaomao
    Peng, Yan
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (06)
  • [3] CAFE-Net: Cross-Attention and Feature Exploration Network for polyp segmentation
    Liu, Guoqi
    Yao, Sheng
    Liu, Dong
    Chang, Baofang
    Chen, Zongyu
    Wang, Jiajia
    Wei, Jiangqi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [4] Robust Feature-Based Camera-LiDAR Fusion for Global Localization of a Mobile Robot
    Salam, Yasir
    Li, Yinbei
    Yang, Jiaqiang
    Fan, Wei
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2024, 26 (04): : 40 - 49
  • [5] A Multi-Phase Camera-LiDAR Fusion Network for 3D Semantic Segmentation With Weak Supervision
    Chang, Xuepeng
    Pan, Huihui
    Sun, Weichao
    Gao, Huijun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3737 - 3746
  • [6] A Cross-Attention and Multilevel Feature Fusion Network for Breast Lesion Segmentation in Ultrasound Images
    Liu, Guoqi
    Zhou, Yanan
    Wang, Jiajia
    Chen, Zongyu
    Liu, Dong
    Chang, Baofang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [7] Fast Road Detection by CNN-Based Camera-Lidar Fusion and Spherical Coordinate Transformation
    Lee, Jae-Seol
    Park, Tae-Hyoung
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (09) : 5802 - 5810
  • [8] CASF-Net: Cross-attention and cross-scale fusion network for medical image segmentation
    Zheng, Jianwei
    Liu, Hao
    Feng, Yuchao
    Xu, Jinshan
    Zhao, Liang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 229
  • [9] Lightweight Semantic Segmentation Network based on Attention Feature Fusion
    Kuang, Xianyan
    Liu, Ping
    Chen, Yixi
    Zhang, Jianhua
    ENGINEERING LETTERS, 2023, 31 (04) : 1584 - 1591
  • [10] Feature Fusion Network Based on Hybrid Attention for Semantic Segmentation
    Xie Xinchen
    Li, Chen
    Tian, Lihua
    2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 9 - 14