PAFNet: Pillar Attention Fusion Network for Vehicle-Infrastructure Cooperative Target Detection Using LiDAR

被引:1
|
作者
Wang, Luyang [1 ,2 ]
Lan, Jinhui [1 ,2 ]
Li, Min [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Dept Instrument Sci & Technol, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing Engn Res Ctr Ind Spectrum Imaging, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
来源
SYMMETRY-BASEL | 2024年 / 16卷 / 04期
关键词
vehicle-infrastructure cooperative; LiDAR; target detection; feature fusion;
D O I
10.3390/sym16040401
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the development of autonomous driving, consensus is gradually forming around vehicle-infrastructure cooperative (VIC) autonomous driving. The VIC environment-sensing system uses roadside sensors in collaboration with automotive sensors to capture traffic target information symmetrically from both the roadside and the vehicle, thus extending the perception capabilities of autonomous driving vehicles. However, the current target detection accuracy for feature fusion based on roadside LiDAR and automotive LiDAR is relatively low, making it difficult to satisfy the sensing requirements of autonomous vehicles. This paper proposes PAFNet, a VIC pillar attention fusion network for target detection, aimed at improving LiDAR target detection accuracy under feature fusion. The proposed spatial and temporal cooperative fusion preprocessing method ensures the accuracy of the fused features through frame matching and coordinate transformation of the point cloud. In addition, this paper introduces the first anchor-free method for 3D target detection for VIC feature fusion, using a centroid-based approach for target detection. In the feature fusion stage, we propose the grid attention feature fusion method. This method uses the spatial feature attention mechanism to fuse the roadside and vehicle-side features. The experiment on the DAIR-V2X-C dataset shows that PAFNet achieved a 6.92% higher detection accuracy in 3D target detection than FFNet in urban scenes.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] A LiDAR Semantic Segmentation Framework for the Cooperative Vehicle-Infrastructure System
    Liu, Hongwei
    Gu, Zihao
    Wang, Chao
    Wang, Ping
    Vukobratovic, Dejan
    [J]. 2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [2] Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection
    Yu, Haibao
    Tang, Yingjuan
    Xie, Enze
    Mao, Jilei
    Luo, Ping
    Nie, Zaiqing
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] A method of vehicle-infrastructure cooperative perception based vehicle state information fusion using improved kalman filter
    Mo, Yanghui
    Zhang, Peilin
    Chen, Zhijun
    Ran, Bin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (04) : 4603 - 4620
  • [4] A method of vehicle-infrastructure cooperative perception based vehicle state information fusion using improved kalman filter
    Yanghui Mo
    Peilin Zhang
    Zhijun Chen
    Bin Ran
    [J]. Multimedia Tools and Applications, 2022, 81 : 4603 - 4620
  • [5] A Cooperative Vehicle-Infrastructure System for Road Hazards Detection With Edge Intelligence
    Chen, Chen
    Yao, Guorun
    Liu, Lei
    Pei, Qingqi
    Song, Houbing
    Dustdar, Schahram
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5186 - 5198
  • [6] Roadside sensor network deployment based on vehicle-infrastructure cooperative intelligent driving
    An X.
    Cai B.
    [J]. International Journal of Intelligent Networks, 2023, 4 : 283 - 300
  • [7] ViT-FuseNet: Multimodal Fusion of Vision Transformer for Vehicle-Infrastructure Cooperative Perception
    Zhou, Yang
    Yang, Cai
    Wang, Ping
    Wang, Chao
    Wang, Xinhong
    Van, Nguyen Ngoc
    [J]. IEEE ACCESS, 2024, 12 : 31640 - 31651
  • [8] CoFormerNet: A Transformer-Based Fusion Approach for Enhanced Vehicle-Infrastructure Cooperative Perception
    Li, Bin
    Zhao, Yanan
    Tan, Huachun
    [J]. SENSORS, 2024, 24 (13)
  • [9] Occlusion-guided multi-modal fusion for vehicle-infrastructure cooperative 3D object detection
    Chu, Huazhen
    Liu, Haizhuang
    Zhuo, Junbao
    Chen, Jiansheng
    Ma, Huimin
    [J]. PATTERN RECOGNITION, 2025, 157
  • [10] TransIFF: An Instance-Level Feature Fusion Framework for Vehicle-Infrastructure Cooperative 3D Detection with Transformers
    Chen, Ziming
    Shi, Yifeng
    Jia, Jinrang
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18159 - 18168