DyFusion: Cross-Attention 3D Object Detection with Dynamic Fusion

被引:5
|
作者
Bi, Jiangfeng [1 ]
Wei, Haiyue [1 ]
Zhang, Guoxin [1 ]
Yang, Kuihe [1 ]
Song, Ziying [2 ]
机构
[1] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
[2] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing Key Lab Traff Data Anal & Min, Beijing, Peoples R China
关键词
cross-attention dynamic fusion; synchronous data augmentation; 3D object detection; CNN;
D O I
10.1109/TLA.2024.10412035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the realm of autonomous driving, LiDAR and camera sensors play an indispensable role, furnishing pivotal observational data for the critical task of precise 3D object detection. Existing fusion algorithms effectively utilize the complementary data from both sensors. However, these methods typically concatenate the raw point cloud data and pixel-level image features, unfortunately, a process that introduces errors and results in the loss of critical information embedded in each modality. To mitigate the problem of lost feature information, this paper proposes a Cross-Attention Dynamic Fusion (CADF) strategy that dynamically fuses the two heterogeneous data sources. In addition, we acknowledge the issue of insufficient data augmentation for these two diverse modalities. To combat this, we propose a Synchronous Data Augmentation (SDA) strategy designed to enhance training efficiency. We have tested our method using the KITTI and nuScenes datasets, and the results have been promising. Remarkably, our top-performing model attained an 82.52% mAP on the KITTI test benchmark, outperforming other state-of-the-art methods.
引用
收藏
页码:106 / 112
页数:7
相关论文
共 50 条
  • [31] Object DGCNN: 3D Object Detection using Dynamic Graphs
    Wang, Yue
    Solomon, Justin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [32] Deception Detection System with Joint Cross-Attention
    Jiang, Peili
    Wang, Yunfan
    Li, Jiajun
    Wang, Ziyang
    PROCEEDINGS OF THE 2024 6TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING SYSTEMS, SSPS 2024, 2024, : 40 - 47
  • [33] BCAF-3D: Bilateral Content Awareness Fusion for cross-modal 3D object detection
    Chen, Mu
    Liu, Pengfei
    Zhao, Huaici
    KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [34] Towards Raw Sensor Fusion in 3D Object Detection
    Rovid, Andras
    Remeli, Viktor
    2019 IEEE 17TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2019), 2019, : 293 - 298
  • [35] VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention
    Deng, Shengheng
    Liang, Zhihao
    Sun, Lin
    Jia, Kui
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8438 - 8447
  • [36] Cross-Attention Regression Flow for Defect Detection
    Liu, Binhui
    Guo, Tianchu
    Luo, Bin
    Cui, Zhen
    Yang, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5183 - 5193
  • [37] Investigating Attention Mechanism in 3D Point Cloud Object Detection
    Qiu, Shi
    Wu, Yunfan
    Anwar, Saeed
    Li, Chongyi
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 403 - 412
  • [38] Attention-based Proposals Refinement for 3D Object Detection
    Minh-Quan Dao
    Hery, Elwan
    Fremont, Vincent
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 197 - 205
  • [39] 3D Object Detection with Attention: Shell-Based Modeling
    Zhang X.
    Zhao Z.
    Sun W.
    Cui Q.
    Computer Systems Science and Engineering, 2023, 46 (01): : 537 - 550
  • [40] ARPNET: attention region proposal network for 3D object detection
    Yangyang Ye
    Chi Zhang
    Xiaoli Hao
    Science China Information Sciences, 2019, 62