DyFusion: Cross-Attention 3D Object Detection with Dynamic Fusion

被引:5
|
作者
Bi, Jiangfeng [1 ]
Wei, Haiyue [1 ]
Zhang, Guoxin [1 ]
Yang, Kuihe [1 ]
Song, Ziying [2 ]
机构
[1] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
[2] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing Key Lab Traff Data Anal & Min, Beijing, Peoples R China
关键词
cross-attention dynamic fusion; synchronous data augmentation; 3D object detection; CNN;
D O I
10.1109/TLA.2024.10412035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the realm of autonomous driving, LiDAR and camera sensors play an indispensable role, furnishing pivotal observational data for the critical task of precise 3D object detection. Existing fusion algorithms effectively utilize the complementary data from both sensors. However, these methods typically concatenate the raw point cloud data and pixel-level image features, unfortunately, a process that introduces errors and results in the loss of critical information embedded in each modality. To mitigate the problem of lost feature information, this paper proposes a Cross-Attention Dynamic Fusion (CADF) strategy that dynamically fuses the two heterogeneous data sources. In addition, we acknowledge the issue of insufficient data augmentation for these two diverse modalities. To combat this, we propose a Synchronous Data Augmentation (SDA) strategy designed to enhance training efficiency. We have tested our method using the KITTI and nuScenes datasets, and the results have been promising. Remarkably, our top-performing model attained an 82.52% mAP on the KITTI test benchmark, outperforming other state-of-the-art methods.
引用
收藏
页码:106 / 112
页数:7
相关论文
共 50 条
  • [1] CAF-RCNN: multimodal 3D object detection with cross-attention
    Liu, Junting
    Liu, Deer
    Zhu, Lei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (19) : 6131 - 6146
  • [2] Spatial Cross-Attention RGB-D Fusion Module for Object Detection
    Gao, Shangyin
    Markhasin, Lev
    Wang, Bi
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [3] CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection
    Hwang, Jyh-Jing
    Kretzschmar, Henrik
    Manela, Joshua
    Rafferty, Sean
    Armstrong-Crews, Nicholas
    Chen, Tiffany
    Anguelov, Dragomir
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 388 - 405
  • [4] ICAFusion: Iterative cross-attention guided feature fusion for multispectral object detection
    Shen, Jifeng
    Chen, Yifei
    Liu, Yue
    Zuo, Xin
    Fan, Heng
    Yang, Wankou
    PATTERN RECOGNITION, 2024, 145
  • [5] FusionPainting: Multimodal Fusion with Adaptive Attention for 3D Object Detection
    Xu, Shaoqing
    Zhou, Dingfu
    Fang, Jin
    Yin, Junbo
    Bin, Zhou
    Zhang, Liangjun
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 3047 - 3054
  • [6] AEPF: Attention-Enabled Point Fusion for 3D Object Detection
    Sharma, Sachin
    Meyer, Richard T.
    Asher, Zachary D.
    SENSORS, 2024, 24 (17)
  • [7] Cascaded Cross-Modality Fusion Network for 3D Object Detection
    Chen, Zhiyu
    Lin, Qiong
    Sun, Jing
    Feng, Yujian
    Liu, Shangdong
    Liu, Qiang
    Ji, Yimu
    Xu, He
    SENSORS, 2020, 20 (24) : 1 - 14
  • [8] 3D object detection based on fusion of point cloud and image by mutual attention
    Chen J.-Y.
    Bai T.-Y.
    Zhao L.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2021, 29 (09): : 2247 - 2254
  • [9] 3D Object Detection Based on Attention and Multi-Scale Feature Fusion
    Liu, Minghui
    Ma, Jinming
    Zheng, Qiuping
    Liu, Yuchen
    Shi, Gang
    SENSORS, 2022, 22 (10)
  • [10] BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera
    Liu, Min
    Jia, Yuanjun
    Lyu, Youhao
    Dong, Qi
    Yang, Yanyu
    SENSORS, 2024, 24 (14)