DMFF: dual-way multimodal feature fusion for 3D object detection

Cited by: 0
Authors
Dong, Xiaopeng [1]
Di, Xiaoguang [1]
Wang, Wenzhuang [1]
Affiliations
[1] Harbin Inst Technol, Control & Simulat Ctr, Harbin, Peoples R China
Funding
Natural Science Foundation of Heilongjiang Province
Keywords
3D object detection; Multimodal feature fusion; Self-attention mechanism; Lidar point clouds; RGB images;
DOI
10.1007/s11760-023-02772-z
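Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract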
Multimodal 3D object detection, which fuses complementary information from LiDAR data and RGB images, has recently become an active research topic. However, fusing images and point clouds is not trivial because the two modalities have different representations, and inadequate feature fusion degrades detection performance. To address these problems, we convert images into pseudo point clouds using depth completion and adopt a more efficient feature fusion method. In this paper, we propose a dual-way multimodal feature fusion network (DMFF) for 3D object detection. Specifically, we first use a dual-stream feature extraction module (DSFE) to generate homogeneous LiDAR and pseudo region of interest (RoI) features. Then, we propose a dual-way feature interaction method (DWFI) that enables intermodal and intramodal interaction between the two features. Next, we design a local attention feature fusion module (LAFF) to select the input features that are most likely to contribute to the desired output. The proposed DMFF achieves state-of-the-art performance on the KITTI dataset.
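The abstract's first step, converting an image into a pseudo point cloud, can be illustrated with a minimal sketch. It assumes a dense depth map is already available (for example, the output of a depth completion network) and a simple pinhole camera model. The function name `depth_to_pseudo_points` and the intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical and do not come from the paper, which may use a different procedure.

```python
from typing import List, Sequence, Tuple

# One pseudo point: camera-frame coordinates plus the pixel's colour.
Point = Tuple[float, float, float, int, int, int]


def depth_to_pseudo_points(
    depth: Sequence[Sequence[float]],
    rgb: Sequence[Sequence[Tuple[int, int, int]]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point]:
    """Back-project a dense depth map into a coloured pseudo point cloud.

    Assumption: pinhole camera with focal lengths (fx, fy) and principal
    point (cx, cy); depth values are metric and <= 0 marks invalid pixels.
    """
    points: List[Point] = []
    for v, row in enumerate(depth):          # v: pixel row
        for u, z in enumerate(row):          # u: pixel column
            if z <= 0.0:
                continue                     # skip pixels without depth
            x = (u - cx) * z / fx            # inverse pinhole projection
            y = (v - cy) * z / fy
            r, g, b = rgb[v][u]
            points.append((x, y, z, r, g, b))
    return points


if __name__ == "__main__":
    # Toy 2x2 image; one pixel has no valid depth.
    toy_depth = [[2.0, 0.0], [4.0, 2.0]]
    toy_rgb = [[(255, 0, 0), (0, 255, 0)], [(0, 0, 255), (9, 9, 9)]]
    for p in depth_to_pseudo_points(toy_depth, toy_rgb, 2.0, 2.0, 0.5, 0.5):
        print(p)
```

The resulting points are expressed in the camera frame. A real pipeline would typically also transform them into the LiDAR frame using the dataset's calibration before extracting features from both streams.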
Pages: 455-463
Number of pages: 9