DMFF: dual-way multimodal feature fusion for 3D object detection

被引:0
|
作者
Dong, Xiaopeng [1 ]
Di, Xiaoguang [1 ]
Wang, Wenzhuang [1 ]
机构
[1] Harbin Inst Technol, Control & Simulat Ctr, Harbin, Peoples R China
基金
黑龙江省自然科学基金;
关键词
3D object detection; Multimodal feature fusion; Self-attention mechanism; Lidar point clouds; RGB images;
D O I
10.1007/s11760-023-02772-z
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, multimodal 3D object detection that fuses the complementary information from LiDAR data and RGB images has been an active research topic. However, it is not trivial to fuse images and point clouds because of different representations of them. Inadequate feature fusion also brings bad effects on detection performance. We convert images into pseudo point clouds by using a depth completion and utilize a more efficient feature fusion method to address the problems. In this paper, we propose a dual-way multimodal feature fusion network (DMFF) for 3D object detection. Specifically, we first use a dual stream feature extraction module (DSFE) to generate homogeneous LiDAR and pseudo region of interest (RoI) features. Then, we propose a dual-way feature interaction method (DWFI) that enables intermodal and intramodal interaction of the two features. Next, we design a local attention feature fusion module (LAFF) to select which features of the input are more likely to contribute to the desired output. In addition, the proposed DMFF achieves the state-of-the-art performances on the KITTI Dataset.
引用
收藏
页码:455 / 463
页数:9
相关论文
共 50 条
  • [1] DMFF: dual-way multimodal feature fusion for 3D object detection
    Xiaopeng Dong
    Xiaoguang Di
    Wenzhuang Wang
    [J]. Signal, Image and Video Processing, 2024, 18 (1) : 455 - 463
  • [2] 3D Multimodal Sensing and Feedback Finger Case for Immersive Dual-Way Interaction
    Chen, Tao
    Dai, Zhiwei
    Liu, Ming
    Zhao, Yudong
    Ling, Hao
    Sun, Lining
    He, Haidong
    Lee, Chengkuo
    Zhu, Minglu
    [J]. ADVANCED MATERIALS TECHNOLOGIES, 2024, 9 (05)
  • [3] MFF-Net: Multimodal Feature Fusion Network for 3D Object Detection
    Shi, Peicheng
    Liu, Zhiqiang
    Qi, Heng
    Yang, Aixi
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 5615 - 5637
  • [4] FusionPainting: Multimodal Fusion with Adaptive Attention for 3D Object Detection
    Xu, Shaoqing
    Zhou, Dingfu
    Fang, Jin
    Yin, Junbo
    Bin, Zhou
    Zhang, Liangjun
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 3047 - 3054
  • [5] Multi-feature Fusion VoteNet for 3D Object Detection
    Wang, Zhoutao
    Xie, Qian
    Wei, Mingqiang
    Long, Kun
    Wang, Jun
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [6] Dual-domain deformable feature fusion for multi-modal 3D object detection
    Wang, Shihao
    Deng, Tao
    [J]. Journal of Electronic Imaging, 2024, 33 (06)
  • [7] Anti-Noise 3D Object Detection of Multimodal Feature Attention Fusion Based on PV-RCNN
    Zhu, Yuan
    Xu, Ruidong
    An, Hao
    Tao, Chongben
    Lu, Ke
    [J]. SENSORS, 2023, 23 (01)
  • [8] 3D Object Detection Based on Feature Fusion of Point Cloud Sequences
    Zhai, Zhenyu
    Wang, Qiantong
    Pan, Zongxu
    Hu, Wenlong
    Hu, Yuxin
    [J]. 2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1240 - 1245
  • [9] MMFG: Multimodal-based Mutual Feature Gating 3D Object Detection
    Xu, Wanpeng
    Fu, Zhipeng
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (02)
  • [10] A Multimodal 3D Object Detection Method Based on Double-Fusion Framework
    Ge T.-A.
    Li H.
    Guo Y.
    Wang J.-Y.
    Zhou D.
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3100 - 3110