Multi-Sensor Fusion 3D Object Detection Based on Multi-Frame Information

被引:0
|
作者
Wu S. [1 ]
Geng J. [1 ]
Wu C. [1 ]
Yan Z. [1 ]
Chen K. [1 ]
机构
[1] School of Mechanical Engineering, Beijing Institute of Technology, Beijing
关键词
multi-frame feature; multi-sensor fusion; object detection;
D O I
10.15918/j.tbit1001-0645.2023.025
中图分类号
学科分类号
摘要
In order to improve the effectiveness of multi-sensor fusion in 3D object detection and improve the accuracy of object detection in the utilization of the feature correlation between the front and back frames, a multi-sensor feature fusion 3D object detection network based on multi frame information was proposed. Firstly, using a feature mapping module based on guidance points to convert the camera perspective features of the image into aerial features, the point cloud features and image features were fused with an adaptive fusion module. After-wards, utilizing historical frame tracking information, multiple frame features were fused. Finally, a detection head CenterPoint was used to detect the objects and to test the 3D object detection network with a dataset nuScenes and real vehicles. The experimental results show that the network can provide higher accuracy and real-time performance. © 2023 Beijing Institute of Technology. All rights reserved.
引用
收藏
页码:1282 / 1289
页数:7
相关论文
共 14 条
  • [1] WANG Jianzhong, WANG Jiale, YU Zibo, Et al., Multi-scale detection method for soldier and armored vehicle objects, Transactions of Beijing institute of Technology, 43, 2, pp. 203-212, (2023)
  • [2] CHANG Liang, BAI Jie, HUANG Libo, Multi-target detection based on camera and radar feature fusion networks, Transactions of Beijing institute of Technology, 42, 3, pp. 318-323, (2022)
  • [3] LAW H, DENG J., CornerNet: detecting objects as paired keypoints, International Journal of Computer Vision, 128, 2, pp. 642-656, (2020)
  • [4] LIU Z, WU Z, TOTH R., Smoke: Single-stage monocular 3d object detection via keypoint estimation, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 996-997, (2020)
  • [5] WANG T, ZHU X, PANG J, Et al., Fcos3d: Fully convolutional one-stage monocular 3d object detection, Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 913-922, (2021)
  • [6] CHEN X, MA H, WAN J, Et al., Multi-view 3D object detection network for autonomous driving, Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, pp. 1907-1915, (2017)
  • [7] ZHOU Y, TUZEL O., Voxelnet: End-to-end learning for point cloud based 3d object detection, Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4490-4499, (2018)
  • [8] LANG A H, VORA S, CAESAR H, Et al., Pointpillars: Fast encoders for object detection from point clouds, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12697-12705, (2019)
  • [9] YIN T, ZHOU X, KRAHENBUHL P., Center-based 3d object detection and tracking, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 11784-11793, (2021)
  • [10] SHI S, WANG X, LI H., Pointrcnn: 3d object proposal generation and detection from point cloud, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 770-779, (2019)