3D Object Detection Based on Improved Frustum PointNet

被引:3
|
作者
Liu Xunhua [1 ,2 ]
Sun Shaoyuan [1 ,2 ]
Gu Lipeng [1 ,2 ]
Li Xiang [1 ,2 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Donghua Univ, Engn Res Ctr Digitized Text & Fash Technol, Minist Educ, Shanghai 201620, Peoples R China
关键词
machine vision; lidar; point cloud data; 3D object detection; wide-threshold mask processing;
D O I
10.3788/LOP57.201508
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An improved F-PointNet (Frustum PointNet) for 3D target detection on image and lidar point cloud data is proposed. First, the 2D target detection model of the image is used to extract 2D region of the target, and it is mapped to the point cloud data to obtain the candidate region of the target. Then, the 3D target mask of the candidate region is predicted. Finally, the 3D target is detected by using mask. When the mask is predicted, the proposed wide-threshold mask processing is used to reduce the information loss of the original network, the attention mechanism is added to obtain the points and channel layers that require attention, the Focal Loss can solve the imbalance between the target and the background problem. Through multiple comparison experiments, it is proved that wide-threshold mask processing can improve the accuracy of 3D target detection, and the attention mechanism and Focal Loss can improve the accuracy of prediction.
引用
收藏
页数:7
相关论文
共 15 条
  • [1] Multi-View 3D Object Detection Network for Autonomous Driving
    Chen, Xiaozhi
    Ma, Huimin
    Wan, Ji
    Li, Bo
    Xia, Tian
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6526 - 6534
  • [2] Deng J, 2019, IEEE INT C INTELL TR, P279, DOI [10.1109/itsc.2019.8917126, 10.1109/ITSC.2019.8917126]
  • [3] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [4] He K., 2018, IEEE T PATTERN ANAL, DOI [10.1109/TPAMI.2018.2844175, DOI 10.1109/TPAMI.2018.2844175]
  • [5] Jaderberg Max, 2016, Spatial transformer networks, V2, P2017
  • [6] 无人驾驶中3D目标检测方法研究综述
    季一木
    陈治宇
    田鹏浩
    吴飞
    刘尚东
    孙静
    焦志鹏
    王娜
    毕强
    [J]. 南京邮电大学学报(自然科学版), 2019, 39 (04) : 72 - 79
  • [7] Deep Continuous Fusion for Multi-sensor 3D Object Detection
    Liang, Ming
    Yang, Bin
    Wang, Shenlong
    Urtasun, Raquel
    [J]. COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 663 - 678
  • [8] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 318 - 327
  • [9] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [10] Indium Phosphide-Based Near-Infrared Single Photon Avalanche Photodiode Detector Arrays
    Liu Kaibao
    Yang Xiaohong
    He Tingting
    Wang Hui
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (22)