Vehicle Detection Based on Structure Perception in Point Cloud

被引:0
|
作者
Li Z. [1 ,2 ]
Yao C. [1 ]
Liu Y. [1 ]
Li H. [3 ,4 ]
机构
[1] College of Computer Science and Technology, China University of Petroleum, Qingdao
[2] Shengli College of China University of Petroleum, Dongying
[3] Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing
[4] University of Chinese Academy of Sciences, Beijing
关键词
3D point cloud object detection; Region proposal network; Structure feature;
D O I
10.3724/SP.J.1089.2021.18368
中图分类号
学科分类号
摘要
In the field of automatic driving, computer perception and understanding of the surrounding environment is essential. Compared with 2D object detection, 3D point cloud object detection can provide the three-dimensional information of the object that the 2D object detection does not have. In order to solve the problem of large disparity between the original input point cloud and the detection result in 3D object detection, a region proposal generation module based on structure awareness is proposed, in which the structural features of each point are defined, and the supervision information provided by the 3D point cloud object detection dataset is fully utilized. The network can learn more discriminative features to improve the quality of proposals. Secondly, the feature is added to the proposal fine-tuning stage to enrich the context features and local features of point cloud. Evaluated on KITTI 3D object detection dataset, in the region proposal generation stage, under the IoU threshold of 0.7, using 50 proposals, there is a more than 13% increase in the recall rate compared to previous results. In the proposal fine-tuning stage, the detection results of the 3 difficulty levels objects is obviously improved, indicating the effectiveness of the proposed method for 3D point cloud object detection. © 2021, Beijing China Science Journal Publishing Co. Ltd. All right reserved.
引用
收藏
页码:405 / 412
页数:7
相关论文
共 23 条
  • [1] He K M, Zhang X Y, Ren S Q, Et al., Deep residual learning for image recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
  • [2] Krizhevsky A, Sutskever I, Hinton G E., ImageNet classification with deep convolutional neural networks, Communications of the ACM, 60, 6, pp. 84-90, (2017)
  • [3] Zhu H, Long M S, Wang J M, Et al., Deep hashing network for efficient similarity retrieval, Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp. 2415-2421, (2016)
  • [4] Lin T Y, Goyal P, Girshick R, Et al., Focal loss for dense object detection, Proceedings of the IEEE International Conference on Computer Vision, pp. 2999-3007, (2017)
  • [5] Wang X L, Kong T, Shen C H, Et al., SOLO: segmenting objects by locations
  • [6] Lang A H, Vora S, Caesar H, Et al., PointPillars: fast encoders for object detection from point clouds, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12689-12697, (2019)
  • [7] Yang B, Liang M, Urtasun R., HDNET: exploiting HD maps for 3D object detection, Proceedings of the 2nd Conference on Robot Learning, pp. 146-155, (2018)
  • [8] Beltran J, Guindel C, Moreno F M, Et al., BirdNet: a 3D object detection framework from LiDAR information, Proceedings of the 21st International Conference on Intelligent Transportation Systems, pp. 3517-3523, (2018)
  • [9] Yang B, Luo W J, Urtasun R., PIXOR: real-time 3D object detection from point clouds, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7652-7660, (2018)
  • [10] Zhou J, Lu X Q, Tan X, Et al., FVNet: 3D front-view proposal generation for real-time object detection from point clouds, Proceedings of the 12th International Congress on Image and Sigal Processing, BioMedical Engineering and Informatics, pp. 1-8, (2019)