An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

被引:9
|
作者
Tao, Chongben [1 ,2 ]
Cao, Chunlin [1 ]
Cheng, Hanjing [1 ]
Gao, Zhen [3 ]
Luo, Xizhao [4 ]
Zhang, Zuofeng [5 ]
Zheng, Sifa [5 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China
[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada
[4] SOOCHOW Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[5] Tsinghua Univ, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;
D O I
10.1016/j.aei.2023.102069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In most binocular 3D detection algorithms, a large number of anchor points need to be selected, which leads to the problem of slow feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on Stereo Recurrent Convolutional Neutral Network (Stereo RCNN), which is called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework is divided into two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and corresponding sparse anchor boxes are generated from the prior information. Left and right anchors are used as a whole to generate a 2D preselection box. In the second stage, a Feature Pyramid Network (FPN) based on key point generation network is used to generate key points, which are combined with stereo regression to generate 3D preselected boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of center depth on the 3D bounding box. Extensive experiments on the challenging Kitti dataset and NuScences dataset show that the proposed method reduces the computational cost while maintaining a high regression rate without any depth information and prior information of position. Compared to other methods, the proposed method has higher efficiency, better robustness and stronger generalization ability.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Stereo Point Cloud Refinement for 3D Object Detection
    Liu, Wangchao
    Wang, Teng
    Wang, Yang
    Zhang, Xiangyu
    Lou, Xin
    2021 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2021) & 2021 IEEE CONFERENCE ON POSTGRADUATE RESEARCH IN MICROELECTRONICS AND ELECTRONICS (PRIMEASIA 2021), 2021, : 61 - 64
  • [22] 3D Cascade RCNN: High Quality Object Detection in Point Clouds
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Mei, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5706 - 5719
  • [23] An Anchor-Free 3D Object Detection Approach Based on Hierarchical Pillars
    Ren, Xudie
    Li, Shenghong
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [24] A workpiece grasp detection method based on 3D object detection
    Li, Huijun
    Duan, Longbo
    Wang, Qirun
    Zhang, Yilun
    Ye, Bin
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2025,
  • [25] U-Select RCNN: An Effective Voxel-based 3D Object Detection Method with Feature Selection Strategy
    Zhang, Zhenghong
    Wang, Meiling
    Zhao, Lin
    Yue, Yufeng
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 3581 - 3586
  • [26] DeepSDP: A Real-Time Deep Stereo Detection and Positioning Method for 3D Object Detection
    Moradi, Homayoun
    Karami, Mohammad
    Shamaghdari, Saeed
    2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 1309 - 1313
  • [27] Improved 3D Object Detection Method Based on PointPillars
    Han, Zhenguo
    Li, Xu
    Xu, Hengxin
    Song, Hongzheng
    2024 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND INTELLIGENT SYSTEMS ENGINEERING, MLISE 2024, 2024, : 163 - 166
  • [28] Stereo R-CNN based 3D Object Detection for Autonomous Driving
    Li, Peiliang
    Chen, Xiaozhi
    Shen, Shaojie
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7636 - 7644
  • [29] A Method based on Faster RCNN Network for Object Detection
    Cao D.
    Yang S.
    Recent Advances in Computer Science and Communications, 2022, 15 (09) : 1239 - 1244
  • [30] Pose-RCNN: Joint Object Detection and Pose Estimation Using 3D Object Proposals
    Braun, Markus
    Rao, Qing
    Wang, Yikang
    Flohr, Fabian
    2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2016, : 1546 - 1551