An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

Cited by: 9

Authors:

Tao, Chongben [1,2]

Cao, Chunlin [1]

Cheng, Hanjing [1]

Gao, Zhen [3]

Luo, Xizhao [4]

Zhang, Zuofeng [5]

Zheng, Sifa [5]

Affiliations:

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China

[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada

[4] SOOCHOW Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

[5] Tsinghua Univ, Beijing 100084, Peoples R China

Source:

ADVANCED ENGINEERING INFORMATICS | 2023, Vol. 57

Funding:

National Natural Science Foundation of China;

Keywords:

3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;

DOI:

10.1016/j.aei.2023.102069

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most binocular 3D detection algorithms must select a large number of anchor points, which slows feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on the Stereo Region-based Convolutional Neural Network (Stereo RCNN), called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework has two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and the corresponding sparse anchor boxes are generated from this prior information, and the left and right anchors are treated as a whole to generate a 2D preselection box. In the second stage, a key-point generation network based on a Feature Pyramid Network (FPN) generates key points, which are combined with stereo regression to generate 3D preselection boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of the center depth of the 3D bounding box. Extensive experiments on the challenging KITTI and nuScenes datasets show that the proposed method reduces the computational cost while maintaining a high regression rate, without using any depth information or prior position information. Compared with other methods, the proposed method has higher efficiency, better robustness, and stronger generalization ability.
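Two of the steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the 0.5 threshold, the use of the median, and the camera values are all assumptions made for illustration. The first function keeps only locations whose foreground probability is high, which is one simple way to obtain sparse anchor points from a probability map. The second converts the disparity values inside an instance mask into a single depth using the standard stereo relation depth = focal length × baseline / disparity.

```python
from statistics import median


def sparse_anchor_points(prob_map, threshold=0.5):
    """Return (row, col) locations whose foreground probability exceeds threshold.

    The threshold value is a hypothetical choice, not taken from the paper.
    """
    return [
        (r, c)
        for r, row in enumerate(prob_map)
        for c, p in enumerate(row)
        if p > threshold
    ]


def center_depth_from_disparity(disparity, mask, focal_px, baseline_m):
    """Estimate one depth (in meters) for an instance from its masked disparities.

    Uses the median disparity inside the mask (an assumption for robustness)
    and the pinhole stereo relation depth = focal_px * baseline_m / disparity.
    """
    values = [
        d
        for d_row, m_row in zip(disparity, mask)
        for d, m in zip(d_row, m_row)
        if m and d > 0  # skip background pixels and invalid disparities
    ]
    if not values:
        raise ValueError("mask selects no valid disparity values")
    return focal_px * baseline_m / median(values)


# Toy foreground probability map (values are made up).
prob_map = [
    [0.1, 0.2, 0.1],
    [0.2, 0.9, 0.7],
    [0.1, 0.6, 0.3],
]
anchors = sparse_anchor_points(prob_map, threshold=0.5)
print(anchors)  # [(1, 1), (1, 2), (2, 1)]

# Toy disparity map (pixels) and instance mask; camera values are hypothetical.
disparity = [
    [0.0, 38.0, 40.0],
    [0.0, 40.0, 42.0],
]
mask = [
    [0, 1, 1],
    [0, 1, 1],
]
depth = center_depth_from_disparity(disparity, mask, focal_px=720.0, baseline_m=0.5)
print(depth)  # 9.0
```

In this toy example only three of the nine locations become anchors, which is the kind of reduction in anchor count that the abstract credits for faster feature extraction.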

Pages: 11
