An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

被引：9

作者：

Tao, Chongben ^{[1
,2
]}

Cao, Chunlin ^{[1
]}

Cheng, Hanjing ^{[1
]}

Gao, Zhen ^{[3
]}

Luo, Xizhao ^{[4
]}

Zhang, Zuofeng ^{[5
]}

Zheng, Sifa ^{[5
]}

机构：

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China

[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada

[4] SOOCHOW Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

[5] Tsinghua Univ, Beijing 100084, Peoples R China

来源：

ADVANCED ENGINEERING INFORMATICS | 2023年 / 57卷

基金：

中国国家自然科学基金;

关键词：

3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;

D O I：

10.1016/j.aei.2023.102069

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In most binocular 3D detection algorithms, a large number of anchor points need to be selected, which leads to the problem of slow feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on Stereo Recurrent Convolutional Neutral Network (Stereo RCNN), which is called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework is divided into two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and corresponding sparse anchor boxes are generated from the prior information. Left and right anchors are used as a whole to generate a 2D preselection box. In the second stage, a Feature Pyramid Network (FPN) based on key point generation network is used to generate key points, which are combined with stereo regression to generate 3D preselected boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of center depth on the 3D bounding box. Extensive experiments on the challenging Kitti dataset and NuScences dataset show that the proposed method reduces the computational cost while maintaining a high regression rate without any depth information and prior information of position. Compared to other methods, the proposed method has higher efficiency, better robustness and stronger generalization ability.

引用

页数：11

共 50 条

[1] An anchor-guided 3D target detection algorithm based on stereo RCNN
Cao J.
Tao C.
Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2021, 42 (12): : 191 - 201
[2] ESGN: Efficient Stereo Geometry Network for Fast 3D Object Detection
Gao, Aqi
Pang, Yanwei
Nie, Jing
Shao, Zhuang
Cao, Jiale
Guo, Yishun
Li, Xuelong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2000 - 2009
[3] Reinforced Voxel-RCNN: An Efficient 3D Object Detection Method Based on Feature Aggregation*
Jiang, Jia-ji
Wan, Hai-bin
Sun, Hong-min
Qin, Tuan-fa
Wang, Zheng-qiang
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (09) : 1228 - 1238
[4] SGM3D: Stereo Guided Monocular 3D Object Detection
Zhou, Zheyuan
Du, Liang
Ye, Xiaoqing
Zou, Zhikang
Tan, Xiao
Zhang, Li
Xue, Xiangyang
Feng, Jianfeng
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10478 - 10485
[5] A 3D Circular Object Detection Method Based on Binocular Stereo Vision
Chen, Zhaoxue
Li, Mengzhuo
Yu, Haizhong
2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
[6] PLUMENet: Efficient 3D Object Detection from Stereo Images
Wang, Yan
Yang, Bin
Hu, Rui
Liang, Ming
Urtasun, Raquel
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3383 - 3390
[7] Confidence Guided Stereo 3D Object Detection with Split Depth Estimation
Li, Chengyao
Ku, Jason
Waslander, Steven L.
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5776 - 5783
[8] AFE-RCNN: Adaptive Feature Enhancement RCNN for 3D Object Detection
Shuang, Feng
Huang, Hanzhang
Li, Yong
Qu, Rui
Li, Pei
REMOTE SENSING, 2022, 14 (05)
[9] An object boundary detection system based on a 3D stereo monitor
Zhang, Shuqun
Furia, Bryan
APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVII, 2014, 9217
[10] Research on 3D object optimal grasping method based on cascaded Faster RCNN
Chen D.
Lin Q.
Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2019, 40 (04): : 229 - 237

← 1 2 3 4 5 →