An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

Cited by: 9

Authors:

Tao, Chongben [1,2]

Cao, Chunlin [1]

Cheng, Hanjing [1]

Gao, Zhen [3]

Luo, Xizhao [4]

Zhang, Zuofeng [5]

Zheng, Sifa [5]

Affiliations:

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China

[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada

[4] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

[5] Tsinghua Univ, Beijing 100084, Peoples R China

Source:

ADVANCED ENGINEERING INFORMATICS | 2023 / Vol. 57

Funding:

National Natural Science Foundation of China;

Keywords:

3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;

DOI:

10.1016/j.aei.2023.102069

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most binocular 3D detection algorithms must select a large number of anchor points, which slows feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on the Stereo Region-based Convolutional Neural Network (Stereo RCNN), called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework is divided into two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and the corresponding sparse anchor boxes are generated from this prior information. The left and right anchors are treated as a whole to generate 2D preselection boxes. In the second stage, a Feature Pyramid Network (FPN) based key-point generation network is used to generate key points, which are combined with stereo regression to generate 3D preselection boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of the center depth of the 3D bounding box. Extensive experiments on the challenging KITTI and nuScenes datasets show that the proposed method reduces the computational cost while maintaining a high regression rate without any depth information or prior position information. Compared to other methods, the proposed method has higher efficiency, better robustness and stronger generalization ability.
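The first-stage idea in the abstract, keeping anchor points only where the probability map indicates foreground, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `sparse_anchor_points`, the threshold of 0.5, the feature stride of 8, and the synthetic probability map are all assumptions made for illustration.

```python
import numpy as np

def sparse_anchor_points(prob_map, threshold=0.5, stride=8):
    """Return (x, y) image coordinates of likely-foreground cells.

    prob_map:  2D array of foreground probabilities on the feature map.
    threshold: assumed cut-off, not a value taken from the paper.
    stride:    assumed down-sampling factor between image and feature map.
    """
    # Keep only feature-map cells whose probability exceeds the threshold.
    ys, xs = np.nonzero(prob_map > threshold)
    # Map each kept cell to the centre of the image patch it covers.
    return np.stack([xs, ys], axis=1) * stride + stride // 2

# Synthetic probability map with one foreground blob (illustrative only).
prob_map = np.zeros((48, 156))
prob_map[20:26, 60:70] = 0.9

points = sparse_anchor_points(prob_map)
print(len(points), "of", prob_map.size, "locations kept")
```

In this toy case only the cells inside the blob survive, so later stages would place anchor boxes at a small fraction of the locations a dense scheme would visit.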


Pages: 11
