An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

被引：9

作者：

Tao, Chongben ^{[1
,2
]}

Cao, Chunlin ^{[1
]}

Cheng, Hanjing ^{[1
]}

Gao, Zhen ^{[3
]}

Luo, Xizhao ^{[4
]}

Zhang, Zuofeng ^{[5
]}

Zheng, Sifa ^{[5
]}

机构：

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China

[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada

[4] SOOCHOW Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

[5] Tsinghua Univ, Beijing 100084, Peoples R China

来源：

ADVANCED ENGINEERING INFORMATICS | 2023年 / 57卷

基金：

中国国家自然科学基金;

关键词：

3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;

D O I：

10.1016/j.aei.2023.102069

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In most binocular 3D detection algorithms, a large number of anchor points need to be selected, which leads to the problem of slow feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on Stereo Recurrent Convolutional Neutral Network (Stereo RCNN), which is called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework is divided into two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and corresponding sparse anchor boxes are generated from the prior information. Left and right anchors are used as a whole to generate a 2D preselection box. In the second stage, a Feature Pyramid Network (FPN) based on key point generation network is used to generate key points, which are combined with stereo regression to generate 3D preselected boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of center depth on the 3D bounding box. Extensive experiments on the challenging Kitti dataset and NuScences dataset show that the proposed method reduces the computational cost while maintaining a high regression rate without any depth information and prior information of position. Compared to other methods, the proposed method has higher efficiency, better robustness and stronger generalization ability.

引用

页数：11

共 50 条

[41] NV2P-RCNN: Feature Aggregation Based on Voxel Neighborhood for 3D Object Detection
Weile Huo
Tao Jing
Shuang Ren
Neural Processing Letters, 2023, 55 : 6925 - 6945
[42] Efficient object detection by prediction in 3D space
Pang, Yanwei
Jiang, Xiaoheng
Li, Xuelong
Pan, Jing
SIGNAL PROCESSING, 2015, 112 : 64 - 73
[43] NV2P-RCNN: Feature Aggregation Based on Voxel Neighborhood for 3D Object Detection
Huo, Weile
Jing, Tao
Ren, Shuang
NEURAL PROCESSING LETTERS, 2023, 55 (06) : 6925 - 6945
[44] Method of 3D vehicle object detection based on improved VoxelNet
Zhao, Yi-Fan
Wu, Shao-Bo
Dong, Shi-Peng
Journal of Computers (Taiwan), 2021, 32 (01) : 242 - 255
[45] Structure Guided Proposal Completion for 3D Object Detection
Shi, Chao
Zhang, Chongyang
Luo, Yan
COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 504 - 520
[46] Efficient 3D Object Detection Based on Pseudo-LiDAR Representation
Meng, Haitao
Li, Changcai
Chen, Gang
Chen, Long
Knoll, Alois
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1953 - 1964
[47] Fast spherical 3D location based on stereo vision
Wang, Zhong-Li
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2006, 26 (11): : 974 - 977
[48] Underwater Object Detection Method Based on Improved Faster RCNN
Wang, Hao
Xiao, Nanfeng
APPLIED SCIENCES-BASEL, 2023, 13 (04):
[49] A Fast Unified System for 3D Object Detection and Tracking
Heitzinger, Thomas
Kampel, Martin
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16998 - 17008
[50] Fast 3D Object Motion Detection Algorithm Design
Li, Shih-An
Ho, Yun-Hung
Wong, Ching-Chang
Feng, Hsuan-Ming
2024 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE, SEAI 2024, 2024, : 68 - 74

← 1 2 3 4 5 →