An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

被引:9
|
作者
Tao, Chongben [1 ,2 ]
Cao, Chunlin [1 ]
Cheng, Hanjing [1 ]
Gao, Zhen [3 ]
Luo, Xizhao [4 ]
Zhang, Zuofeng [5 ]
Zheng, Sifa [5 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China
[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada
[4] SOOCHOW Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[5] Tsinghua Univ, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;
D O I
10.1016/j.aei.2023.102069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In most binocular 3D detection algorithms, a large number of anchor points need to be selected, which leads to the problem of slow feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on Stereo Recurrent Convolutional Neutral Network (Stereo RCNN), which is called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework is divided into two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and corresponding sparse anchor boxes are generated from the prior information. Left and right anchors are used as a whole to generate a 2D preselection box. In the second stage, a Feature Pyramid Network (FPN) based on key point generation network is used to generate key points, which are combined with stereo regression to generate 3D preselected boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of center depth on the 3D bounding box. Extensive experiments on the challenging Kitti dataset and NuScences dataset show that the proposed method reduces the computational cost while maintaining a high regression rate without any depth information and prior information of position. Compared to other methods, the proposed method has higher efficiency, better robustness and stronger generalization ability.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] NV2P-RCNN: Feature Aggregation Based on Voxel Neighborhood for 3D Object Detection
    Weile Huo
    Tao Jing
    Shuang Ren
    Neural Processing Letters, 2023, 55 : 6925 - 6945
  • [42] Efficient object detection by prediction in 3D space
    Pang, Yanwei
    Jiang, Xiaoheng
    Li, Xuelong
    Pan, Jing
    SIGNAL PROCESSING, 2015, 112 : 64 - 73
  • [43] NV2P-RCNN: Feature Aggregation Based on Voxel Neighborhood for 3D Object Detection
    Huo, Weile
    Jing, Tao
    Ren, Shuang
    NEURAL PROCESSING LETTERS, 2023, 55 (06) : 6925 - 6945
  • [44] Method of 3D vehicle object detection based on improved VoxelNet
    Zhao, Yi-Fan
    Wu, Shao-Bo
    Dong, Shi-Peng
    Journal of Computers (Taiwan), 2021, 32 (01) : 242 - 255
  • [45] Structure Guided Proposal Completion for 3D Object Detection
    Shi, Chao
    Zhang, Chongyang
    Luo, Yan
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 504 - 520
  • [46] Efficient 3D Object Detection Based on Pseudo-LiDAR Representation
    Meng, Haitao
    Li, Changcai
    Chen, Gang
    Chen, Long
    Knoll, Alois
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1953 - 1964
  • [47] Fast spherical 3D location based on stereo vision
    Wang, Zhong-Li
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2006, 26 (11): : 974 - 977
  • [48] Underwater Object Detection Method Based on Improved Faster RCNN
    Wang, Hao
    Xiao, Nanfeng
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [49] A Fast Unified System for 3D Object Detection and Tracking
    Heitzinger, Thomas
    Kampel, Martin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16998 - 17008
  • [50] Fast 3D Object Motion Detection Algorithm Design
    Li, Shih-An
    Ho, Yun-Hung
    Wong, Ching-Chang
    Feng, Hsuan-Ming
    2024 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE, SEAI 2024, 2024, : 68 - 74