An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

被引:9
|
作者
Tao, Chongben [1 ,2 ]
Cao, Chunlin [1 ]
Cheng, Hanjing [1 ]
Gao, Zhen [3 ]
Luo, Xizhao [4 ]
Zhang, Zuofeng [5 ]
Zheng, Sifa [5 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China
[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada
[4] SOOCHOW Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[5] Tsinghua Univ, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;
D O I
10.1016/j.aei.2023.102069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In most binocular 3D detection algorithms, a large number of anchor points need to be selected, which leads to the problem of slow feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on Stereo Recurrent Convolutional Neutral Network (Stereo RCNN), which is called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework is divided into two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and corresponding sparse anchor boxes are generated from the prior information. Left and right anchors are used as a whole to generate a 2D preselection box. In the second stage, a Feature Pyramid Network (FPN) based on key point generation network is used to generate key points, which are combined with stereo regression to generate 3D preselected boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of center depth on the 3D bounding box. Extensive experiments on the challenging Kitti dataset and NuScences dataset show that the proposed method reduces the computational cost while maintaining a high regression rate without any depth information and prior information of position. Compared to other methods, the proposed method has higher efficiency, better robustness and stronger generalization ability.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An anchor-guided 3D target detection algorithm based on stereo RCNN
    Cao J.
    Tao C.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2021, 42 (12): : 191 - 201
  • [2] ESGN: Efficient Stereo Geometry Network for Fast 3D Object Detection
    Gao, Aqi
    Pang, Yanwei
    Nie, Jing
    Shao, Zhuang
    Cao, Jiale
    Guo, Yishun
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2000 - 2009
  • [3] Reinforced Voxel-RCNN: An Efficient 3D Object Detection Method Based on Feature Aggregation*
    Jiang, Jia-ji
    Wan, Hai-bin
    Sun, Hong-min
    Qin, Tuan-fa
    Wang, Zheng-qiang
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (09) : 1228 - 1238
  • [4] SGM3D: Stereo Guided Monocular 3D Object Detection
    Zhou, Zheyuan
    Du, Liang
    Ye, Xiaoqing
    Zou, Zhikang
    Tan, Xiao
    Zhang, Li
    Xue, Xiangyang
    Feng, Jianfeng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10478 - 10485
  • [5] A 3D Circular Object Detection Method Based on Binocular Stereo Vision
    Chen, Zhaoxue
    Li, Mengzhuo
    Yu, Haizhong
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [6] PLUMENet: Efficient 3D Object Detection from Stereo Images
    Wang, Yan
    Yang, Bin
    Hu, Rui
    Liang, Ming
    Urtasun, Raquel
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3383 - 3390
  • [7] Confidence Guided Stereo 3D Object Detection with Split Depth Estimation
    Li, Chengyao
    Ku, Jason
    Waslander, Steven L.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5776 - 5783
  • [8] AFE-RCNN: Adaptive Feature Enhancement RCNN for 3D Object Detection
    Shuang, Feng
    Huang, Hanzhang
    Li, Yong
    Qu, Rui
    Li, Pei
    REMOTE SENSING, 2022, 14 (05)
  • [9] An object boundary detection system based on a 3D stereo monitor
    Zhang, Shuqun
    Furia, Bryan
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVII, 2014, 9217
  • [10] Research on 3D object optimal grasping method based on cascaded Faster RCNN
    Chen D.
    Lin Q.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2019, 40 (04): : 229 - 237