An efficient 3D object detection method based on Fast Guided Anchor Stereo RCNN

被引:9
|
作者
Tao, Chongben [1 ,2 ]
Cao, Chunlin [1 ]
Cheng, Hanjing [1 ]
Gao, Zhen [3 ]
Luo, Xizhao [4 ]
Zhang, Zuofeng [5 ]
Zheng, Sifa [5 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Tsinghua Univ, Suzhou Automobile Res Inst, Suzhou 215134, Peoples R China
[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A, Canada
[4] SOOCHOW Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[5] Tsinghua Univ, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Autonomous driving; Stereo RCNN; Key-point detection; Sparse anchor point;
D O I
10.1016/j.aei.2023.102069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In most binocular 3D detection algorithms, a large number of anchor points need to be selected, which leads to the problem of slow feature extraction. To solve this problem, an anchor-guided 3D object detection algorithm for autonomous driving is proposed based on Stereo Recurrent Convolutional Neutral Network (Stereo RCNN), which is called Fast Guided Anchored Stereo RCNN (FGAS RCNN). The proposed FGAS framework is divided into two stages. In the first stage, a probability map is generated for the left and right input images to determine the foreground position. Sparse anchor points and corresponding sparse anchor boxes are generated from the prior information. Left and right anchors are used as a whole to generate a 2D preselection box. In the second stage, a Feature Pyramid Network (FPN) based on key point generation network is used to generate key points, which are combined with stereo regression to generate 3D preselected boxes. Finally, instance-level disparity estimation is proposed to solve the problem of pixel-level information loss in the original image. Instance-level disparity is combined with instance segmentation masks to improve the accuracy of center depth on the 3D bounding box. Extensive experiments on the challenging Kitti dataset and NuScences dataset show that the proposed method reduces the computational cost while maintaining a high regression rate without any depth information and prior information of position. Compared to other methods, the proposed method has higher efficiency, better robustness and stronger generalization ability.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Efficient 3D Object Detection Models and Evaluation Method for Autonomous Driving
    Lee, Jin-Hee
    Lee, Jae-Keun
    Lee, Joohyun
    Kim, Je-Seok
    Kwon, Soon
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [32] CAF-RCNN: multimodal 3D object detection with cross-attention
    Liu, Junting
    Liu, Deer
    Zhu, Lei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (19) : 6131 - 6146
  • [33] 3D Object Proposals Using Stereo Imagery for Accurate Object Class Detection
    Chen, Xiaozhi
    Kundu, Kaustav
    Zhu, Yukun
    Ma, Huimin
    Fidler, Sanja
    Urtasun, Raquel
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (05) : 1259 - 1272
  • [34] Research on Object Measurement Based on 3D Stereo Vision
    Xia, Xinghua
    Dai, Shilong
    Qi, Hongfeng
    Xu, Zilong
    Wang, Shuang
    Zhang, Mingxu
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 7260 - 7264
  • [35] PG-RCNN: Semantic Surface Point Generation for 3D Object Detection
    Koo, Inyong
    Lee, Inyoung
    Kim, Se-Ho
    Kim, Hee-Seon
    Jeon, Woo-Jin
    Kim, Changick
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18096 - 18105
  • [36] SPS-RCNN: Semantic-Guided Proposal Sampling for 3D Object Detection from LiDAR Point Clouds
    Xu, Hengxin
    Yang, Lei
    Zhao, Shengya
    Tao, Shan
    Tian, Xinran
    Liu, Kun
    SENSORS, 2025, 25 (04)
  • [37] FCNet: Stereo 3D Object Detection with Feature Correlation Networks
    Wu, Yingyu
    Liu, Ziyan
    Chen, Yunlei
    Zheng, Xuhui
    Zhang, Qian
    Yang, Mo
    Tang, Guangming
    ENTROPY, 2022, 24 (08)
  • [38] Joint stereo 3D object detection and implicit surface reconstruction
    Li, Shichao
    Huang, Xijie
    Liu, Zechun
    Cheng, Kwang-Ting
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] Stereo 3D Object Detection Using a Feature Attention Module
    Zhao, Kexin
    Jiang, Rui
    He, Jun
    ALGORITHMS, 2023, 16 (12)
  • [40] Channel-Based Network for Fast Object Detection of 3D LiDAR
    Kwon, SoonSub
    Park, TaeHyoung
    ELECTRONICS, 2020, 9 (07) : 1 - 10