Decoupled 3-D object detector

Authors
Arafa M. [1]
Osama A. [1, 2]
Abdelaziz M. [1, 3]
Ghoneima M. [1, 4]
García F. [5]
Maged S.A. [1, 4]
Affiliations
[1] Autotronics Research Lab (ARL), Faculty of Engineering, Ain Shams University, Cairo
[2] Centre of Mobility Research, Faculty of Engineering, Ain Shams University, Cairo
[3] Automotive Engineering Department, Faculty of Engineering, Ain Shams University, Cairo
[4] Mechatronics Engineering Department, Faculty of Engineering, Ain Shams University, Cairo
[5] Intelligent Systems Laboratory, Universidad Carlos III de Madrid, Leganés, Madrid
Keywords
3D object detection; autonomous vehicles; bird's-eye view; CNN; estimation; perception; point cloud
DOI
10.1504/IJVAS.2022.133008
Abstract
This paper proposes an efficient cascaded 3-D object detection architecture. The architecture decouples the 3-D object detection pipeline to exploit the respective strengths of RGB images and LiDAR point clouds while keeping computational complexity low. It relies on a cascade of two networks: the first leverages the texture density of images and the maturity of state-of-the-art 2-D object detectors to classify objects in the scene and obtain initial region proposals. These proposals are fed to a lightweight secondary network that leverages the compactness of bird's-eye-view point cloud representations to estimate orientation and size. The 3-D bounding box proposal is constructed by fusing the predictions inferred from both networks, since these predictions lie on orthogonal planes. Evaluated on the KITTI benchmark data set, the proposed method obtains results on par with more complex end-to-end 3-D detection methods while greatly reducing computational and memory requirements. This work also presents results from deployment within a frontal perception pipeline and analyses the challenges encountered. © 2022 Inderscience Enterprises Ltd.
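The fusion step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the two predictions are combined, so the sketch assumes a pinhole camera and takes the object's depth from the bird's-eye-view centre. The names Box2D, BoxBEV, Box3D, fuse, fy and cy are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Box2D:
    """Image-plane detection in pixels (hypothetical first-stage output)."""
    label: str
    u_min: float
    v_min: float
    u_max: float
    v_max: float


@dataclass
class BoxBEV:
    """Bird's-eye-view estimate in metres and radians (hypothetical second-stage output)."""
    x: float       # lateral position of the box centre
    z: float       # forward distance of the box centre
    length: float
    width: float
    yaw: float


@dataclass
class Box3D:
    """Fused 3-D box in camera coordinates (y axis pointing down)."""
    label: str
    x: float
    y: float
    z: float
    length: float
    width: float
    height: float
    yaw: float


def fuse(det: Box2D, bev: BoxBEV, fy: float, cy: float) -> Box3D:
    """Combine an image-plane box with a BEV box into one 3-D box.

    Assumption: pinhole camera with vertical focal length fy and principal
    point row cy; the object's depth is taken from the BEV centre.
    """
    if bev.z <= 0:
        raise ValueError("object must be in front of the camera")
    # Back-project the top and bottom image rows to metric offsets at depth z.
    y_top = (det.v_min - cy) * bev.z / fy
    y_bottom = (det.v_max - cy) * bev.z / fy
    return Box3D(
        label=det.label,           # class comes from the image branch
        x=bev.x,                   # ground-plane position comes from the BEV branch
        y=0.5 * (y_top + y_bottom),
        z=bev.z,
        length=bev.length,
        width=bev.width,
        height=y_bottom - y_top,   # vertical extent comes from the image branch
        yaw=bev.yaw,
    )
```

In this sketch the image branch supplies the class and the vertical extent, while the bird's-eye-view branch supplies the ground-plane position, footprint and heading, which is one way to read the abstract's remark that the two sets of predictions lie on orthogonal planes.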
Pages: 143-160
Page count: 17