Decoupled 3-D object detector

被引：0

作者：

Arafa M. ^{[1
]}

Osama A. ^{[1
,2
]}

Abdelaziz M. ^{[1
,3
]}

Ghoneima M. ^{[1
,4
]}

García F. ^{[5
]}

Maged S.A. ^{[1
,4
]}

机构：

[1] Autotronics Research Lab (ARL), Faculty of Engineering, Ain Shams University, Cairo

[2] Centre of Mobility Research, Faculty of Engineering, Ain Shams University, Cairo

[3] Automotive Engineering Department, Faculty of Engineering, Ain Shams University, Cairo

[4] Mechatronics Engineering Department, Faculty of Engineering, Ain Shams University, Cairo

[5] Intelligent Systems Laboratory, Universidad Carlos III de Madrid, Leganés, Madrid

来源：

International Journal of Vehicle Autonomous Systems | 2023年 / 16卷 / 2-4期

关键词：

3D object detection; autonomous vehicles; bird eye-view; CNN; estimation; perception; point cloud;

D O I：

10.1504/IJVAS.2022.133008

中图分类号：

学科分类号：

摘要：

This paper proposes an efficient cascaded 3-D object detection architecture. Such an architecture decouples the 3-D object detection pipeline to maximise the utilisation of the inherent advantages of RGB images and LiDAR point clouds in order to perform 3-D object detection while maintaining low computational complexity. Our proposed architecture relies on a cascade of two networks, the first leverages the texture density in images and the maturity of state-of-the-art 2-D object detectors to classify and obtain initial region proposals for objects in the scene. These proposals are fed to a lightweight secondary network that leverages the compactness of bird-eye view point cloud representations to perform orientation and size estimation. The 3-D bounding box proposal is constructed by fusing predictions inferred from both networks, as predictions lie on orthogonal planes. Evaluated on the KITTI benchmark data set, we show that the proposed method obtains results on-par with more complex end-to-end 3-D detection methods while greatly reducing computational and memory requirements. This work also presents results from the deployment within a perception pipeline, and analyses challenges faced in deployment within a frontal perception pipeline. © 2022 Inderscience Enterprises Ltd.

引用

页码：143 / 160

页数：17

共 50 条

[21] A SYNTACTIC APPROACH TO 3-D OBJECT REPRESENTATION
LIN, WC
FU, KS
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (03) : 351 - 364
[22] A framework for statistical 3-D object recognition
Paulus, D
Hornegger, J
Niemann, H
PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) : 1153 - 1157
[23] AUTOMATIC OBJECT SEGMENTATION WITH 3-D CAMERAS
Liu, Haowei
Philipose, Matthai
Sun, Ming-Ting
2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 569 - 572
[24] 3-D OBJECT RECOGNITION BY GRAPH MATCHING
LIU, YC
MA, SD
INTELLIGENT AUTONOMOUS SYSTEMS 2, VOLS 1 AND 2, 1989, : 918 - 921
[25] INTERPOLATION TECHNIQUES FOR 3-D OBJECT GENERATION
GUJAR, UG
BHAVSAR, VC
DATAR, NN
COMPUTERS & GRAPHICS, 1988, 12 (3-4) : 541 - 555
[26] A LINGUISTIC APPROACH TO 3-D OBJECT RECOGNITION
KASPRZAK, W
COMPUTERS & GRAPHICS, 1987, 11 (04) : 427 - 443
[27] CubeSLAM: Monocular 3-D Object SLAM
Yang, Shichao
Scherer, Sebastian
IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (04) : 925 - 938
[28] Investigation of accuracy of 3-D representation of a 3-D object shape in the human visual system
Krasilnikov, NN
Mironenko, EP
PERCEPTION, 2005, 34 : 113 - 113
[29] Voxel-RCNN-Complex: An Effective 3-D Point Cloud Object Detector for Complex Traffic Conditions
Wang, Hai
Chen, Zhiyu
Cai, Yingfeng
Chen, Long
Li, Yicheng
Angel Sotelo, Miguel
Li, Zhixiong
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[30] DEVELOPMENT OF 3-D DETECTOR SYSTEM FOR POSITRON CT
SHIMIZU, K
OHMURA, T
WATANABE, M
UCHIDA, H
YAMASHITA, T
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 1988, 35 (01) : 717 - 720

← 1 2 3 4 5 →