3ONet: 3-D Detector for Occluded Object Under Obstructed Conditions

Cited by: 18

Authors:

Hoang, Hiep Anh [1]

Yoo, Myungsik [2]

Affiliations:

[1] Soongsil Univ, Dept Informat Commun Convergence Technol, Seoul, South Korea

[2] Soongsil Univ, Sch Elect Engn, Seoul 06978, South Korea

Source:

IEEE SENSORS JOURNAL | 2023 / Vol. 23 / Issue 16

Funding:

National Research Foundation of Singapore;

Keywords:

3-D object detection; autonomous vehicle; light detection and ranging (LiDAR); point cloud; segmentation;

DOI:

10.1109/JSEN.2023.3293515

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

The ability to perceive and understand 3-D space is crucial for autonomous vehicles to navigate their surroundings effectively and make informed decisions. However, deep learning on point clouds is still in its early stages because of the unique challenges of processing such data with deep neural networks. One major challenge lies in accurately detecting partially occluded objects under various practical conditions. To address this problem, we propose the 3-D detector for Occluded Object under Obstructed conditions (3ONet), a two-stage light detection and ranging (LiDAR)-based 3-D object detection framework. Leveraging the advantages of the point-voxel-based method, 3ONet efficiently encodes multiscale features, enabling the generation of high-quality 3-D proposals while preserving detailed object shape information. Specifically, we introduce a point reconstruction network module designed to recover the missing 3-D spatial structures of foreground points. In the first stage, 3ONet identifies the regions containing foreground objects using a point segmentation network and combines them with the proposals to reconstruct the occluded object's 3-D point cloud geometry using an encoder-decoder approach. The refinement stage further improves performance by rescoring the boxes and adjusting their locations based on the enriched spatial shape information. We evaluate the proposed framework on the KITTI dataset and the Waymo Open dataset, and the results demonstrate state-of-the-art performance in 3-D object detection.

Citation

Pages: 18879-18892

Page count: 14
