3ONet: 3-D Detector for Occluded Object Under Obstructed Conditions

被引:18
|
作者
Hoang, Hiep Anh [1 ]
Yoo, Myungsik [2 ]
机构
[1] Soongsil Univ, Dept Informat Commun Convergence Technol, Seoul, South Korea
[2] Soongsil Univ, Sch Elect Engn, Seoul 06978, South Korea
基金
新加坡国家研究基金会;
关键词
3-D object detection; autonomous vehicle; light detection and ranging (LiDAR); point cloud; segmentation;
D O I
10.1109/JSEN.2023.3293515
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The ability to perceive and understand 3-D space is crucial for autonomous driving to effectively navigate their surroundings and make informed decisions. However, deep learning on point clouds is currently in its early stages due to the unique challenges of processing such data with deep neural networks. One of the major challenges lies in accurately detecting partially occluded objects under various practical conditions. To address this problem, we propose 3-D detector for Occluded Object under Obstructed conditions (3ONet), a two-stage light detection and ranging (LiDAR)-based 3-D object detection framework. Leveraging the advantages of the point-voxel-based method, 3ONet efficiently encodes multiscale features, enabling the generation of high-quality 3-D proposals while preserving detailed object shape information. Specifically, we introduce a point reconstruction network module designed to recover the missing 3-D spatial structures of foreground points. In the first stage, 3ONet identifies the regions containing foreground objects using a point segmentation network and combines them with the proposals to reconstruct the occluded object's 3-D point cloud geometry using an encoder-decoder approach. The refinement stage further enhances the performance by rescoring and adjusting the box location based on the enriched spatial shape information. We evaluate the performance of our proposed framework on the KITTI dataset and the Waymo Open dataset, and the results demonstrate its state-of-the-art performance in 3-D object detection.
引用
收藏
页码:18879 / 18892
页数:14
相关论文
共 50 条
  • [31] A LINGUISTIC APPROACH TO 3-D OBJECT RECOGNITION
    KASPRZAK, W
    COMPUTERS & GRAPHICS, 1987, 11 (04) : 427 - 443
  • [32] CubeSLAM: Monocular 3-D Object SLAM
    Yang, Shichao
    Scherer, Sebastian
    IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (04) : 925 - 938
  • [33] 3-D or not 3-D
    Adam Powell
    JOM, 2002, 54 : 22 - 24
  • [34] 3-D OR NOT 3-D
    SMITH, CW
    NEW SCIENTIST, 1984, 102 (1407) : 40 - 44
  • [35] 3-D OR NOT 3-D
    KERBEL, M
    FILM COMMENT, 1980, 16 (06) : 11 - 20
  • [36] 3-D OR NOT 3-D
    Kehr, Dave
    FILM COMMENT, 2010, 46 (01) : 60 - 67
  • [37] Robust-FusionNet: Deep Multimodal Sensor Fusion for 3-D Object Detection Under Severe Weather Conditions
    Zhang, Cheng
    Wang, Hai
    Cai, Yingfeng
    Chen, Long
    Li, Yicheng
    Sotelo, Miguel Angel
    Li, Zhixiong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [38] 3-D or not 3-D
    Powell, A
    JOM-JOURNAL OF THE MINERALS METALS & MATERIALS SOCIETY, 2002, 54 (01): : 22 - 24
  • [39] Orthostereoscopic conditions for 3-D HDTV
    Yamanoue, Hirokazu
    Nagayama, Masaru
    Bitou, Mineo
    Tanada, Jun
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 1998, 52 (03): : 377 - 383
  • [40] A genetic aggregate stereo algorithm for 3-D classification of occluded shapes
    Zaki, M
    El-Ramsisi, A
    Omran, R
    PATTERN RECOGNITION LETTERS, 2000, 21 (05) : 349 - 363