Spatial Attention Frustum: A 3D Object Detection Method Focusing on Occluded Objects

Authors
He, Xinglei [1 ]
Zhang, Xiaohan [1 ]
Wang, Yichun [1 ]
Ji, Hongzeng [1 ]
Duan, Xiuhui [1 ]
Guo, Fen [1 ]
Affiliation
[1] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China
Keywords
visual attention mechanism; occluded object detection; multi-sensor fusion; 3D object detection; autonomous vehicles; DEPTH;
DOI
10.3390/s22062366
Chinese Library Classification number
O65 [Analytical Chemistry];
Subject classification codes
070302 ; 081704 ;
Abstract
Accurately perceiving occluded objects is a challenging problem for autonomous vehicles. Human vision quickly locates the important object regions in a complex scene and only roughly analyses or ignores the rest, a behaviour known as the visual attention mechanism. The perception system of an autonomous vehicle, by contrast, does not know which part of the point cloud lies in the region of interest. It is therefore worthwhile to explore how the visual attention mechanism can be used in the perception system of autonomous driving. In this paper, we propose the spatial attention frustum model to address object occlusion in 3D object detection. The spatial attention frustum suppresses unimportant features and allocates limited neural computing resources to the critical parts of the scene, which gives higher-level perceptual reasoning tasks more relevant input that is easier to process. To ensure that our method still reasons well about occluded objects for which only a partial structure is visible, we propose a local feature aggregation module that captures more complex local features of the point cloud. Finally, we discuss the projection constraint between the 3D bounding box and the 2D bounding box and propose a joint anchor box projection loss function, which helps to improve the overall performance of our method. Results on the KITTI dataset show that the proposed method effectively improves the detection accuracy of occluded objects. Our method achieves 89.46%, 79.91% and 75.53% detection accuracy at the easy, moderate and hard difficulty levels of the car category, with a 6.97% performance improvement in the hard category, which has a high degree of occlusion. Our one-stage method does not rely on a separate refinement stage, yet its accuracy is comparable to that of two-stage methods.
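The projection constraint mentioned in the abstract can be illustrated with a minimal sketch: the eight corners of a 3D box are projected through a camera matrix, and the rectangle enclosing them is compared with a 2D box. The abstract does not give the form of the joint anchor box projection loss, so the L1 penalty below is only an assumed stand-in, and the function names (`box3d_corners`, `project_to_2d_box`, `projection_l1`), the KITTI-style camera frame and the toy matrix `P` are all hypothetical choices made for illustration.

```python
import numpy as np

def box3d_corners(center, size, yaw):
    """Return the 8 corners (8, 3) of a 3D box.

    Assumed convention (not taken from the paper): camera frame with
    x right, y down, z forward; `center` is the geometric centre;
    `size` is (length, height, width); `yaw` rotates about the y axis.
    """
    l, h, w = size
    x = np.array([1, 1, -1, -1, 1, 1, -1, -1]) * l / 2
    y = np.array([1, 1, 1, 1, -1, -1, -1, -1]) * h / 2
    z = np.array([1, -1, -1, 1, 1, -1, -1, 1]) * w / 2
    c, s = np.cos(yaw), np.sin(yaw)
    rot = np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])
    return (rot @ np.vstack([x, y, z])).T + np.asarray(center, dtype=float)

def project_to_2d_box(corners, P):
    """Project corners with a 3x4 matrix P and return the enclosing
    2D box [xmin, ymin, xmax, ymax]. All corners must have z > 0."""
    homo = np.hstack([corners, np.ones((corners.shape[0], 1))])
    uvw = homo @ P.T
    uv = uvw[:, :2] / uvw[:, 2:3]
    return np.concatenate([uv.min(axis=0), uv.max(axis=0)])

def projection_l1(box2d_projected, box2d_reference):
    """Mean absolute difference between two 2D boxes (assumed penalty)."""
    diff = np.asarray(box2d_projected) - np.asarray(box2d_reference)
    return float(np.abs(diff).mean())

# Toy pinhole camera: focal length 100 px, principal point (50, 50).
P = np.array([[100.0, 0.0, 50.0, 0.0],
              [0.0, 100.0, 50.0, 0.0],
              [0.0, 0.0, 1.0, 0.0]])
corners = box3d_corners(center=(0.0, 0.0, 10.0), size=(2.0, 2.0, 2.0), yaw=0.0)
box2d = project_to_2d_box(corners, P)
loss = projection_l1(box2d, box2d)
```

A predicted 3D box whose projection drifts away from the 2D detection box produces a larger penalty, which is the kind of consistency such a loss is meant to encourage.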
Pages: 18