D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence

Cited by: 4
Authors
Lee, Junhyung [1]
Koh, Junho [2]
Lee, Youngwoo [2]
Choi, Jun Won [2]
Affiliations
[1] Hanyang Univ, Dept Future Mobil, Seoul 04763, South Korea
[2] Hanyang Univ, Dept Elect Engn, Seoul 04763, South Korea
Funding
National Research Foundation, Singapore;
Keywords
DOI
10.1109/ICRA48891.2023.10160484
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
LiDAR sensors are widely used for 3D object detection in various mobile robotics applications. LiDAR sensors continuously generate point cloud data in real time. Conventional 3D object detectors detect objects using a set of points acquired over a fixed duration. However, recent studies have shown that the performance of object detection can be further enhanced by utilizing spatio-temporal information obtained from point cloud sequences. In this paper, we propose a new 3D object detector, named D-Align, which can effectively produce strong bird's-eye-view (BEV) features by aligning and aggregating the features obtained from a sequence of point sets. The proposed method includes a novel dual-query co-attention network that uses two types of queries, a target query set (T-QS) and a support query set (S-QS), to update the features of the target and support frames, respectively. D-Align aligns S-QS to T-QS based on the temporal context features extracted from the adjacent feature maps and then aggregates S-QS with T-QS using a gated fusion mechanism. The dual queries are updated through multiple attention layers to progressively enhance the target frame features used to produce the detection results. Our experiments on the nuScenes dataset show that the proposed D-Align method greatly improved the performance of a single frame-based baseline method and significantly outperformed the latest 3D object detectors. Code is available at https://github.com/junhyung-SPALab/D-Align.
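The abstract mentions a gated fusion mechanism for aggregating S-QS with T-QS but does not give its form. The sketch below illustrates one generic way such a gate is often written: a sigmoid gate computed from the concatenated features, used to blend the two inputs element-wise. It is an assumption for illustration only, not the authors' implementation; the function name `gated_fusion`, the `weight`/`bias` parameterization, and the toy vectors are all hypothetical.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(target, support, weight, bias):
    """Blend two equal-length feature vectors with an element-wise gate.

    Hypothetical parameterization (not taken from the paper):
    weight has one row per output channel, each row of length
    2 * len(target), applied to the concatenation [target, support].
    """
    stacked = list(target) + list(support)
    fused = []
    for t, s, row, b in zip(target, support, weight, bias):
        # Gate logit is a linear function of both feature vectors.
        logit = sum(w * x for w, x in zip(row, stacked)) + b
        gate = sigmoid(logit)
        # Gate near 1 keeps the target feature, near 0 keeps the support feature.
        fused.append(gate * t + (1.0 - gate) * s)
    return fused


# Toy example: with zero weights and bias the gate is 0.5 everywhere,
# so the output is the plain average of the two inputs.
target = [1.0, 2.0]
support = [3.0, 0.0]
zero_weight = [[0.0] * 4 for _ in range(2)]
fused = gated_fusion(target, support, zero_weight, [0.0, 0.0])
```

In a real detector the same blend would be applied at every BEV cell, with the gate parameters learned during training; the vectors here stand in for the features at a single cell.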
Pages: 9238-9244
Number of pages: 7