TFEdet: Efficient Multi-Frame 3D Object Detector via Proposal-Centric Temporal Feature Extraction

被引:0
|
作者
Kim, Jongho [1 ]
Sagong, Sungpyo [1 ]
Yi, Kyongsu [1 ]
机构
[1] Seoul Natl Univ, Dept Mech Engn, Seoul 08826, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Proposals; Feature extraction; Point cloud compression; Detectors; Three-dimensional displays; Autonomous vehicles; Transformers; Convolution; Object detection; Laser radar; 3D object detection; multi-frame detection; autonomous driving; LiDAR point cloud; gated recurrent unit;
D O I
10.1109/ACCESS.2024.3482093
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes the Temporal Feature Extraction Detector (TFEdet), a novel deep learning-based 3D multi-frame object detector efficiently utilizing temporal features from consecutive point clouds. To leverage previously processed frames, inter-frame bipartite matching is performed between current detections from a pre-trained single-frame detector and predicted prior detections, while considering the ego-motion. Subsequently, based on inter-frame association, two types of proposed temporal features are accumulated: temporal proposal features, which are aggregated single-frame features of proposals, and inter-frame proposal features, which containing explicit information between frames. These collected temporal features are then temporally encoded in a Gated Recurrent Unit (GRU)-based temporal feature extraction head and added as residuals to the current frame proposals, leading to the final detection. In performance evaluations on the nuScenes dataset, the proposed TFEdet, which processes a relatively smaller number of point clouds, handles more than twice the frames per second compared to previous multi-frame detectors and still demonstrates competitive detection performance through effective utilization of temporal proposal features.
引用
收藏
页码:154526 / 154534
页数:9
相关论文
共 50 条
  • [31] DASANet: A 3D Object Detector with Density-and-Sparsity Feature Aggregation
    Zhang, Qiang
    Wei, Dongdong
    REMOTE SENSING, 2023, 15 (18)
  • [32] 3D Object Detection Using Multiple-Frame Proposal Features Fusion
    Huang, Minyuan
    Leung, Henry
    Hou, Ming
    SENSORS, 2023, 23 (22)
  • [33] Sparse 3D Reconstruction via Object-Centric Ray Sampling
    Cerkezi, Llukman
    Favaro, Paolo
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 432 - 441
  • [34] Unstructured road parameter cognition for ICVs using multi-frame 3D point clouds
    Xie, Guotao
    Yan, Kangjian
    Wang, Dongsheng
    Sun, Ning
    Gao, Hongbo
    COGNITIVE COMPUTATION AND SYSTEMS, 2021, 3 (02) : 169 - 182
  • [35] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence
    Lee, Junhyung
    Koh, Junho
    Lee, Youngwoo
    Choi, Jun Won
    arXiv, 2022,
  • [36] 3D Object detector: A multiscale region proposal network based on autonomous driving
    Chen, Xiu
    Yang, Shuo
    Li, Yingfei
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [37] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence
    Lee, Junhyung
    Koh, Junho
    Lee, Youngwoo
    Choi, Jun Won
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9238 - 9244
  • [38] MULTI-FRAME CT-VIDEO REGISTRATION FOR 3D AIRWAY-WALL ANALYSIS
    Byrnes, Patrick D.
    Higgins, William E.
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 1638 - 1641
  • [39] A Background Study on Feature Extraction for 2D and 3D Object Models
    Yuan, Xiaobu
    Pachika, Shivani
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 265 - 273
  • [40] Multi-feature Fusion VoteNet for 3D Object Detection
    Wang, Zhoutao
    Xie, Qian
    Wei, Mingqiang
    Long, Kun
    Wang, Jun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)