TFEdet: Efficient Multi-Frame 3D Object Detector via Proposal-Centric Temporal Feature Extraction

被引:0
|
作者
Kim, Jongho [1 ]
Sagong, Sungpyo [1 ]
Yi, Kyongsu [1 ]
机构
[1] Seoul Natl Univ, Dept Mech Engn, Seoul 08826, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Proposals; Feature extraction; Point cloud compression; Detectors; Three-dimensional displays; Autonomous vehicles; Transformers; Convolution; Object detection; Laser radar; 3D object detection; multi-frame detection; autonomous driving; LiDAR point cloud; gated recurrent unit;
D O I
10.1109/ACCESS.2024.3482093
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes the Temporal Feature Extraction Detector (TFEdet), a novel deep learning-based 3D multi-frame object detector efficiently utilizing temporal features from consecutive point clouds. To leverage previously processed frames, inter-frame bipartite matching is performed between current detections from a pre-trained single-frame detector and predicted prior detections, while considering the ego-motion. Subsequently, based on inter-frame association, two types of proposed temporal features are accumulated: temporal proposal features, which are aggregated single-frame features of proposals, and inter-frame proposal features, which containing explicit information between frames. These collected temporal features are then temporally encoded in a Gated Recurrent Unit (GRU)-based temporal feature extraction head and added as residuals to the current frame proposals, leading to the final detection. In performance evaluations on the nuScenes dataset, the proposed TFEdet, which processes a relatively smaller number of point clouds, handles more than twice the frames per second compared to previous multi-frame detectors and still demonstrates competitive detection performance through effective utilization of temporal proposal features.
引用
收藏
页码:154526 / 154534
页数:9
相关论文
共 50 条
  • [1] MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection
    Chen, Xuesong
    Shi, Shaoshuai
    Zhu, Benjin
    Cheung, Ka Chun
    Xu, Hang
    Li, Hongsheng
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 680 - 697
  • [2] 3D Object Detection With Multi-Frame RGB-Lidar Feature Alignment
    Ercelik, Emec
    Yurtsever, Ekim
    Knoll, Alois
    IEEE ACCESS, 2021, 9 : 143138 - 143149
  • [3] Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection
    Zhang, Yifan
    Zhu, Zhiyu
    Hou, Junhui
    Wu, Dapeng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10614 - 10628
  • [4] 3D-MAN: 3D Multi-frame Attention Network for Object Detection
    Yang, Zetong
    Zhou, Yin
    Chen, Zhifeng
    Ngiam, Jiquan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1863 - 1872
  • [5] Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
    Wang, Shihao
    Liu, Yingfei
    Wang, Tiancai
    Li, Ying
    Zhang, Xiangyu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3598 - 3608
  • [6] Multi-Sensor Fusion 3D Object Detection Based on Multi-Frame Information
    Wu S.
    Geng J.
    Wu C.
    Yan Z.
    Chen K.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (12): : 1282 - 1289
  • [7] TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection
    Luo, Zhipeng
    Zhang, Gongjie
    Zhou, Changqing
    Liu, Tianrui
    Lu, Shijian
    Pan, Liang
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4219 - 4228
  • [8] Boosting Single-Frame 3D Object Detection by Simulating Multi-Frame Point Clouds
    Zheng, Wu
    Jiang, Li
    Lu, FanBin
    Ye, Yangyang
    Fu, Chi-Wing
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4848 - 4856
  • [9] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
    Zong, Zhuofan
    Jiang, Dongzhi
    Song, Guanglu
    Xue, Zeyue
    Su, Jingyong
    Li, Hongsheng
    Liu, Yu
    Proceedings of the IEEE International Conference on Computer Vision, 2023, : 3758 - 3767
  • [10] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
    Zong, Zhuofan
    Jiang, Dongzhi
    Song, Guanglu
    Xue, Zeyue
    Su, Jingyong
    Li, Hongsheng
    Liu, Yu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3758 - 3767