TFEdet: Efficient Multi-Frame 3D Object Detector via Proposal-Centric Temporal Feature Extraction

Cited by: 0
Authors
Kim, Jongho [1]
Sagong, Sungpyo [1]
Yi, Kyongsu [1]
Affiliations
[1] Seoul Natl Univ, Dept Mech Engn, Seoul 08826, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Proposals; Feature extraction; Point cloud compression; Detectors; Three-dimensional displays; Autonomous vehicles; Transformers; Convolution; Object detection; Laser radar; 3D object detection; multi-frame detection; autonomous driving; LiDAR point cloud; gated recurrent unit;
DOI
10.1109/ACCESS.2024.3482093
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
This paper proposes the Temporal Feature Extraction Detector (TFEdet), a novel deep learning-based 3D multi-frame object detector that efficiently utilizes temporal features from consecutive point clouds. To leverage previously processed frames, inter-frame bipartite matching is performed between current detections from a pre-trained single-frame detector and predicted prior detections, while accounting for ego-motion. Subsequently, based on this inter-frame association, two types of proposed temporal features are accumulated: temporal proposal features, which are aggregated single-frame features of proposals, and inter-frame proposal features, which contain explicit information between frames. These collected temporal features are then temporally encoded in a Gated Recurrent Unit (GRU)-based temporal feature extraction head and added as residuals to the current-frame proposals, leading to the final detection. In performance evaluations on the nuScenes dataset, the proposed TFEdet, which processes a relatively small number of point clouds, runs at more than twice the frames per second of previous multi-frame detectors and still demonstrates competitive detection performance through effective utilization of temporal proposal features.
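The inter-frame association step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes 2D box centers, a constant-velocity prediction for prior detections, a planar ego-motion given as a translation and yaw change, and a brute-force search over assignments as a stand-in for a proper bipartite matching solver. All names (`to_current_frame`, `match_detections`, `max_dist`) are hypothetical.

```python
import math
from itertools import permutations


def to_current_frame(point, ego_shift, ego_yaw):
    """Map a point from the previous ego frame into the current ego frame.

    ego_shift: position of the current ego origin in the previous frame (assumed).
    ego_yaw:   heading change of the ego vehicle between the frames, in radians.
    """
    dx, dy = point[0] - ego_shift[0], point[1] - ego_shift[1]
    c, s = math.cos(ego_yaw), math.sin(ego_yaw)
    # Apply the inverse rotation R(-yaw) to the shifted point.
    return (c * dx + s * dy, -s * dx + c * dy)


def match_detections(current, prior, ego_shift, ego_yaw, dt, max_dist=2.0):
    """Return sorted (current_idx, prior_idx) pairs of a min-cost one-to-one matching.

    current: list of (x, y) centers in the current ego frame.
    prior:   list of (x, y, vx, vy) in the previous ego frame.
    Pairs farther apart than max_dist are left unmatched.
    """
    # Predict each prior detection forward in time, then compensate ego-motion.
    predicted = [
        to_current_frame((x + vx * dt, y + vy * dt), ego_shift, ego_yaw)
        for (x, y, vx, vy) in prior
    ]
    cost = [[math.dist(c, p) for p in predicted] for c in current]

    n_cur, n_pri = len(current), len(predicted)
    small, large = min(n_cur, n_pri), max(n_cur, n_pri)
    best_pairs, best_cost = [], math.inf
    # Brute force over assignments: only practical for a handful of objects.
    for perm in permutations(range(large), small):
        if n_cur <= n_pri:
            pairs = list(enumerate(perm))                 # (current, prior)
        else:
            pairs = [(c, p) for p, c in enumerate(perm)]  # (current, prior)
        # Cap each cost at max_dist so far-apart pairs do not dominate.
        total = sum(min(cost[c][p], max_dist) for c, p in pairs)
        if total < best_cost:
            best_pairs, best_cost = pairs, total
    return sorted((c, p) for c, p in best_pairs if cost[c][p] < max_dist)


# Ego vehicle moved 1 m forward without turning; frames are 0.5 s apart.
prior = [(10.0, 0.0, 0.0, 0.0), (5.0, 2.0, 2.0, 0.0)]
current = [(5.1, 2.0), (9.1, 0.1), (30.0, 30.0)]
matches = match_detections(current, prior, ego_shift=(1.0, 0.0), ego_yaw=0.0, dt=0.5)
print(matches)
```

In this toy scene the third current detection has no nearby prior detection and stays unmatched. For realistic object counts, a polynomial-time assignment solver would replace the permutation loop.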
Pages: 154526 - 154534
Page count: 9
Related papers
50 in total
  • [21] Beyond Appearance: Multi-Frame Spatio-Temporal Context Memory Networks for Efficient and Robust Video Object Segmentation
    Dang, Jisheng
    Zheng, Huicheng
    Xu, Xiaohao
    Wang, Longguang
    Guo, Yulan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4853 - 4866
  • [22] 3D object feature extraction and classification using 3D MF-DFA
    Wang, Jian
    Han, Ziwei
    Jiang, Wenjing
    Kim, Junseok
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 232
  • [23] Single and multi-frame auto-calibration for 3D endoscopy with differential rendering
    Furukawa, Ryo
    Sagawa, Ryusuke
    Oka, Shiro
    Tanaka, Shinji
    Kawasaki, Hiroshi
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [24] NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation
    Zheng, Zehan
    Wu, Danni
    Lu, Ruisi
    Lu, Fan
    Chen, Guang
    Jiang, Changjun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 909 - 918
  • [25] Efficient representation and feature extraction for neural network-based 3D object pose estimation
    Kouskouridas, Rigas
    Gasteratos, Antonios
    Emmanouilidis, Christos
    NEUROCOMPUTING, 2013, 120 : 90 - 100
  • [26] EFFICIENT IMPLEMENTATION OF 3D FILTER FOR MOVING OBJECT EXTRACTION
    Chen Ken Wang Ping Wang Lunyao (Institute of Circuits and Systems
    Journal of Electronics(China), 2007, (06) : 792 - 797
  • [27] Feature extraction for 3D object detection using integral imaging
    Aloni, Doron
    Yitzhaky, Yitzhak
    IMAGE RECONSTRUCTION FROM INCOMPLETE DATA VIII, 2015, 9600
  • [28] LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving
    Meyer, Gregory P.
    Laddha, Ankit
    Kee, Eric
    Vallespi-Gonzalez, Carlos
    Wellington, Carl K.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12669 - 12678
  • [29] A Fast and Robust Framework for 3D/2D Model to Multi-Frame Fluoroscopy Registration
    Saadat, Shabnam
    Asikuzzaman, Md.
    Pickering, Mark R.
    Perriman, Diana M.
    Scarvell, Jennie M.
    Smith, Paul N.
    IEEE ACCESS, 2021, 9 : 134223 - 134239
  • [30] ObjectFusion: Multi-modal 3D Object Detection with Object-Centric Fusion
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Ngo, Chong-Wah
    Mei, Tao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18021 - 18030