Interactive Multi-Scale Fusion of 2D and 3D Features for Multi-Object Vehicle Tracking

Cited by: 14

Authors:

Wang, Guangming [1,2]

Peng, Chensheng [1,2]

Gu, Yingying [2]

Zhang, Jinpeng [3]

Wang, Hesheng [1,2]

Affiliations:

[1] Shanghai Jiao Tong Univ, Shanghai Engn Res Ctr Intelligent Control & Manage, Key Lab Marine Intelligent Equipment & Syst, Dept Automat, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China

[2] Beijing Inst Control Engn, Space Optoelect Measurement & Percept Lab, Beijing 100190, Peoples R China

[3] China Aerosp Sci & Ind Corp, X Lab, Acad 2, Beijing 100854, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023 / Vol. 24 / No. 10

Keywords:

Multi object tracking; 3D point clouds; feature fusion; computer vision; deep learning

DOI:

10.1109/TITS.2023.3275954

Chinese Library Classification (CLC) number:

TU [Building Science]

Discipline classification code:

0813

Abstract:

Multiple Object Tracking (MOT) is a significant task in autonomous driving. Relying on a single sensor is not robust enough, however, because any one modality tends to fail in some challenging situations. Texture information from RGB cameras and 3D structure information from Light Detection and Ranging (LiDAR) each have advantages under different circumstances, so fusing features from multiple modalities helps the model learn discriminative features. Effective feature fusion is nontrivial because the two modalities carry completely different kinds of information. Previous fusion methods usually fuse only the top-level features after the backbones have extracted features from each modality. Fusion therefore happens only once, which limits the information interaction between modalities. In this paper, we propose multi-scale interactive query and fusion between pixel-wise and point-wise features to obtain more discriminative features. In addition, an attention mechanism is used to perform soft feature fusion between multiple pixels and points, avoiding the inaccurate matching of previous single pixel-point fusion methods. We introduce PointNet++ to obtain multi-scale deep representations of point clouds and adapt it to our proposed interactive feature fusion between multi-scale features of images and point clouds. Through the interaction module, each modality can integrate more complementary information from the other. We also explore the effectiveness of pre-training on each single modality and fine-tuning the fusion-based model. Our method achieves 90.32% MOTA and 72.44% HOTA on the KITTI benchmark and outperforms other approaches that do not use multi-scale soft feature fusion.
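
The soft fusion idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the form of the attention, so scaled dot-product attention is assumed here, and the function name `soft_fuse`, the feature shapes, and the concatenation step are all hypothetical. The sketch shows one point feature attending over several candidate pixel features instead of being matched to a single pixel; a multi-scale variant would presumably repeat this at each feature scale.

```python
import numpy as np

def soft_fuse(point_feat, pixel_feats):
    """Fuse one point feature with K candidate pixel features (illustrative only).

    point_feat:  (C,)   feature of a single LiDAR point (assumed shape)
    pixel_feats: (K, C) features of K pixels near the point's image projection
    """
    scale = np.sqrt(point_feat.shape[0])
    scores = pixel_feats @ point_feat / scale        # (K,) scaled dot-product similarity
    scores = scores - scores.max()                   # stabilise the exponentials
    weights = np.exp(scores) / np.exp(scores).sum()  # (K,) softmax weights, sum to 1
    attended = weights @ pixel_feats                 # (C,) weighted mix of pixel features
    fused = np.concatenate([point_feat, attended])   # (2C,) point feature plus image context
    return fused, weights

# Toy usage with random features (assumed sizes: C = 8 channels, K = 5 pixels).
rng = np.random.default_rng(0)
point = rng.normal(size=8)
pixels = rng.normal(size=(5, 8))
fused, weights = soft_fuse(point, pixels)
```

Because every candidate pixel contributes in proportion to its weight, a slightly wrong point-to-pixel projection degrades the fused feature gradually rather than replacing it with the feature of the wrong pixel.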

Pages: 10618-10627

Page count: 10
