Interactive Multi-Scale Fusion of 2D and 3D Features for Multi-Object Vehicle Tracking

被引:14
|
作者
Wang, Guangming [1 ,2 ]
Peng, Chensheng [1 ,2 ]
Gu, Yingying [2 ]
Zhang, Jinpeng [3 ]
Wang, Hesheng [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Engn Res Ctr Intelligent Control & Manage, Key Lab Marine Intelligent Equipment & Syst, Dept Automat,Key Lab Syst Control & Informat Proc,, Shanghai 200240, Peoples R China
[2] Beijing Inst Control Engn, Space Optoelect Measurement & Percept Lab, Beijing 100190, Peoples R China
[3] China Aerosp Sci & Ind Corp, X Lab, Acad 2, Beijing 100854, Peoples R China
关键词
Multi object tracking; 3D point clouds; feature fusion; computer vision; deep learning;
D O I
10.1109/TITS.2023.3275954
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Multiple Object Tracking (MOT) is a significant task in autonomous driving. Nonetheless, relying on one single sensor is not robust enough, because one modality tends to fail in some challenging situations. Texture information from RGB cameras and 3D structure information from Light Detection and Ranging (LiDAR) have respective advantages under different circumstances. Therefore, feature fusion from multiple modalities contributes to the learning of discriminative features. However, it is nontrivial to achieve effective feature fusion due to the completely distinct information modality. Previous fusion methods usually fuse the top-level features after the backbones extract the features from different modalities. The feature fusion happens solely once, which limits the information interaction between different modalities. In this paper, we propose multiscale interactive query and fusion between pixel-wise and point-wise features to obtain more discriminative features. In addition, an attention mechanism is utilized to conduct soft feature fusion between multiple pixels and points to avoid inaccurate match problems of previous single pixel-point fusion methods. We introduce PointNet++ to obtain multi-scale deep representations of point clouds and make it adaptive to our proposed interactive feature fusion between multi-scale features of images and point clouds. Through the interaction module, each modality can integrate more complementary information from the other modality. Besides, we explore the effectiveness of pre-training on each single modality and fine-tuning on the fusion-based model. Our method can achieve 90.32% MOTA and 72.44% HOTA on the KITTI benchmark and outperform other approaches without using multi-scale soft feature fusion.
引用
收藏
页码:10618 / 10627
页数:10
相关论文
共 50 条
  • [21] Motion Compensation Optimization Method for 3D Multi-Object Tracking
    Wang S.-H.
    Zhang Y.
    Shen J.-N.
    Ji J.-M.
    Zhang Y.-Y.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (02): : 528 - 539
  • [22] 3D Multi-Object Tracking: A Baseline and New Evaluation Metrics
    Weng, Xinshuo
    Wang, Jianren
    Held, David
    Kitani, Kris
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10359 - 10366
  • [23] Learnable Online Graph Representations for 3D Multi-Object Tracking
    Zaech, Jan-Nico
    Liniger, Alexander
    Dai, Dengxin
    Danelljan, Martin
    Van Gool, Luc
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 5103 - 5110
  • [24] A 3D Multi-Object Tracking Based on Bounding Box and Depth
    Feng, Chun-Hao
    Chang, Jen-Yuan
    PROCEEDINGS OF THE ASME 2021 30TH CONFERENCE ON INFORMATION STORAGE AND PROCESSING SYSTEMS (ISPS2021), 2021,
  • [25] FANTrack: 3D Multi-Object Tracking with Feature Association Network
    Baser, Erkan
    Balasubramanian, Venkateshwaran
    Bhattacharyya, Prarthana
    Czarnecki, Krzysztof
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 1426 - 1433
  • [26] Exploring Simple 3D Multi-Object Tracking for Autonomous Driving
    Luo, Chenxu
    Yang, Xiaodong
    Yuille, Alan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10468 - 10477
  • [27] Automated Vehicle Multi-Object Tracking at Scale with CAN
    Nice, Matthew Walter
    Gloudemans, Derek
    Work, Dan
    2022 13TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2022), 2022, : 292 - 293
  • [28] Multi-Scale PointPillars 3D Object Detection Network
    Ya, Hang
    Luo, Guiming
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 174 - 179
  • [29] PF-MOT: Probability Fusion Based 3D Multi-Object Tracking for Autonomous Vehicles
    Wen, Tao
    Zhang, Yanyong
    Freris, Nikolaos M.
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
  • [30] A Bayesian Filter for Multi-View 3D Multi-Object Tracking With Occlusion Handling
    Ong, Jonah
    Ba-Tuong Vo
    Ba-Ngu Vo
    Kim, Du Yong
    Nordholm, Sven
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) : 2246 - 2263