ShaSTA: Modeling Shape and Spatio-Temporal Affinities for 3D Multi-Object Tracking

被引:3
|
作者
Sadjadpour, Tara [1 ]
Li, Jie [2 ]
Ambrus, Rares [3 ]
Bohg, Jeannette [1 ]
机构
[1] Stanford Univ, Sch Engn, Comp Sci Dept, Stanford, CA 94305 USA
[2] NVIDIA, Santa Clara, CA 95051 USA
[3] Toyota Res Inst, Los Altos, CA 94022 USA
来源
关键词
Computer vision for transportation; deep learning for visual perception; visual tracking;
D O I
10.1109/LRA.2023.3323124
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Multi-object tracking (MOT) is a cornerstone capability of any robotic system. Tracking quality is largely dependent on the quality of input detections. In many applications, such as autonomous driving, it is preferable to over-detect objects to avoid catastrophic outcomes due to missed detections. As a result, current state-of-the-art 3D detectors produce high rates of false-positives to ensure a low number of false-negatives. This can negatively affect tracking by making data association and track lifecycle management more challenging. Additionally, occasional false-negative detections due to difficult scenarios like occlusions can harm tracking performance. To address these issues in a unified framework, we propose ShaSTA which learns shape and spatio-temporal affinities between tracks and detections in consecutive frames. The affinity is a probabilistic matching that leads to robust data association, track lifecycle management, false-positive elimination, false-negative propagation, and sequential track confidence refinement. We offer the first self-contained framework that addresses all aspects of the 3D MOT problem. We quantitatively evaluate ShaSTA on the nuScenes tracking benchmark with 5 metrics, including the most common tracking accuracy metric called AMOTA, to demonstrate how ShaSTA may impact the ultimate goal of an autonomous mobile agent. ShaSTA achieves 1st place amongst LiDAR-only trackers that use CenterPoint detections.
引用
收藏
页码:4273 / 4280
页数:8
相关论文
共 50 条
  • [1] Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking
    Pang, Ziqi
    Li, Jie
    Tokmakov, Pavel
    Chen, Dian
    Zagoruyko, Sergey
    Wang, Yu-Xiong
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17928 - 17938
  • [2] Learning Spatio-Temporal Information for Multi-Object Tracking
    Wei, Jian
    Yang, Mei
    Liu, Feng
    [J]. IEEE ACCESS, 2017, 5 : 3869 - 3877
  • [3] STMT: Spatio-temporal memory transformer for multi-object tracking
    Gu, Songbo
    Ma, Jianxin
    Hui, Guancheng
    Xiao, Qiyang
    Shi, Wentao
    [J]. APPLIED INTELLIGENCE, 2023, 53 (20) : 23426 - 23441
  • [4] STMT: Spatio-temporal memory transformer for multi-object tracking
    Songbo Gu
    Jianxin Ma
    Guancheng Hui
    Qiyang Xiao
    Wentao Shi
    [J]. Applied Intelligence, 2023, 53 : 23426 - 23441
  • [5] STAT: Multi-Object Tracking Based on Spatio-Temporal Topological Constraints
    Zhang, Junjie
    Wang, Mingyan
    Jiang, Haoran
    Zhang, Xinyu
    Yan, Chenggang
    Zeng, Dan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4445 - 4457
  • [6] Spatio-Temporal Correlation Graph for Association Enhancement in Multi-object Tracking
    Zhong, Zhijie
    Sheng, Hao
    Zhang, Yang
    Wu, Yubin
    Chen, Jiahui
    Ke, Wei
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 394 - 405
  • [7] STTracker: Spatio-Temporal Tracker for 3D Single Object Tracking
    Cui, Yubo
    Li, Zhiheng
    Fang, Zheng
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4967 - 4974
  • [8] Wildlife 3D multi-object tracking
    Klasen, Morris
    Steinhage, Volker
    [J]. ECOLOGICAL INFORMATICS, 2022, 71
  • [9] Spatio-temporal object detection by deep learning: Video-interlacing to improve multi-object tracking
    Mhalla, Ala
    Chateau, Thierry
    Ben Amara, Najoua Essoukri
    [J]. IMAGE AND VISION COMPUTING, 2019, 88 : 120 - 131
  • [10] Efficient Multi-object Detection for Complexity Spatio-Temporal Scenes
    Wang, Kai
    Song, Xiangyu
    Sun, Shijie
    Zhao, Juan
    Xu, Cai
    Song, Huansheng
    [J]. WEB AND BIG DATA, PT IV, APWEB-WAIM 2023, 2024, 14334 : 186 - 200