M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Citations: 0
Authors
Liu, Jiaming [1]
Wu, Yue [1]
Gong, Maoguo [1]
Miao, Qiguang [1]
Ma, Wenping [1]
Xu, Cai [1]
Qin, Can [2]
Affiliations
[1] Xidian Univ, Xian, Peoples R China
[2] Northeastern Univ, Boston, MA USA
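Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
CLC number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract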
3D Single Object Tracking (SOT) stands as a forefront task in computer vision, proving essential for applications like autonomous driving. Sparse and occluded data in scene point clouds introduce variations in the appearance of tracked objects, adding complexity to the task. In this research, we unveil M3SOT, a novel 3D SOT framework that synergizes multiple input frames (template sets), multiple receptive fields (continuous contexts), and multiple solution spaces (distinct tasks) in ONE model. Remarkably, M3SOT pioneers modeling temporality, contexts, and tasks directly from point clouds, revisiting the key factors influencing SOT. To this end, we design a transformer-based network centered on point cloud targets in the search area, aggregating diverse contextual representations and propagating target cues by employing historical frames. As M3SOT spans varied processing perspectives, we have streamlined the network, trimming its depth and optimizing its structure, to ensure lightweight and efficient deployment for SOT applications. We posit that, backed by practical construction, M3SOT sidesteps the need for complex frameworks and auxiliary components to deliver sterling results. Extensive experiments on benchmarks such as KITTI, nuScenes, and Waymo Open Dataset demonstrate that M3SOT achieves state-of-the-art performance at 38 FPS. Our code and models are available at https://github.com/ywu0912/TeamCode.git.
Pages: 3630-3638
Page count: 9
Related papers
50 in total
  • [21] MMF-Track: Multi-Modal Multi-Level Fusion for 3D Single Object Tracking
    Li, Zhiheng
    Cui, Yubo
    Lin, Yu
    Fang, Zheng
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01) : 1817 - 1829
  • [22] A Multi-Level Eigenvalue Fusion Algorithm for 3D Multi-Object Tracking
    Liu, Hantao
    Hu, Jianming
    Li, Xingyu
    Peng, Lihui
    INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2022: APPLICATION OF EMERGING TECHNOLOGIES, 2022, : 235 - 245
  • [23] Probabilistic 3D Multi-Modal, Multi-Object Tracking for Autonomous Driving
    Chiu, Hsu-kuang
    Li, Jie
    Ambrus, Rares
    Bohg, Jeannette
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 14227 - 14233
  • [24] Exploiting Multi-Modal Synergies for Enhancing 3D Multi-Object Tracking
    Xu, Xinglong
    Ren, Weihong
    Chen, Xi'ai
    Fan, Huijie
    Han, Zhi
    Liu, Honghai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (10) : 8643 - 8650
  • [25] TFEdet: Efficient Multi-Frame 3D Object Detector via Proposal-Centric Temporal Feature Extraction
    Kim, Jongho
    Sagong, Sungpyo
    Yi, Kyongsu
    IEEE ACCESS, 2024, 12 : 154526 - 154534
  • [26] InterTrack: Interaction Transformer for 3D Multi-Object Tracking
    Willes, John
    Reading, Cody
    Waslander, Steven L.
    2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV, 2023, : 73 - 80
  • [27] A Bayesian framework for multi-cue 3D object tracking
    Giebel, J
    Gavrila, DM
    Schnörr, C
    COMPUTER VISION - ECCV 2004, PT 4, 2004, 2034 : 241 - 252
  • [28] A Bayesian Filter for Multi-View 3D Multi-Object Tracking With Occlusion Handling
    Ong, Jonah
    Ba-Tuong Vo
    Ba-Ngu Vo
    Kim, Du Yong
    Nordholm, Sven
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) : 2246 - 2263
  • [29] A multi-frame based motion estimation for semantic object tracking in the presence of occlusion
    Gao, J
    Kak, A
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2002, : 881 - 884
  • [30] Detection-by-Tracking Boosted Online 3D Multi-Object Tracking
    Chen, Quei-An
    Tsukada, Akihiro
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 295 - 301