Motion-to-Matching: A Mixed Paradigm for 3D Single Object Tracking

Cited by: 1
Authors
Li, Zhiheng [1 ,2 ,3 ]
Lin, Yu [1 ]
Cui, Yubo [1 ]
Li, Shuo [1 ]
Fang, Zheng [1 ,2 ,3 ]
Affiliations
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Natl Frontiers Sci Ctr Ind Intelligence & Syst Opt, Shenyang 110819, Peoples R China
[3] Northeastern Univ, Key Lab Data Analyt & Optimizat Smart Ind, Minist Educ, Shenyang 110819, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Target tracking; Three-dimensional displays; Feature extraction; Point cloud compression; Transformers; Object tracking; Image matching; 3D object tracking; deep learning; point clouds;
DOI
10.1109/LRA.2023.3347143
Chinese Library Classification
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
3D single object tracking with LiDAR points is an important task in computer vision. Previous methods usually adopt either a matching-based or a motion-centric paradigm to estimate the current target state. However, the former relies on appearance matching and is therefore sensitive to similar distractors and to the sparsity of point clouds, while the latter usually focuses on short-term motion cues (e.g., two frames) and ignores the target's long-term motion pattern. To address these issues, we propose a two-stage mixed paradigm, named MTM-Tracker, which combines motion modeling and feature matching in a single network. Specifically, in the first stage, we exploit the continuous historical boxes as a motion prior and propose an encoder-decoder structure to coarsely locate the target. Then, in the second stage, we introduce a feature interaction module to extract motion-aware features from consecutive point clouds and match them to refine the target movement as well as regress other target states. Extensive experiments validate that our paradigm achieves competitive performance on large-scale datasets (70.9% on KITTI and 51.70% on NuScenes).
Pages: 1468-1475
Number of pages: 8