Temporal Attention for Robust Multiple Object Pose Tracking

被引:0
|
作者
Li, Zhongluo [1 ]
Yoshimoto, Junichiro [2 ]
Ikeda, Kazushi [1 ]
机构
[1] Nara Inst Sci & Technol, Nara 6300192, Japan
[2] Fujita Hlth Univ, Sch Med, Toyoake, Aichi 4701192, Japan
关键词
Pose Estimation; Vision Transformer; Temporal Information;
D O I
10.1007/978-981-99-8070-3_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating the pose of multiple objects has improved substantially since deep learning became widely used. However, the performance deteriorates when the objects are highly similar in appearance or when occlusions are present. This issue is usually addressed by leveraging temporal information that takes previous frames as priors to improve the robustness of estimation. Existing methods are either computationally expensive by using multiple frames, or are inefficiently integrated with ad hoc procedures. In this paper, we perform computationally efficient object association between two consecutive frames via attention through a video sequence. Furthermore, instead of heatmap-based approaches, we adopt a coordinate classification strategy that excludes post-processing, where the network is built in an end-to-end fashion. Experiments on real data show that our approach achieves state-of-the-art results on Pose-Track datasets.
引用
收藏
页码:551 / 561
页数:11
相关论文
共 50 条
  • [41] Multiple Object Tracking Based on Robust Network Flow Model
    Yi, Shuangyan
    Wang, Hongpeng
    He, Zhenyu
    Li, Yi
    Chen, Wen-Sheng
    PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2016, : 299 - 303
  • [42] Robust object tracking with multiple basic mean shift tracker
    Qi, Yuanchen
    Wu, Chengdong
    Chen, Dongyue
    Yu, Xiaosheng
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2012), 2012,
  • [43] TSTrack: A Robust Object Tracking Framework Integrated Temporal and Spatial Features
    Mu, Qi
    Wang, Xueqian
    He, Zuohui
    Li, Zhanli
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XII, 2025, 15042 : 344 - 360
  • [44] Pose Selection for Underwater Object Detection, Pose Estimation, and Tracking
    Teigland, Hakon
    Hassani, Vahid
    Tore Moller, Ments
    IEEE ACCESS, 2024, 12 : 142331 - 142342
  • [45] Spatio-Temporal Point Process for Multiple Object Tracking
    Wang, Tao
    Chen, Kean
    Lin, Weiyao
    See, John
    Zhang, Zenghui
    Xu, Qian
    Jia, Xia
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 1777 - 1788
  • [46] SPATIO-TEMPORAL CORRELATION LEARNING FOR MULTIPLE OBJECT TRACKING
    Jian, Yajun
    Zhuang, Chihui
    He, Wenyan
    Du, Kaiwen
    Lu, Yang
    Wang, Hanzi
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6170 - 6174
  • [47] Robust Adaptive Pose Tracking Control for Multiple Spacecraft Rendezvous with Collision Avoidance
    Jiang, Renjian
    Sun, Zewen
    Sun, Liang
    Jiang, Jingjing
    2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 1016 - 1021
  • [48] Exploiting spatio-temporal constraints for robust 2D pose tracking
    Rogez, Gregory
    Rius, Ignasi
    Martinez-del-Rincon, Jesus
    Orrite, Carlos
    HUMAN MOTION - UNDERSTANDING, MODELING, CAPTURE AND ANIMATION, PROCEEDINGS, 2007, 4814 : 58 - +
  • [49] Pose robust face tracking by combining view-based AAMs and temporal filters
    Huang, Chen
    Ding, Xiaoqing
    Fang, Chi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (07) : 777 - 792
  • [50] Bidirectional Temporal Pose Matching for Tracking
    Fang, Yichuan
    Shi, Qingxuan
    Yang, Zhen
    ELECTRONICS, 2024, 13 (02)