A unified multi-view multi-person tracking framework

被引:0
|
作者
Yang, Fan [1 ]
Odashima, Shigeyuki [1 ]
Yamao, Sosuke [1 ]
Fujimoto, Hiroaki [1 ]
Masui, Shoichi [1 ]
Jiang, Shan [1 ]
机构
[1] Fujitsu Res, Tokyo, Japan
关键词
multi-camera multi-person tracking; pose tracking; footprint tracking; triangulation; spatiotemporal clustering; MULTITARGET;
D O I
10.1007/s41095-023-0334-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Despite significant developments in 3D multi-view multi-person (3D MM) tracking, current frameworks separately target footprint tracking, or pose tracking. Frameworks designed for the former cannot be used for the latter, because they directly obtain 3D positions on the ground plane via a homography projection, which is inapplicable to 3D poses above the ground. In contrast, frameworks designed for pose tracking generally isolate multi-view and multi-frame associations and may not be sufficiently robust for footprint tracking, which utilizes fewer key points than pose tracking, weakening multi-view association cues in a single frame. This study presents a unified multi-view multi-person tracking framework to bridge the gap between footprint tracking and pose tracking. Without additional modifications, the framework can adopt monocular 2D bounding boxes and 2D poses as its input to produce robust 3D trajectories for multiple persons. Importantly, multi-frame and multi-view information are jointly employed to improve association and triangulation. Our framework is shown to provide state-of-the-art performance on the Campus and Shelf datasets for 3D pose tracking, with comparable results on the WILDTRACK and MMPTRACK datasets for 3D footprint tracking.
引用
收藏
页码:137 / 160
页数:24
相关论文
共 50 条
  • [1] A unified multi-view multi-person tracking framework
    Fan Yang
    Shigeyuki Odashima
    Sosuke Yamao
    Hiroaki Fujimoto
    Shoichi Masui
    Shan Jiang
    [J]. Computational Visual Media, 2024, 10 : 137 - 160
  • [2] Feature compression: A framework for multi-view multi-person tracking in visual sensor networks
    Cosar, Serhan
    Cetin, Mujdat
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (05) : 864 - 873
  • [3] Direct Multi-view Multi-person 3D Pose Estimation
    Wang, Tao
    Zhang, Jianfeng
    Cai, Yujun
    Yan, Shuicheng
    Feng, Jiashi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Dynamic Multi-Person Mesh Recovery From Uncalibrated Multi-View Cameras
    Huang, Buzhen
    Shu, Yuan
    Zhang, Tianshu
    Wang, Yangang
    [J]. 2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 710 - 720
  • [5] Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras With Human Semantics
    Huang, Buzhen
    Ju, Jingyi
    Shu, Yuan
    Wang, Yangang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4229 - 4242
  • [6] Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation
    Niu, Zehai
    Lu, Ke
    Xue, Jian
    Wang, Jinbao
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
  • [7] A Unified Framework for Multi-view Spectral Clustering
    Zhong, Guo
    Pun, Chi-Man
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1854 - 1857
  • [8] Shape-aware Multi-Person Pose Estimation from Multi-View Images
    Dong, Zijian
    Song, Jie
    Chen, Xu
    Guo, Chen
    Hilliges, Otmar
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11138 - 11148
  • [9] Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras
    Zhang, Yuxiang
    Li, Zhe
    An, Liang
    Li, Mengcheng
    Yu, Tao
    Liu, Yebin
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5540 - 5549
  • [10] Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo
    Lin, Jiahao
    Lee, Gim Hee
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11881 - 11890