Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS

被引:42
|
作者
Chen, Long [1 ]
Ai, Haizhou [1 ]
Chen, Rui [1 ]
Zhuang, Zijie [1 ]
Liu, Shuang [2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] AiFi Inc, Sunnyvale, CA USA
关键词
D O I
10.1109/CVPR42600.2020.00334
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating 3D poses of multiple humans in real-time is a classic but still challenging task in computer vision. Its major difficulty lies in the ambiguity in cross-view association of 2D poses and the huge state space when there are multiple people in multiple views. In this paper, we present a novel solution for multi-human 3D pose estimation from multiple calibrated camera views. It takes 2D poses in different camera coordinates as inputs and aims for the accurate 3D poses in the global coordinate. Unlike previous methods that associate 2D poses among all pairs of views from scratch at every frame, we exploit the temporal consistency in videos to match the 2D inputs with 3D poses directly in 3-space. More specifically, we propose to retain the 3D pose for each person and update them iteratively via the cross-view multi-human tracking. This novel formulation improves both accuracy and efficiency, as we demonstrated on widely-used public datasets. To further verify the scalability of our method, we propose a new large-scale multi-human dataset with 12 to 28 camera views. Without bells and whistles, our solution achieves 154 FPS on 12 cameras and 34 FPS on 28 cameras, indicating its ability to handle large-scale real-world applications. The proposed dataset will be released at https://github.com/longcw/crossview_3d_pose_tracking.
引用
收藏
页码:3276 / 3285
页数:10
相关论文
共 50 条
  • [1] Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking
    Chu, Hau
    Lee, Jia-Hong
    Lee, Yao-Chih
    Hsu, Ching-Hsien
    Li, Jia-Da
    Chen, Chu-Song
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1472 - 1481
  • [2] Cross View Fusion for 3D Human Pose Estimation
    Qiu, Haibo
    Wang, Chunyu
    Wang, Jingdong
    Wang, Naiyan
    Zeng, Wenjun
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4341 - 4350
  • [3] Cross-View Self-fusion for Self-supervised 3D Human Pose Estimation in the Wild
    Kim, Hyun-Woo
    Lee, Gun-Hee
    Oh, Myeong-Seok
    Lee, Seong-Whan
    [J]. COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 193 - 210
  • [4] A Strong Geometric Baseline for Cross-View Matching of Multi-person 3D Pose Estimation from Multi-view Images
    Dehaeck, Sam
    Domken, Corentin
    Bey-Temsamani, Abdellatif
    Abedrabbo, Gabriel
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 77 - 88
  • [5] Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video
    Sun, Cheng
    Thomas, Diego
    Kawasaki, Hiroshi
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5959 - 5964
  • [6] Human pose estimation based on cross-view feature fusion
    Sun, Dandan
    Wang, Siqi
    Xia, Hailun
    Zhang, Changan
    Gao, Jianlong
    Mao, Mingyu
    [J]. VISUAL COMPUTER, 2024, 40 (09): : 6581 - 6597
  • [7] Single-view multi-human pose estimation by attentive cross-dimension matching
    Tian, Wei
    Gao, Zhong
    Tan, Dayi
    [J]. FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [8] Weakly-Supervised 3D Human Pose Estimation With Cross-View U-Shaped Graph Convolutional Network
    Hua, Guoliang
    Liu, Hong
    Li, Wenhao
    Zhang, Qian
    Ding, Runwei
    Xu, Xin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1832 - 1843
  • [9] Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation
    Niu, Zehai
    Lu, Ke
    Xue, Jian
    Wang, Jinbao
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
  • [10] Multi-view Pictorial Structures for 3D Human Pose Estimation
    Amin, Sikandar
    Andriluka, Mykhaylo
    Rohrbach, Marcus
    Schiele, Bernt
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,