Unsupervised Multi-view Multi-person 3D Pose Estimation Using Reprojection Error

Authors
de Franca Silva, Diogenes Wallis [1]
Do Monte Lima, Joao Paulo Silva [1,2]
Macedo, David [3]
Zanchettin, Cleber [3]
Thomas, Diego Gabriel Francis [4]
Uchiyama, Hideaki [5]
Teichrieb, Veronica [1]
Affiliations
[1] Univ Fed Pernambuco, Centro Informat, Voxar Labs, Recife, PE, Brazil
[2] Univ Fed Rural Pernambuco, Dept Computacao, Visual Comp Lab, Recife, PE, Brazil
[3] Univ Fed Pernambuco, Centro Informat, Recife, PE, Brazil
[4] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka, Japan
[5] Nara Inst Sci & Technol, Grad Sch Sci & Technol, Nara, Japan
Keywords
3D human pose estimation; Unsupervised learning; Deep learning; Reprojection error;
DOI
10.1007/978-3-031-15934-3_40
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This work addresses multi-view multi-person 3D pose estimation in synchronized and calibrated camera views. Recent approaches estimate neural network weights in a supervised way; they rely on ground-truth annotated datasets to compute the loss function and optimize the network weights. However, manually labeling ground-truth datasets is labor-intensive, expensive, and prone to errors. Consequently, it is preferable not to rely heavily on labeled datasets. This work proposes an unsupervised approach to estimating 3D human poses that requires only an off-the-shelf 2D pose estimation method and the intrinsic and extrinsic camera parameters. Our approach uses reprojection error as the loss function instead of comparing the predicted 3D pose with the ground truth. First, we estimate the 3D pose of each person using the plane sweep stereo approach, in which the depth of each 2D joint of each person is estimated in a selected target view. The estimated 3D pose is then projected onto each of the other views using the camera parameters. Finally, the 2D reprojection error in the image plane is computed by comparing each projected pose with the estimated 2D pose of the same person in that view. The 2D poses that correspond to the same person are identified using virtual depth planes: each 3D pose is projected onto the reference view and compared against the 2D poses detected there to find the nearest one. Our proposed method learns to estimate 3D pose in an end-to-end unsupervised manner and does not require any manual parameter tuning, yet it achieves results close to state-of-the-art supervised methods on a public dataset. On the Campus dataset, our method is only 5.8 percentage points below the fully supervised state-of-the-art method and only 5.1 percentage points below the best geometric approach.
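The projection, reprojection-error, and nearest-pose steps described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not give the exact loss formula, so the sketch assumes a zero-skew pinhole camera with the convention x ~ K(RX + t) and a mean Euclidean pixel distance over joints. The function names `project`, `reprojection_error`, and `match_nearest_pose` are hypothetical, and the depth estimation by plane sweep stereo is not shown.

```python
import math


def project(point, K, R, t):
    """Project a 3D world point to pixel coordinates.

    Assumption: zero-skew pinhole model, x ~ K (R X + t), where K is the
    3x3 intrinsic matrix and (R, t) are the extrinsic rotation and translation.
    """
    # World coordinates -> camera coordinates.
    cam = [sum(R[i][j] * point[j] for j in range(3)) + t[i] for i in range(3)]
    # Perspective division, then focal lengths and principal point.
    u = K[0][0] * cam[0] / cam[2] + K[0][2]
    v = K[1][1] * cam[1] / cam[2] + K[1][2]
    return (u, v)


def reprojection_error(pose_3d, pose_2d, K, R, t):
    """Mean pixel distance between projected 3D joints and detected 2D joints.

    Assumption: joints are listed in the same order in both poses.
    """
    dists = [math.dist(project(X, K, R, t), x) for X, x in zip(pose_3d, pose_2d)]
    return sum(dists) / len(dists)


def match_nearest_pose(pose_3d, candidates_2d, K, R, t):
    """Index of the detected 2D pose closest to the projected 3D pose."""
    errors = [reprojection_error(pose_3d, c, K, R, t) for c in candidates_2d]
    return min(range(len(errors)), key=errors.__getitem__)


# Toy example: camera at the world origin, two joints at 5 m depth.
K = [[1000.0, 0.0, 320.0], [0.0, 1000.0, 240.0], [0.0, 0.0, 1.0]]
R = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t = [0.0, 0.0, 0.0]
pose_3d = [(0.0, 0.0, 5.0), (0.5, 0.0, 5.0)]
detections = [
    [(100.0, 100.0), (200.0, 100.0)],  # some other person
    [(323.0, 244.0), (420.0, 240.0)],  # same person, slightly noisy
]
best = match_nearest_pose(pose_3d, detections, K, R, t)
loss = reprojection_error(pose_3d, detections[best], K, R, t)
```

In a training setup, a differentiable version of `reprojection_error`, summed over the non-target views, would play the role of the loss, so no ground-truth 3D pose is needed.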
Pages: 482-494
Number of pages: 13