AirPose: Multi-View Fusion Network for Aeria 3D Human Pose and Shape Estimation

被引:12
|
作者
Saini, Nitin [1 ,2 ]
Bonetto, Elia [1 ,2 ]
Price, Eric [1 ,2 ]
Ahmad, Aamir [1 ,2 ]
Black, Michael J. [1 ]
机构
[1] Max Planck Inst Intelligent Syst, D-72076 Tubingen, Germany
[2] Univ Stuttgart, Inst Flight Mech & Controls, Fac Aerosp Engn & Geodesy, D-70569 Stuttgart, Germany
关键词
Aerial systems; deep learning for visual perception; datasets for human motion; human detection and tracking; perception and autonomy; sensor fusion;
D O I
10.1109/LRA.2022.3145494
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter, we present a novel markerless 3D human motion capture (MoCap) system for unstructured, outdoor environments that uses a team of autonomous unmanned aerial vehicles (UAVs) with on-hoard RGB cameras and computation. Existing methods are limited by calibrated cameras and off-line processing. Thus, we present the first method (AirPose) to estimate human pose and shape using images captured by multiple extrinsically uncalibrated flying cameras. AirPose itself calibrates the cameras relative to the person instead of relying on any pre-calibration. It uses distributed neural networks running on each UAV that communicate viewpoint-independent information with each other about the person (i.e., their 3D shape and articulated pose). The person's shape and pose are parameterized using the SMPL-X body model, resulting in a compact representation, that minimizes communication between the UAVs. The network is trained using synthetic images of realistic virtual environments, and fine-tuned on a small set of real images. We also introduce an optimization-based post-processing method (AirPose+) for offline applications that require higher MoCap quality. We make our method's code and data available for research at https://github.com/robot-perception-group/AirPose. A video describing the approach and results is available at https://youtu.be/xLYelTNHsfs.
引用
收藏
页码:4805 / 4812
页数:8
相关论文
共 50 条
  • [1] PROGRESSIVE MULTI-VIEW FUSION FOR 3D HUMAN POSE ESTIMATION
    Zhang, Lijun
    Zhou, Kangkang
    Liu, Liangchen
    Li, Zhenghao
    Zhao, Xunyi
    Zhou, Xiang-Dong
    Shi, Yu
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1600 - 1604
  • [2] Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation
    Zhou, Kangkang
    Zhang, Lijun
    Lu, Feng
    Zhou, Xiang-Dong
    Shi, Yu
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7512 - 7520
  • [3] Multi-view Pictorial Structures for 3D Human Pose Estimation
    Amin, Sikandar
    Andriluka, Mykhaylo
    Rohrbach, Marcus
    Schiele, Bernt
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [4] Multi-view 3D Human Pose Estimation in Complex Environment
    M. Hofmann
    D. M. Gavrila
    [J]. International Journal of Computer Vision, 2012, 96 : 103 - 124
  • [5] Generative Multi-View Based 3D Human Pose Estimation
    Sabri, Motaz
    [J]. PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY, SIET 2021, 2021, : 2 - 9
  • [6] Multi-view 3D Human Pose Estimation in Complex Environment
    Hofmann, M.
    Gavrila, D. M.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (01) : 103 - 124
  • [7] Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
    Jiang, Boyuan
    Hu, Lei
    Xia, Shihong
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14804 - 14814
  • [8] 3D Human Pose Estimation from Deep Multi-View 2D Pose
    Schwarcz, Steven
    Pollard, Thomas
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2326 - 2331
  • [9] 3d human pose estimation based on multi view information fusion
    Zhang, Shuo
    Liu, Ming
    Zhao, Yuejin
    Dong, Liquan
    Kong, Lingqin
    [J]. OPTICAL METROLOGY AND INSPECTION FOR INDUSTRIAL APPLICATIONS IX, 2022, 12319
  • [10] 3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-view Model-fitting
    Li, Zhongguo
    Oskarsson, Magnus
    Heyden, Anders
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1887 - 1896