Deep NRSFM for multi-view multi-body pose estimation

被引:0
|
作者
Fothi, Aron [1 ]
Skaf, Joul [1 ]
Lu, Fengjiao [1 ]
Fenech, Kristian [1 ]
机构
[1] Eotvos Lorand Univ, Dept Artificial Intelligence, Pazmany Peter stny 1-A, H-1117 Budapest, Hungary
关键词
Non-rigid structure from motion; Multi-view multi-body pose estimation; Dictionary learning; SHAPE;
D O I
10.1016/j.patrec.2024.08.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the challenging task of unsupervised relative human pose estimation. Our solution exploits the potential offered by utilizing multiple uncalibrated cameras. It is assumed that spatial human pose and camera parameter estimation can be solved as a block sparse dictionary learning problem with zero supervision. The resulting structures and camera parameters can fit individual skeletons into a common space. To do so, we exploit the fact that all individuals in the image are viewed from the same camera viewpoint, thus exploiting the information provided by multiple camera views and overcoming the lack of information on camera parameters. To the best of our knowledge, this is the first solution that requires neither 3D ground truth nor knowledge of the intrinsic or extrinsic camera parameters. Our approach demonstrates the potential of using multiple viewpoints to solve challenging computer vision problems. Additionally, we provide access to the code, encouraging further development and experimentation. https://github.com/Jeryoss/MVMB-NRSFM.
引用
收藏
页码:218 / 224
页数:7
相关论文
共 50 条
  • [1] Deep learning based camera pose estimation in multi-view environment
    Charco, Jorge L.
    Vintimilla, Boris X.
    Sappa, Angel D.
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 224 - 228
  • [2] Epipolar Transformer for Multi-view Human Pose Estimation
    He, Yihui
    Yan, Rui
    Fragkiadaki, Katerina
    Yu, Shoou-, I
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4466 - 4471
  • [3] A Unified Framework for Multi-view Multi-class Object Pose Estimation
    Li, Chi
    Bai, Jin
    Hager, Gregory D.
    [J]. COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 263 - 281
  • [4] Multi-view segmentation based on human pose estimation in images
    Liu, Meng
    Qingxuan, Jia
    [J]. International Journal of Applied Mathematics and Statistics, 2013, 44 (14): : 104 - 111
  • [5] Efficient Multi-View Object Recognition and Full Pose Estimation
    Collet, Alvaro
    Srinivasa, Siddhartha S.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 2050 - 2055
  • [6] Multi-view head pose estimation using neural networks
    Voit, M
    Nickel, K
    Stiefelhagen, R
    [J]. 2ND CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION, PROCEEDINGS, 2005, : 347 - 352
  • [7] TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting
    Choudhury, Rohan
    Kitani, Kris M.
    Jeni, Laszlo A.
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14704 - 14714
  • [8] Multi-view Pose Estimation with Flexible Mixtures-of-Parts
    Dogan, Emre
    Eren, Gonen
    Wolf, Christian
    Lombardi, Eric
    Baskurt, Atilla
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017), 2017, 10617 : 180 - 190
  • [9] An Automatic System for Multi-View Face Detection and Pose Estimation
    Ying, Ying
    Wang, Han
    Xu, Jian
    [J]. 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 1101 - 1108
  • [10] Human Pose Estimation through a Novel Multi-view Scheme
    Charco, Jorge L.
    Sappa, Angel D.
    Vintimilla, Boris X.
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 855 - 862