Overview of 3D Human Pose Estimation

被引:3
|
作者
Lin, Jianchu [1 ,2 ]
Li, Shuang [3 ]
Qin, Hong [3 ,4 ,5 ]
Wang, Hongchang [3 ]
Cui, Ning [7 ]
Jiang, Qian [8 ]
Jian, Haifang [3 ]
Wang, Gongming [6 ]
机构
[1] Huaiyin Inst Technol, Huaian 223000, Peoples R China
[2] Jiangsu Outlook Shenzhou Big Data Technol Co Ltd, Nanjing 210002, Peoples R China
[3] Chinese Acad Sci, Inst Semicond, Beijing 100083, Peoples R China
[4] Univ Chinese Acad Sci, Ctr Mat Sci & Optoelect Engn, Beijing 100049, Peoples R China
[5] Univ Chinese Acad Sci, Sch Integrated Circuits, Beijing 100049, Peoples R China
[6] Inspur Software Grp Co Ltd, Jinan 250104, Peoples R China
[7] China Great Wall Ind Corp, Beijing 100054, Peoples R China
[8] China Great Wall Ind Corp Nav Co Ltd, Beijing 100144, Peoples R China
来源
关键词
human pose estimation; monocular camera; deep learning; multi-view; indicator; REPRESENTATION; MODEL;
D O I
10.32604/cmes.2022.020857
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
3D human pose estimation is a major focus area in the field of computer vision, which plays an important role in practical applications. This article summarizes the framework and research progress related to the estimation of monocular RGB images and videos. An overall perspective of methods integrated with deep learning is introduced. Novel image-based and video-based inputs are proposed as the analysis framework. From this viewpoint, common problems are discussed. The diversity of human postures usually leads to problems such as occlusion and ambiguity, and the lack of training datasets often results in poor generalization ability of the model. Regression methods are crucial for solving such problems. Considering image-based input, the multi-view method is commonly used to solve occlusion problems. Here, the multi-view method is analyzed comprehensively. By referring to video-based input, the human prior knowledge of restricted motion is used to predict human postures. In addition, structural constraints are widely used as prior knowledge. Furthermore, weakly supervised learning methods are studied and discussed for these two types of inputs to improve the model generalization ability. The problem of insufficient training datasets must also be considered, especially because 3D datasets are usually biased and limited. Finally, emerging and popular datasets and evaluation indicators are discussed. The characteristics of the datasets and the relationships of the indicators are explained and highlighted. Thus, this article can be useful and instructive for researchers who are lacking in experience and find this field confusing. In addition, by providing an overview of 3D human pose estimation, this article sorts and refines recent studies on 3D human pose estimation. It describes kernel problems and common useful methods, and discusses the scope for further research.
引用
收藏
页码:1621 / 1651
页数:31
相关论文
共 50 条
  • [1] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Liu, Shuangjun
    Sehgal, Naveen
    Ostadabbas, Sarah
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
  • [2] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Shuangjun Liu
    Naveen Sehgal
    Sarah Ostadabbas
    [J]. Applied Intelligence, 2022, 52 : 14491 - 14506
  • [3] On the Robustness of 3D Human Pose Estimation
    Chen, Zerui
    Huang, Yan
    Wang, Liang
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5326 - 5332
  • [4] SlowFastFormer for 3D human pose estimation
    Zhou, Lu
    Chen, Yingying
    Wang, Jinqiao
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 243
  • [5] 3D Human Pose Estimation=2D Pose Estimation plus Matching
    Chen, Ching-Hang
    Ramanan, Deva
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5759 - 5767
  • [6] A survey on monocular 3D human pose estimation
    Ji, Xiaopeng
    Fang, Qi
    Dong, Junting
    Shuai, Qing
    Jiang, Wen
    Zhou, Xiaowei
    [J]. Virtual Reality and Intelligent Hardware, 2020, 2 (06): : 471 - 500
  • [8] Precise 3D Pose Estimation of Human Faces
    Pernek, Akos
    Hajder, Levente
    [J]. PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, : 618 - 625
  • [9] Deep 3D human pose estimation: A review
    Wang, Jinbao
    Tan, Shujie
    Zhen, Xiantong
    Xu, Shuo
    Zheng, Feng
    He, Zhenyu
    Shao, Ling
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 210
  • [10] 3D human pose estimation by depth map
    Wu, Jianzhai
    Hu, Dewen
    Xiang, Fengtao
    Yuan, Xingsheng
    Su, Jiongming
    [J]. VISUAL COMPUTER, 2020, 36 (07): : 1401 - 1410