Real-Time Human Pose Recognition in Parts from Single Depth Images

Cited by: 1168
Authors
Shotton, Jamie [1]
Sharp, Toby [1]
Kipman, Alex
Fitzgibbon, Andrew [1]
Finocchio, Mark
Blake, Andrew [1]
Cook, Mat [1]
Moore, Richard
Affiliations
[1] Microsoft Res, Cambridge, England
Keywords
DOI
10.1145/2398356.2398381
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
We propose a new method to quickly and accurately predict human pose (the 3D positions of body joints) from a single depth image, without depending on information from preceding frames. Our approach is strongly rooted in current object recognition strategies. By designing an intermediate representation in terms of body parts, the difficult pose estimation problem is transformed into a simpler per-pixel classification problem, for which efficient machine learning techniques exist. By using computer graphics to synthesize a very large dataset of training image pairs, one can train a classifier that estimates body part labels from test images invariant to pose, body shape, clothing, and other irrelevances. Finally, we generate confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes. The system runs in under 5 ms on the Xbox 360. Our evaluation shows high accuracy on both synthetic and real test sets, and investigates the effect of several training parameters. We achieve state-of-the-art accuracy in our comparison with related work and demonstrate improved generalization over exact whole-skeleton nearest neighbor matching.
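The final step of the abstract (reprojecting the per-pixel classification result and finding local modes) can be illustrated with a minimal sketch. The abstract does not specify how the modes are found, so everything here is an assumption made for illustration: a pinhole camera model with hypothetical intrinsics `fx`, `fy`, `cx`, `cy`, a Gaussian-kernel weighted mean shift, a single start point at the most confident pixel, and a hypothetical `bandwidth` in metres. The input `part_prob` stands in for the classifier's per-pixel probability for one body part; the function names are not taken from the paper.

```python
import numpy as np


def reproject(depth, fx, fy, cx, cy):
    """Back-project a depth image (metres) to camera-space points, shape (H, W, 3).

    Assumes a pinhole camera; fx, fy, cx, cy are hypothetical intrinsics.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # u: column index, v: row index
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.stack([x, y, depth], axis=-1)


def joint_proposal(depth, part_prob, fx, fy, cx, cy,
                   bandwidth=0.06, iters=50, tol=1e-6):
    """Return (mode, confidence) for one body part, or (None, 0.0) if no pixel votes.

    Runs a weighted mean shift with a Gaussian kernel (an assumed choice),
    starting from the pixel with the highest part probability.
    """
    points = reproject(depth, fx, fy, cx, cy).reshape(-1, 3)
    weights = part_prob.reshape(-1).astype(float)

    # Ignore pixels with no depth reading or zero probability for this part.
    valid = (depth.reshape(-1) > 0) & (weights > 0)
    points, weights = points[valid], weights[valid]
    if points.shape[0] == 0:
        return None, 0.0

    mode = points[np.argmax(weights)].copy()
    for _ in range(iters):
        sq_dist = np.sum((points - mode) ** 2, axis=1)
        kernel = weights * np.exp(-sq_dist / (2.0 * bandwidth ** 2))
        shifted = kernel @ points / kernel.sum()  # kernel-weighted mean of the points
        converged = np.linalg.norm(shifted - mode) < tol
        mode = shifted
        if converged:
            break

    # Confidence: total kernel-weighted probability mass around the final mode.
    sq_dist = np.sum((points - mode) ** 2, axis=1)
    confidence = float(np.sum(weights * np.exp(-sq_dist / (2.0 * bandwidth ** 2))))
    return mode, confidence


if __name__ == "__main__":
    # Synthetic example: a flat surface 2 m away with a probability blob for one part.
    depth = np.full((40, 40), 2.0)
    u, v = np.meshgrid(np.arange(40), np.arange(40))
    prob = np.exp(-((u - 25) ** 2 + (v - 10) ** 2) / (2 * 2.0 ** 2))
    mode, conf = joint_proposal(depth, prob, fx=50.0, fy=50.0, cx=20.0, cy=20.0)
    print("proposed joint position (m):", mode, "confidence:", conf)
```

A fuller version would start the mean shift from several high-probability pixels and keep each distinct mode as a separate proposal, which is one way to obtain the multiple confidence-scored proposals the abstract mentions.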
Pages: 116-124
Page count: 9
Related papers
50 in total
  • [1] Real-Time Human Pose Recognition in Parts from Single Depth Images
    Shotton, Jamie
    Fitzgibbon, Andrew
    Cook, Mat
    Sharp, Toby
    Finocchio, Mark
    Moore, Richard
    Kipman, Alex
    Blake, Andrew
    [J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1297 - 1304
  • [2] Real-Time Human Pose Estimation and Gesture Recognition from Depth Images Using Superpixels and SVM Classifier
    Kim, Hanguen
    Lee, Sangwon
    Lee, Dongsung
    Choi, Soonmin
    Ju, Jinsun
    Myung, Hyun
    [J]. SENSORS, 2015, 15 (06) : 12410 - 12427
  • [3] Real-time 3D Pose Estimation from Single Depth Images
    Schnuerer, Thomas
    Fuchs, Stefan
    Eisenbach, Markus
    Gross, Horst-Michael
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 716 - 724
  • [4] REAL-TIME UPPER BODY POSE ESTIMATION FROM DEPTH IMAGES
    Tsai, Ming-Han
    Chen, Kuan-Hua
    Lin, I-Chen
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2234 - 2238
  • [5] Real-Time Human Body Pose Estimation for In-Car Depth Images
    Torres, Helena R.
    Oliveira, Bruno
    Fonseca, Jaime
    Queiros, Sandro
    Borges, Joao
    Rodrigues, Nelson
    Coelho, Victor
    Pallauf, Johannes
    Brito, Jose
    Mendes, Jose
    [J]. TECHNOLOGICAL INNOVATION FOR INDUSTRY AND SERVICE SYSTEMS, DOCEIS 2019, 2019, 553 : 169 - 182
  • [6] Real-time Identification and Localization of Body Parts from Depth Images
    Plagemann, Christian
    Ganapathi, Varun
    Koller, Daphne
    Thrun, Sebastian
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 3108 - 3113
  • [7] Generalized Sum of Gaussians for Real-Time Human Pose Tracking from a Single Depth Sensor
    Ding, Meng
    Fan, Guoliang
    [J]. 2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 47 - 54
  • [8] Real-time face pose estimation from single range images
    Breitenstein, Michael D.
    Kuettel, Daniel
    Weise, Thibaut
    van Gool, Luc
    Pfister, Hanspeter
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3613 - +
  • [9] Recognition Combined Human Pose Tracking Using Single Depth Images
    Kim, Wonjun
    Yoo, ByungIn
    Han, Jae-Joon
    Choi, Changkyu
    [J]. VISUAL INFORMATION PROCESSING AND COMMUNICATION V, 2014, 9029
  • [10] Efficient Human Pose Estimation from Single Depth Images
    Shotton, Jamie
    Girshick, Ross
    Fitzgibbon, Andrew
    Sharp, Toby
    Cook, Mat
    Finocchio, Mark
    Moore, Richard
    Kohli, Pushmeet
    Criminisi, Antonio
    Kipman, Alex
    Blake, Andrew
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) : 2821 - 2840