Real-time 3D Pose Estimation from Single Depth Images

Cited by: 6
Authors
Schnuerer, Thomas [1, 2]
Fuchs, Stefan [2]
Eisenbach, Markus [1]
Gross, Horst-Michael [1]
Affiliations
[1] Ilmenau Univ Technol, Neuroinformat & Cognit Robot Lab, D-98684 Ilmenau, Germany
[2] Honda Res Inst Europe GmbH, D-63073 Offenbach, Germany
Keywords
Real-time 3D Joint Estimation; Human-Robot-Interaction; Deep Learning
DOI
10.5220/0007394707160724
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
To allow for safe Human-Robot-Interaction in industrial scenarios like manufacturing plants, it is essential to always be aware of the location and pose of humans in the shared workspace. We introduce a real-time 3D pose estimation system using single depth images that is intended to run on limited hardware, such as a mobile robot. To this end, we optimized a CNN-based 2D pose estimation architecture to achieve high frame rates while requiring fewer resources. Building upon this architecture, we extended the system to 3D estimation so that it directly predicts Cartesian body joint coordinates. We evaluated our system on a newly created dataset by applying it to a specific industrial workbench scenario. The results show that our system's performance is competitive with the state of the art at more than five times the speed for single-person pose estimation.
Pages: 716-724
Number of pages: 9