Real-time Hand Tracking under Occlusion from an Egocentric RGB-D Sensor

Cited by: 89
Authors
Mueller, Franziska [1, 2]
Mehta, Dushyant [1, 2]
Sotnychenko, Oleksandr [1]
Sridhar, Srinath [1]
Casas, Dan [3]
Theobalt, Christian [1]
Affiliations
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Saarland Univ, Saarbrucken, Germany
[3] Univ Rey Juan Carlos, Mostoles, Spain
DOI
10.1109/ICCV.2017.131
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an approach for real-time, robust and accurate hand pose estimation from moving egocentric RGB-D cameras in cluttered real environments. Existing methods typically fail for hand-object interactions in cluttered scenes imaged from egocentric viewpoints, which are common in virtual or augmented reality applications. Our approach uses two Convolutional Neural Networks (CNNs), applied one after the other, to localize the hand and regress 3D joint locations. Hand localization is achieved by using a CNN to estimate the 2D position of the hand center in the input, even in the presence of clutter and occlusions. The localized hand position, together with the corresponding input depth value, is used to generate a normalized cropped image that is fed into a second CNN to regress relative 3D hand joint locations in real time. For added accuracy, robustness and temporal stability, we refine the pose estimates using a kinematic pose tracking energy. To train the CNNs, we introduce a new photorealistic dataset that uses a merged reality approach to capture and synthesize large amounts of annotated data of natural hand interaction in cluttered scenes. Through quantitative and qualitative evaluation, we show that our method is robust to self-occlusion and occlusions by objects, particularly in moving egocentric perspectives.
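The cropping step between the two networks can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a pinhole camera, and the function name `normalized_crop`, the focal length `focal_px`, the physical box size `box_mm`, and the output resolution `out_size` are all hypothetical values chosen for illustration. Given the 2D hand center and its depth, it cuts out a square whose pixel size shrinks with distance, then expresses depth relative to the hand center.

```python
def normalized_crop(depth_mm, center_uv, focal_px=475.0, box_mm=300.0, out_size=128):
    """Depth-normalized square crop around a hand center (illustrative only).

    depth_mm  : 2D list of depth values in millimeters, 0 meaning "no depth"
    center_uv : (u, v) pixel position of the hand center (column, row)
    focal_px, box_mm, out_size : assumed values, not taken from the paper
    """
    u, v = center_uv
    h, w = len(depth_mm), len(depth_mm[0])
    z = float(depth_mm[v][u])
    if z <= 0:
        raise ValueError("no valid depth at the hand center")

    # Half side length in pixels of a box_mm-wide box seen at distance z.
    half = max(1, round(focal_px * box_mm / z) // 2)

    # Nearest-neighbor sample offsets that resize the crop to out_size.
    step = 2 * half / (out_size - 1)
    offsets = [round(-half + i * step) for i in range(out_size)]

    crop = []
    for dv in offsets:
        row = []
        for du in offsets:
            r, c = v + dv, u + du
            d = depth_mm[r][c] if 0 <= r < h and 0 <= c < w else 0.0
            if d <= 0:
                # Missing or out-of-image pixels are treated as far background.
                row.append(1.0)
            else:
                # Depth relative to the hand center, scaled and clipped to [-1, 1].
                row.append(max(-1.0, min(1.0, (d - z) / (box_mm / 2.0))))
        crop.append(row)
    return crop
```

Scaling the crop window by the inverse of the depth keeps the hand at roughly the same apparent size in the network input regardless of how far it is from the camera, which is one plausible reading of "normalized" in the abstract.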
Pages: 1163-1172
Number of pages: 10