Real-time Hand Tracking under Occlusion from an Egocentric RGB-D Sensor

被引:89
|
作者
Mueller, Franziska [1 ,2 ]
Mehta, Dushyant [1 ,2 ]
Sotnychenko, Oleksandr [1 ]
Sridhar, Srinath [1 ]
Casas, Dan [3 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Saarland Univ, Saarbrucken, Germany
[3] Univ Rey Juan Carlos, Mostoles, Spain
关键词
D O I
10.1109/ICCV.2017.131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach for real-time, robust and accurate hand pose estimation from moving egocentric RGB-D cameras in cluttered real environments. Existing methods typically fail for hand-object interactions in cluttered scenes imaged from egocentric viewpoints-common for virtual or augmented reality applications. Our approach uses two subsequently applied Convolutional Neural Networks (CNNs) to localize the hand and regress 3D joint locations. Hand localization is achieved by using a CNN to estimate the 2D position of the hand center in the input, even in the presence of clutter and occlusions. The localized hand position, together with the corresponding input depth value, is used to generate a normalized cropped image that is fed into a second CNN to regress relative 3D hand joint locations in real time. For added accuracy, robustness and temporal stability, we refine the pose estimates using a kinematic pose tracking energy. To train the CNNs, we introduce a new photorealistic dataset that uses a merged reality approach to capture and synthesize large amounts of annotated data of natural hand interaction in cluttered scenes. Through quantitative and qualitative evaluation, we show that our method is robust to self-occlusion and occlusions by objects, particularly in moving egocentric perspectives.
引用
收藏
页码:1163 / 1172
页数:10
相关论文
共 50 条
  • [41] Real-Time Video Inpainting for RGB-D Pipeline Reconstruction
    Wang, Luyuan
    Tian, Tina
    Yan, Xinzhi
    Ruan, Fujun
    Aadityaa, G. Jaya
    Choset, Howie
    Li, Lu
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9543 - 9550
  • [42] A Real-time Virtual Dressing System with RGB-D Camera
    Chen, Mingliang
    Lin, Weiyao
    Zhou, Bing
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1041 - 1044
  • [43] A Real-Time Pedestrian Counting System Based on RGB-D
    Yao, Yang
    Zhang, Xu
    Liang, Yu
    Zhang, Xin
    Shen, Furao
    Zhao, Jian
    2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 110 - 117
  • [44] Real-time depth enhancement by fusion for RGB-D cameras
    Garcia, Frederic
    Aouada, Djamila
    Solignac, Thomas
    Mirbach, Bruno
    Ottersten, Bjoern
    IET COMPUTER VISION, 2013, 7 (05) : 335 - 345
  • [45] Real-Time RGB-D Activity Prediction by Soft Regression
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Ma, Lianyang
    Wang, Gang
    Lai, Jianhuang
    COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 280 - 296
  • [46] RGB2Hands: Real-Time Tracking of 3D Hand Interactions from Monocular RGB Video
    Wang, Jiayi
    Mueller, Franziska
    Bernard, Florian
    Sorli, Suzanne
    Sotnychenko, Oleksandr
    Qian, Neng
    Otaduy, Miguel A.
    Casas, Dan
    Theobalt, Christian
    ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (06):
  • [47] Hand Pose Estimation and Motion Recognition Using Egocentric RGB-D Video
    Yamazaki, Wataru
    Ding, Ming
    Takamatsu, Jun
    Ogasawara, Tsukasa
    2017 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE ROBIO 2017), 2017, : 147 - 152
  • [48] Real-Time Accurate 3D Head Tracking and Pose Estimation with Consumer RGB-D Cameras
    David Joseph Tan
    Federico Tombari
    Nassir Navab
    International Journal of Computer Vision, 2018, 126 : 158 - 183
  • [49] Real-Time Accurate 3D Head Tracking and Pose Estimation with Consumer RGB-D Cameras
    Tan, David Joseph
    Tombari, Federico
    Navab, Nassir
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (2-4) : 158 - 183
  • [50] REAL-TIME SLAM FROM RGB-D DATA ON A LEGGED ROBOT: AN EXPERIMENTAL STUDY
    Belter, Dominik
    Kostusiak, Aleksander
    Nowicki, Michal
    Skrzypczynski, Piotr
    ADVANCES IN COOPERATIVE ROBOTICS, 2017, : 320 - 328