Real-time Hand Tracking under Occlusion from an Egocentric RGB-D Sensor

Cited by: 89
Authors
Mueller, Franziska [1, 2]
Mehta, Dushyant [1, 2]
Sotnychenko, Oleksandr [1 ]
Sridhar, Srinath [1 ]
Casas, Dan [3 ]
Theobalt, Christian [1 ]
Affiliations
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Saarland Univ, Saarbrucken, Germany
[3] Univ Rey Juan Carlos, Mostoles, Spain
DOI
10.1109/ICCV.2017.131
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present an approach for real-time, robust and accurate hand pose estimation from moving egocentric RGB-D cameras in cluttered real environments. Existing methods typically fail for hand-object interactions in cluttered scenes imaged from egocentric viewpoints, which are common in virtual or augmented reality applications. Our approach applies two Convolutional Neural Networks (CNNs) in sequence to localize the hand and regress 3D joint locations. Hand localization is achieved by using a CNN to estimate the 2D position of the hand center in the input, even in the presence of clutter and occlusions. The localized hand position, together with the corresponding input depth value, is used to generate a normalized cropped image that is fed into a second CNN to regress relative 3D hand joint locations in real time. For added accuracy, robustness and temporal stability, we refine the pose estimates using a kinematic pose tracking energy. To train the CNNs, we introduce a new photorealistic dataset that uses a merged reality approach to capture and synthesize large amounts of annotated data of natural hand interaction in cluttered scenes. Through quantitative and qualitative evaluation, we show that our method is robust to self-occlusion and occlusions by objects, particularly in moving egocentric perspectives.
Pages: 1163-1172
Page count: 10