Real-Time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

被引:138
|
作者
Sridhar, Srinath [1 ]
Mueller, Franziska [1 ]
Zollhoefer, Michael [1 ]
Casas, Dan [1 ]
Oulasvirta, Antti [2 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Aalto Univ, Espoo, Finland
来源
关键词
MOTION; MODEL; SHAPE; POSE;
D O I
10.1007/978-3-319-46475-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time simultaneous tracking of hands manipulating and interacting with external objects has many potential applications in augmented reality, tangible computing, and wearable computing. However, due to difficult occlusions, fast motions, and uniform hand appearance, jointly tracking hand and object pose is more challenging than tracking either of the two separately. Many previous approaches resort to complex multi-camera setups to remedy the occlusion problem and often employ expensive segmentation and optimization steps which makes real-time tracking impossible. In this paper, we propose a real-time solution that uses a single commodity RGB-D camera. The core of our approach is a 3D articulated Gaussian mixture alignment strategy tailored to handobject tracking that allows fast pose optimization. The alignment energy uses novel regularizers to address occlusions and hand-object contacts. For added robustness, we guide the optimization with discriminative part classification of the hand and segmentation of the object. We conducted extensive experiments on several existing datasets and introduce a new annotated hand-object dataset. Quantitative and qualitative results show the key advantages of our method: speed, accuracy, and robustness.
引用
收藏
页码:294 / 310
页数:17
相关论文
共 50 条
  • [21] Real-time 3D Object Detection from Point Clouds using an RGB-D Camera
    Wang, Ya
    Xu, Shu
    Zell, Andreas
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 407 - 414
  • [22] RGB-D Object Modelling for Object Recognition and Tracking
    Prankl, Johann
    Aldoma, Aitor
    Svejda, Alexander
    Vincze, Markus
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 96 - 103
  • [23] GANerated Hands for Real-Time 3D Hand Tracking from Monocular RGB
    Mueller, Franziska
    Bernard, Florian
    Sotnychenko, Oleksandr
    Mehta, Dushyant
    Sridhar, Srinath
    Casas, Dan
    Theobalt, Christian
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 49 - 59
  • [24] Real-Time Visual Odometry from Dense RGB-D Images
    Steinbruecker, Frank
    Sturm, Juergen
    Cremers, Daniel
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [25] SlamDunk: Affordable Real-Time RGB-D SLAM
    Fioraio, Nicola
    Di Stefano, Luigi
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 401 - 414
  • [26] Detecting and tracking people in real time with RGB-D camera
    Liu, Jun
    Liu, Ye
    Zhang, Guyue
    Zhu, Peiru
    Chen, Yan Qiu
    PATTERN RECOGNITION LETTERS, 2015, 53 : 16 - 23
  • [27] Real-Time and Fast RGB-D based People Detection and Tracking for Service Robots
    Sun, Yue
    Sun, Lei
    Liu, Jingtai
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 1514 - 1519
  • [28] Real-time Tracking-by-Detection of Human Motion in RGB-D Camera Networks
    Malaguti, Alessandro
    Carraro, Marco
    Guidolin, Mattia
    Tagliapietra, Luca
    Menegatti, Emanuele
    Ghidoni, Stefano
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3198 - 3204
  • [29] Real-Time RGB-D based People Detection and Tracking System for Mobile Robots
    Fang, Fang
    Qian, Kun
    Zhou, Bo
    Ma, Xudong
    2017 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2017, : 1937 - 1941
  • [30] Real-time human body tracking based on data fusion from multiple RGB-D sensors
    Juan C. Núñez
    Raúl Cabido
    Antonio S. Montemayor
    Juan J. Pantrigo
    Multimedia Tools and Applications, 2017, 76 : 4249 - 4271