Depth-based 3D Hand Pose Tracking

被引:0
|
作者
Quach, Kha Gia [1 ]
Chi Nhan Duong [1 ]
Luu, Khoa [2 ,3 ]
Bui, Tien D. [1 ]
机构
[1] Concordia Univ, Dept Comp Sci & Software Engn, Montreal, PQ, Canada
[2] Carnegie Mellon Univ, CyLab Biometr Ctr, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose two new approaches using the Convolution Neural Network (CNN) and the Recurrent Neural Network (RNN) for tracking 3D hand poses. The first approach is a detection based algorithm while the second is a data driven method. Our first contribution is a new tracking-by- detection strategy extending the CNN based single frame detection method to a multiple frame tracking approach by taking into account prediction history using RNN. Our second contribution is the use of RNN to simulate the fitting of a 3D model to the input data. It helps to relax the need of a carefully designed fitting function and optimization algorithm. With such strategies, we show that our tracking frameworks can automatically correct the fail detection made in previous frames due to occlusions. Our proposed method is evaluated on two public hand datasets, i.e. NYU and ICVL, and compared against other recent hand tracking methods. Experimental results show that our approaches achieve the state-of-the-art accuracy and efficiency in the challenging problem of 3D hand pose estimation.
引用
收藏
页码:2746 / 2751
页数:6
相关论文
共 50 条
  • [31] Survey on depth and RGB image-based 3D hand shape and pose estimation
    Lin HUANG
    Boshen ZHANG
    Zhilin GUO
    Yang XIAO
    Zhiguo CAO
    Junsong YUAN
    [J]. 虚拟现实与智能硬件(中英文), 2021, 3 (03) : 207 - 234
  • [32] Survey on depth and RGB image-based 3D hand shape and pose estimation
    Huang, Lin
    Zhang, Boshen
    Guo, Zhilin
    Xiao, Yang
    Cao, Zhiguo
    Yuan, Junsong
    [J]. Virtual Reality and Intelligent Hardware, 2021, 3 (03): : 207 - 234
  • [33] Mono-DCNet: Monocular 3D Object Detection via Depth-based Centroid Refinement and Pose Estimation
    Astudillo, Armando
    Al-Kaff, Abdulla
    Garcia, Fernando
    [J]. 2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 664 - 669
  • [34] Depth-Based Stereoscopic Projection Approach for 3D Saliency Detection
    Lin, Hongyun
    Lin, Chunyu
    Zhao, Yao
    Xiao, Jimin
    Tillo, Tammam
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2015, PT I, 2015, 9314 : 664 - 673
  • [35] Depth-Based Image Processing for 3D Video Rendering Applications
    Zarb, Terence
    Debono, Carl James
    [J]. 21ST INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2014), 2014, : 215 - 218
  • [36] A Unified Trifocal System for Advanced Depth-Based 3D Capture
    Brune, Thomas
    Gutberlet, Nicola M.
    Tanger, Ralf
    Gandolph, Dirk
    [J]. SMPTE MOTION IMAGING JOURNAL, 2012, 121 (07): : 48 - 55
  • [37] Depth-Based Motion Vector Prediction in 3D Video Coding
    Su, Wenyi
    Rusanovskyy, Dmytro
    Hannuksela, Miska
    Li, Houqiang
    [J]. 2012 PICTURE CODING SYMPOSIUM (PCS), 2012, : 37 - 40
  • [38] Locating 3D Object Proposals: A Depth-Based Online Approach
    Pahwa, Ramanpreet Singh
    Lu, Jiangbo
    Jiang, Nianjuan
    Ng, Tian Tsong
    Do, Minh N.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (03) : 626 - 639
  • [39] Depth-based hand gesture recognition
    Chih-Hung Wu
    Wei-Lun Chen
    Chang Hong Lin
    [J]. Multimedia Tools and Applications, 2016, 75 : 7065 - 7086
  • [40] Depth-based hand gesture recognition
    Wu, Chih-Hung
    Chen, Wei-Lun
    Lin, Chang Hong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (12) : 7065 - 7086