3D hand pose estimation algorithm based on cascaded features and graph convolution

被引:1
|
作者
Lin, Yi-lin [1 ,2 ]
Lin, Shan-ling [2 ,3 ]
Lin, Zhi-xian [1 ,2 ,3 ]
机构
[1] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China
[2] Fujian Sci & Technol Innovat Lab Optoelect Inform, Fuzhou 350116, Peoples R China
[3] Fuzhou Univ, Sch Adv Mfg, Quanzhou 362200, Peoples R China
基金
国家重点研发计划;
关键词
3D pose estimation; target detection; gesture recognition; feature enhancement; convolutional neural network; graph convolutional neural network;
D O I
10.37188/CJLCD.2021-0307
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
For the 3D key point pose estimation error caused by the high degree of freedom problem and structural similarity problem of the hand, this paper proposes a novel 3D hand skeleton pose regression framework for joint identification, detection, and pose estimation. The framework firstly adopts a YOLOv3-based detector to obtain the position of hands, then a cascade pose estimation network is designed to get initial hand poses with 2D and 3D pose supervisions. Finally, considering the natural constrains in hand graph connection, we present progressive GCN module to further refine the initial hand pose from coarse to fine. This paper compares PCK metrics and AUC metrics with the state-of-the-art approaches under different public benchmarks, and the proposed method achieves the highest AUC metrics on different test sets, with an average AUC accuracy of 92. 9%. The experiments illustrate that the proposed method is able to effectively and robustly predict 3D hand pose from monocular image, performing well in both test sets and in the wild.
引用
收藏
页码:736 / 745
页数:10
相关论文
共 22 条
  • [1] [Anonymous], 2016, ARXIV
  • [2] [Anonymous], 2019, [No title captured], V34, P417
  • [3] 3D Hand Shape and Pose from Images in the Wild
    Boukhayma, Adnane
    de Bem, Rodrigo
    Torr, Philip H. S.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10835 - 10844
  • [4] Weakly-Supervised 3D Hand Pose Estimation from Monocular RGB Images
    Cai, Yujun
    Ge, Liuhao
    Cai, Jianfei
    Yuan, Junsong
    [J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 678 - 694
  • [5] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [6] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [7] Farhadi A., 2018, YOLOv3: An incremental improvement, DOI DOI 10.48550/ARXIV.1804.02767
  • [8] 3D Hand Shape and Pose Estimation from a Single RGB Image
    Ge, Liuhao
    Ren, Zhou
    Li, Yuncheng
    Xue, Zehao
    Wang, Yingying
    Cai, Jianfei
    Yuan, Junsong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10825 - 10834
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera
    Mehta, Dushyant
    Sridhar, Srinath
    Sotnychenko, Oleksandr
    Rhodin, Helge
    Shafiei, Mohammad
    Seidel, Hans-Peter
    Xu, Weipeng
    Casas, Dan
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):