3D hand pose estimation algorithm based on cascaded features and graph convolution

Cited by: 1

Authors:

Lin, Yi-lin [1,2]

Lin, Shan-ling [2,3]

Lin, Zhi-xian [1,2,3]

Affiliations:

[1] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China

[2] Fujian Sci & Technol Innovat Lab Optoelect Inform, Fuzhou 350116, Peoples R China

[3] Fuzhou Univ, Sch Adv Mfg, Quanzhou 362200, Peoples R China

Source:

Chinese Journal of Liquid Crystals and Displays | 2022, Vol. 37, No. 06

Funding:

National Key Research and Development Program of China

Keywords:

3D pose estimation; target detection; gesture recognition; feature enhancement; convolutional neural network; graph convolutional neural network

DOI:

10.37188/CJLCD.2021-0307

Chinese Library Classification (CLC) number:

O7 [Crystallography]

Subject classification codes:

0702; 070205; 0703; 080501

Abstract:

To address the 3D keypoint estimation errors caused by the hand's high degrees of freedom and the structural similarity of its parts, this paper proposes a 3D hand skeleton pose regression framework that jointly performs identification, detection, and pose estimation. The framework first uses a YOLOv3-based detector to locate the hands. A cascaded pose estimation network, supervised with both 2D and 3D poses, then produces initial hand poses. Finally, to exploit the natural constraints of the hand's graph connectivity, a progressive GCN module refines the initial pose from coarse to fine. The method is compared with state-of-the-art approaches on several public benchmarks using the PCK and AUC metrics; it achieves the highest AUC on the different test sets, with an average AUC of 92.9%. The experiments show that the method predicts 3D hand pose from a monocular image effectively and robustly, performing well both on the test sets and in the wild.
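The PCK and AUC metrics named in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the function names `pck_curve` and `pck_auc`, the millimetre units, and the 20 to 50 mm threshold range in the usage lines are assumptions chosen for illustration (the abstract does not state the thresholds used). PCK is taken here as the fraction of predicted joints whose Euclidean distance to the ground truth is within a threshold, and AUC as the area under the PCK curve normalised by the threshold range.

```python
import numpy as np


def pck_curve(pred, gt, thresholds):
    """Fraction of joints whose 3D error is within each threshold.

    pred, gt: arrays of shape (num_samples, num_joints, 3), same units
    as `thresholds` (assumed to be millimetres in this sketch).
    """
    errors = np.linalg.norm(pred - gt, axis=-1)  # per-joint Euclidean error
    return np.array([(errors <= t).mean() for t in thresholds])


def pck_auc(pck, thresholds):
    """Area under the PCK curve (trapezoidal rule), normalised to [0, 1]."""
    thresholds = np.asarray(thresholds, dtype=float)
    widths = np.diff(thresholds)
    area = np.sum((pck[1:] + pck[:-1]) / 2.0 * widths)
    return area / (thresholds[-1] - thresholds[0])


# Hypothetical usage with random data: 21 joints is a common hand-skeleton
# layout, assumed here rather than taken from the abstract.
rng = np.random.default_rng(0)
gt = rng.normal(size=(8, 21, 3)) * 50.0
pred = gt + rng.normal(size=gt.shape) * 10.0
thresholds = np.linspace(20.0, 50.0, 31)  # assumed evaluation range in mm
curve = pck_curve(pred, gt, thresholds)
print(f"AUC over 20-50 mm: {pck_auc(curve, thresholds):.3f}")
```

Normalising by the threshold range keeps the AUC between 0 and 1, so a value such as the reported 92.9% can be read as the average PCK across the evaluated thresholds.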


Pages: 736-745

Page count: 10

