SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation

被引:1
|
作者
Zhu, Chenfei [1 ]
Hu, Boce [1 ]
Chen, Jiawei [1 ]
Ai, Xupeng [1 ]
Agrawal, Sunil K. K. [1 ,2 ]
机构
[1] Columbia Univ, Dept Mech Engn, New York, NY 10027 USA
[2] Columbia Univ, Dept Rehabil Med, New York, NY 10027 USA
来源
BIOENGINEERING-BASEL | 2023年 / 10卷 / 02期
基金
美国国家卫生研究院;
关键词
hand pose estimation; finger tapping test; hand movement recognition; deep learning; computer vision; depth camera;
D O I
10.3390/bioengineering10020126
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Hand pose estimation (HPE) plays an important role during the functional assessment of the hand and in potential rehabilitation. It is a challenge to predict the pose of the hand conveniently and accurately during functional tasks, and this limits the application of HPE. In this paper, we propose a novel architecture of a shifted attention regression network (SARN) to perform HPE. Given a depth image, SARN first predicts the spatial relationships between points in the depth image and a group of hand keypoints that determine the pose of the hand. Then, SARN uses these spatial relationships to infer the 3D position of each hand keypoint. To verify the effectiveness of the proposed method, we conducted experiments on three open-source datasets of 3D hand poses: NYU, ICVL, and MSRA. The proposed method achieved state-of-the-art performance with 7.32 mm, 5.91 mm, and 7.17 mm of mean error at the hand keypoints, i.e., mean Euclidean distance between the predicted and ground-truth hand keypoint positions. Additionally, to test the feasibility of SARN in hand movement recognition, a hand movement dataset of 26K depth images from 17 healthy subjects was constructed based on the finger tapping test, an important component of neurological exams administered to Parkinson's patients. Each image was annotated with the tips of the index finger and the thumb. For this dataset, the proposed method achieved a mean error of 2.99 mm at the hand keypoints and comparable performance on three task-specific metrics: the distance, velocity, and acceleration of the relative movement of the two fingertips. Results on the open-source datasets demonstrated the effectiveness of the proposed method, and results on our finger tapping dataset validated its potential for applications in functional task characterization.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Residual Attention Regression for 3D Hand Pose Estimation
    Li, Jing
    Zhang, Long
    Ju, Zhaojie
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT IV, 2019, 11743 : 605 - 614
  • [2] Dense 3D Regression for Hand Pose Estimation
    Wan, Chengde
    Probst, Thomas
    Van Gool, Luc
    Yao, Angela
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5147 - 5156
  • [3] Attention-Based Pose Sequence Machine for 3D Hand Pose Estimation
    Guo, Fangtai
    He, Zaixing
    Zhang, Shuyou
    Zhao, Xinyue
    Tan, Jianrong
    [J]. IEEE ACCESS, 2020, 8 : 18258 - 18269
  • [4] AWR: Adaptive Weighting Regression for 3D Hand Pose Estimation
    Huang, Weiting
    Ren, Pengfei
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11061 - 11068
  • [5] PEAN: 3D Hand Pose Estimation Adversarial Network
    Sun, Linhui
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1251 - 1258
  • [6] CASCADED POINT NETWORK FOR 3D HAND POSE ESTIMATION
    Dou, Yikun
    Wang, Xuguang
    Zhu, Yuying
    Deng, Xiaoming
    Ma, Cuixia
    Chang, Liang
    Wang, Hongan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1982 - 1986
  • [7] Dynamic Graph CNN with Attention Module for 3D Hand Pose Estimation
    Jiang, Xu
    Ma, Xiaohong
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT I, 2019, 11554 : 87 - 96
  • [8] Differentiable Spatial Regression: A Novel Method for 3D Hand Pose Estimation
    Zhang, Xingyuan
    Zhang, Fuhai
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 166 - 176
  • [9] Point-to-Point Regression PointNet for 3D Hand Pose Estimation
    Ge, Liuhao
    Ren, Zhou
    Yuan, Junsong
    [J]. COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 489 - 505
  • [10] Regression-based 3D Hand Pose Estimation using Heatmaps
    Bandi, Chaitanya
    Thomas, Ulrike
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 636 - 643