Dual Regression for Efficient Hand Pose Estimation

被引:7
|
作者
Wei, Dong [1 ]
An, Shan [1 ]
Zhang, Xiajie [1 ]
Tian, Jiayi [1 ]
Tsintotas, Konstantinos A. [2 ]
Gasteratos, Antonios [2 ]
Zhu, Haogang [3 ]
机构
[1] JD COM Inc, Tech Data Ctr, Beijing 100108, Peoples R China
[2] Democritus Univ Thrace, Dept Prod & Management Engn, Xanthi 67132, Greece
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/ICRA46639.2022.9812217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hand pose estimation constitutes prime attainment for human-machine interaction-based applications. Real-time operation is vital in such tasks. Thus, a reliable estimator should exhibit low computational complexity and high precision at the same time. Previous works have explored the regression techniques, including the coordinate regression and heatmap regression methods. Primarily incorporating ideas from them, in this paper, we propose a novel, fast and accurate method for hand pose estimation, which adopts a lightweight network architecture and a post-processing scheme. Hence, our architecture uses a Dual Regression strategy, consisting of two regression branches, namely the coordinate and the heatmap ones, and we refer to the proposed method as DRHand. By carefully selecting the branches' characteristics, the proposed structure has been designed to exploit the benefits of the two methods mentioned above while impoverishing their weaknesses to some extent. The two branches are supervised separately during training, and a post-processing module estimates their outputs to boost reliability. This way, our novel pipeline is considerably faster, reaching 44.39 frames-per-second on an NVIDIA Jetson TX2 graphics processing unit, offering a beyond real-time performance for any custom robotics application. Lastly, extensive experiments conducted on two publicly-available datasets demonstrate that the proposed framework outperforms previous state-of-the-art techniques and can generalize on various hand pose scenarios.
引用
收藏
页码:6423 / 6429
页数:7
相关论文
共 50 条
  • [1] Dense 3D Regression for Hand Pose Estimation
    Wan, Chengde
    Probst, Thomas
    Van Gool, Luc
    Yao, Angela
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5147 - 5156
  • [2] Guided optimisation through classification and regression for hand pose estimation
    Krejov, Philip
    Gilbert, Andrew
    Bowden, Richard
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 155 : 124 - 138
  • [3] Multibranch Adversarial Regression for Domain Adaptative Hand Pose Estimation
    Jin, Rui
    Zhang, Jing
    Yang, Jianyu
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6125 - 6136
  • [4] Cascaded Hand Pose Regression
    Sun, Xiao
    Wei, Yichen
    Liang, Shuang
    Tang, Xiaoou
    Sun, Jian
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 824 - 832
  • [5] Residual Attention Regression for 3D Hand Pose Estimation
    Li, Jing
    Zhang, Long
    Ju, Zhaojie
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT IV, 2019, 11743 : 605 - 614
  • [6] Improve Regression Network on Depth Hand Pose Estimation With Auxiliary Variable
    Xu, Lu
    Hu, Chen
    Tao, Jian
    Xue, Jianru
    Mei, Kuizhi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 890 - 904
  • [7] Efficient Hand Pose Estimation from a Single Depth Image
    Xu, Chi
    Cheng, Li
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3456 - 3462
  • [8] Efficient Multimodal Fusion for Hand Pose Estimation With Hourglass Network
    Hoang, Dinh-Cuong
    Xuan Tan, Phan
    Pham, Duc-Long
    Pham, Hai-Nam
    Bui, Son-Anh
    Nguyen, Chi-Minh
    Phi, An-Binh
    Tran, Khanh-Duong
    Trinh, Viet-Anh
    Tran, van-Duc
    Tran, Duc-Thanh
    Duong, van-Hiep
    Phan, Khanh-Toan
    Nguyen, van-Thiep
    Vu, van-Duc
    Nguyen, Thu-Uyen
    [J]. IEEE ACCESS, 2024, 12 : 113810 - 113825
  • [9] Computationally Efficient Regression on a Dependency Graph for Human Pose Estimation
    Hara, Kota
    Chellappa, Rama
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3390 - 3397
  • [10] RCENet: an efficient pose estimation network based on regression correction
    Su, Shuzhi
    She, Benjie
    Zhu, Yanmin
    Fang, Xianjin
    Xu, Yang
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (05)