Dual Regression for Efficient Hand Pose Estimation

被引:7
|
作者
Wei, Dong [1 ]
An, Shan [1 ]
Zhang, Xiajie [1 ]
Tian, Jiayi [1 ]
Tsintotas, Konstantinos A. [2 ]
Gasteratos, Antonios [2 ]
Zhu, Haogang [3 ]
机构
[1] JD COM Inc, Tech Data Ctr, Beijing 100108, Peoples R China
[2] Democritus Univ Thrace, Dept Prod & Management Engn, Xanthi 67132, Greece
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
基金
国家重点研发计划;
关键词
D O I
10.1109/ICRA46639.2022.9812217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hand pose estimation constitutes prime attainment for human-machine interaction-based applications. Real-time operation is vital in such tasks. Thus, a reliable estimator should exhibit low computational complexity and high precision at the same time. Previous works have explored the regression techniques, including the coordinate regression and heatmap regression methods. Primarily incorporating ideas from them, in this paper, we propose a novel, fast and accurate method for hand pose estimation, which adopts a lightweight network architecture and a post-processing scheme. Hence, our architecture uses a Dual Regression strategy, consisting of two regression branches, namely the coordinate and the heatmap ones, and we refer to the proposed method as DRHand. By carefully selecting the branches' characteristics, the proposed structure has been designed to exploit the benefits of the two methods mentioned above while impoverishing their weaknesses to some extent. The two branches are supervised separately during training, and a post-processing module estimates their outputs to boost reliability. This way, our novel pipeline is considerably faster, reaching 44.39 frames-per-second on an NVIDIA Jetson TX2 graphics processing unit, offering a beyond real-time performance for any custom robotics application. Lastly, extensive experiments conducted on two publicly-available datasets demonstrate that the proposed framework outperforms previous state-of-the-art techniques and can generalize on various hand pose scenarios.
引用
下载
收藏
页码:6423 / 6429
页数:7
相关论文
共 50 条
  • [31] Constrained kernel regression for pose estimation
    Zhang, Haopeng
    Jiang, Zhiguo
    ELECTRONICS LETTERS, 2014, 50 (02) : 77 - 78
  • [32] POSE ESTIMATION BY LOCAL PROCRUSTES REGRESSION
    Raytchev, Bisser
    Terakado, Kazuya
    Tamaki, Toru
    Kaneda, Kazufumi
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [33] Local Regression Based Hourglass Network for Hand Pose Estimation from a Single Depth Image
    Li, Jia
    Wang, Zengfu
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1767 - 1772
  • [34] Hand pose estimation based on regression method from monocular RGB cameras for handling occlusion
    Roumaissa, Bekiri
    Chaouki, Babahenini Mohamed
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 21497 - 21523
  • [35] Hand pose estimation based on regression method from monocular RGB cameras for handling occlusion
    Bekiri Roumaissa
    Babahenini Mohamed Chaouki
    Multimedia Tools and Applications, 2024, 83 : 21497 - 21523
  • [36] Accurate and efficient 3D hand pose regression for robot hand teleoperation using a monocular RGB camera
    Gomez-Donoso, Francisco
    Orts-Escolano, Sergio
    Cazorla, Miguel
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 136 : 327 - 337
  • [37] Efficient Pose: Efficient human pose estimation with neural architecture search
    Wenqiang Zhang
    Jiemin Fang
    Xinggang Wang
    Wenyu Liu
    Computational Visual Media, 2021, 7 (03) : 335 - 347
  • [38] Hierarchical neural network for hand pose estimation
    Chen, Zheng
    Du, Kuo
    Sun, Yi
    Lin, Xiangbo
    Ma, Xiaohong
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 87
  • [39] Proactive Sensing for Improving Hand Pose Estimation
    Hsiao, Dun-Yu
    Sun, Min
    Ballweber, Christy
    Cooper, Seth
    Popovic, Zoran
    34TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2016, 2016, : 2348 - 2352
  • [40] Hand pose estimation using hierarchical detection
    Stenger, B
    Thayananthan, A
    Torr, PHS
    Cipolla, R
    COMPUTER VISION IN HUMAN-COMPUTER INTERACTION, PROCEEDINGS, 2004, 3058 : 105 - 116