HandFormer: Hand pose reconstructing from a single RGB image

被引:0
|
作者
Jiao, Zixun [1 ,2 ]
Wang, Xihan [1 ,2 ]
Li, Jingcao [1 ,2 ]
Gao, Rongxin [1 ,2 ]
He, Miao [1 ,2 ]
Liang, Jiao [1 ,2 ]
Xia, Zhaoqiang [3 ]
Gao, Quanli [1 ,2 ]
机构
[1] State & Local Joint Engn Res Ctr Adv Networking &, Xian 710048, Peoples R China
[2] Xian Polytech Univ, Sch Comp Sci, Informat Serv, Xian 710048, Shaanxi, Peoples R China
[3] Northwestern Polytech Univ, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Hand attitude estimation; Hand attitude estimation and segmentation; Multitasking learning; Multitask progressive transformer framework; Multi-scale features;
D O I
10.1016/j.patrec.2024.05.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a multi -task progressive Transformer framework to reconstruct hand poses from a single RGB image to address challenges such as hand occlusion hand distraction, and hand shape bias. Our proposed framework comprises three key components: the feature extraction branch, palm segmentation branch, and parameter prediction branch. The feature extraction branch initially employs the progressive Transformer to extract multiscale features from the input image. Subsequently, these multi-scale features are fed into a multi-layer perceptron layer (MLP) for acquiring palm alignment features. We employ an efficient fusion module to enhance the parameter prediction further features to integrate the palm alignment features with the backbone features. A dense hand model is generated using a pre-computed articulated mesh deformed hand model. We evaluate the performance of our proposed method on STEREO, FreiHAND, and HO3D datasets separately. The experimental results demonstrate that our approach achieves 3D mean error metrics of 10.92 mm, 12.33 mm and 9.6 mm for the respective datasets.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 50 条
  • [1] Hand Pose Estimation from a Single RGB-D Image
    Kuznetsova, Alina
    Rosenhahn, Bodo
    ADVANCES IN VISUAL COMPUTING, PT II, 2013, 8034 : 592 - 602
  • [2] 3D Hand Shape and Pose Estimation from a Single RGB Image
    Ge, Liuhao
    Ren, Zhou
    Li, Yuncheng
    Xue, Zehao
    Wang, Yingying
    Cai, Jianfei
    Yuan, Junsong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10825 - 10834
  • [3] Back To RGB: Deep Articulated Hand Pose Estimation From a Single Camera Image
    Ma, Wan-Duo Kurt
    Lewis, J. P.
    Frean, Marcus
    Balduzzi, David
    2017 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2017,
  • [4] EGOCENTRIC HAND POSE ESTIMATION AND DISTANCE RECOVERY IN A SINGLE RGB IMAGE
    Liang, Hui
    Yuan, Junsong
    Thalman, Daniel
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [5] 3D interacting hand pose and shape estimation from a single RGB image
    Gao, Chengying
    Yang, Yujia
    Li, Wensheng
    NEUROCOMPUTING, 2022, 474 : 25 - 36
  • [6] Grasp Pose Detection from a Single RGB Image
    Cheng, Hu
    Wang, Yingying
    Meng, Max Q-H
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4686 - 4691
  • [7] A hybrid network for estimating 3D interacting hand pose from a single RGB image
    Bao, Wenxia
    Gao, Qiuyue
    Yang, Xianjun
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3801 - 3814
  • [8] A hybrid network for estimating 3D interacting hand pose from a single RGB image
    Wenxia Bao
    Qiuyue Gao
    Xianjun Yang
    Signal, Image and Video Processing, 2024, 18 : 3801 - 3814
  • [9] 3D hand pose estimation from a single RGB image by weighting the occlusion and classification
    Mahdikhanlou, Khadijeh
    Ebrahimnezhad, Hossein
    PATTERN RECOGNITION, 2023, 136
  • [10] Variational Object-Aware 3-D Hand Pose From a Single RGB Image
    Gao, Yafei
    Wang, Yida
    Falco, Pietro
    Navab, Nassir
    Tombari, Federico
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04): : 4239 - 4246