HandFormer: Hand pose reconstructing from a single RGB image

被引:0
|
作者
Jiao, Zixun [1 ,2 ]
Wang, Xihan [1 ,2 ]
Li, Jingcao [1 ,2 ]
Gao, Rongxin [1 ,2 ]
He, Miao [1 ,2 ]
Liang, Jiao [1 ,2 ]
Xia, Zhaoqiang [3 ]
Gao, Quanli [1 ,2 ]
机构
[1] State & Local Joint Engn Res Ctr Adv Networking &, Xian 710048, Peoples R China
[2] Xian Polytech Univ, Sch Comp Sci, Informat Serv, Xian 710048, Shaanxi, Peoples R China
[3] Northwestern Polytech Univ, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Hand attitude estimation; Hand attitude estimation and segmentation; Multitasking learning; Multitask progressive transformer framework; Multi-scale features;
D O I
10.1016/j.patrec.2024.05.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a multi -task progressive Transformer framework to reconstruct hand poses from a single RGB image to address challenges such as hand occlusion hand distraction, and hand shape bias. Our proposed framework comprises three key components: the feature extraction branch, palm segmentation branch, and parameter prediction branch. The feature extraction branch initially employs the progressive Transformer to extract multiscale features from the input image. Subsequently, these multi-scale features are fed into a multi-layer perceptron layer (MLP) for acquiring palm alignment features. We employ an efficient fusion module to enhance the parameter prediction further features to integrate the palm alignment features with the backbone features. A dense hand model is generated using a pre-computed articulated mesh deformed hand model. We evaluate the performance of our proposed method on STEREO, FreiHAND, and HO3D datasets separately. The experimental results demonstrate that our approach achieves 3D mean error metrics of 10.92 mm, 12.33 mm and 9.6 mm for the respective datasets.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 50 条
  • [41] 6D Pose Estimation of Transparent Object From Single RGB Image for Robotic Manipulation
    Byambaa, Munkhtulga
    Koutaki, Gou
    Choimaa, Lodoiravsal
    IEEE ACCESS, 2022, 10 : 114897 - 114906
  • [42] InstancePose: Fast 6DoF Pose Estimation for Multiple Objects from a Single RGB Image
    Aing, Lee
    Lie, Wen-Nung
    Chiang, Jui-Chiu
    Lin, Guo-Shiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2621 - 2630
  • [43] Real-Time and Efficient 6-D Pose Estimation From a Single RGB Image
    Cheng, Jun
    Liu, Penglei
    Zhang, Qieshi
    Ma, Hui
    Wang, Fei
    Zhang, Jin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [44] 6DoF Pose Estimation of Transparent Object from a Single RGB-D Image
    Xu, Chi
    Chen, Jiale
    Yao, Mengyang
    Zhou, Jun
    Zhang, Lijun
    Liu, Yi
    SENSORS, 2020, 20 (23) : 1 - 19
  • [45] ON THE FUSION OF RGB AND DEPTH INFORMATION FOR HAND POSE ESTIMATION
    Kazakos, Evangelos
    Nikou, Christophoros
    Kakadiaris, Ioannis A.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 868 - 872
  • [46] Hand Pose Estimation from RGB Images Based on Deep Learning: A Survey
    Liu, Yang
    Jiang, Jie
    Sun, Jiahao
    2021 IEEE 7TH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY (ICVR 2021), 2021, : 82 - 89
  • [47] CFAM: Estimating 3D Hand Poses from a Single RGB Image with Attention
    Wang, Xianghan
    Jiang, Jie
    Guo, Yanming
    Kang, Lai
    Wei, Yingmei
    Li, Dan
    APPLIED SCIENCES-BASEL, 2020, 10 (02):
  • [48] Mask-Pose Cascaded CNN for 2D Hand Pose Estimation From Single Color Image
    Wang, Yangang
    Peng, Cong
    Liu, Yebin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) : 3258 - 3268
  • [49] A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image
    Jiang, Changlong
    Xiao, Yang
    Wu, Cunlin
    Zhang, Mingyang
    Zheng, Jinghong
    Cao, Zhiguo
    Zhou, Joey Tianyi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8846 - 8855
  • [50] Reconstructing 3D Human Pose from RGB-D Data with Occlusions
    Dang, Bowen
    Zhao, Xi
    Zhang, Bowen
    Wang, He
    COMPUTER GRAPHICS FORUM, 2023, 42 (07)