Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation

被引:1
|
作者
Khaleghi, Leyla [1 ,2 ]
Marshall, Joshua [1 ,2 ]
Etemad, Ali [1 ,2 ]
机构
[1] Queens Univ Kingston, Dept ECE, Kingston, ON, Canada
[2] Queens Univ Kingston, Ingenu Labs, Res Inst, Kingston, ON, Canada
关键词
D O I
10.1109/ICPR56361.2022.9955633
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D hand pose estimation (HPE) is the process of locating the joints of the hand in 3D from any visual input. HPE has recently received an increased amount of attention due to its key role in a variety of human-computer interaction applications. Recent HPE methods have demonstrated the advantages of employing videos or multi-view images, allowing for more robust HPE systems. Accordingly, in this study, we propose a new method to perform Sequential learning with Transformer for Hand Pose (SeTHPose) estimation. Our SeTHPose pipeline begins by extracting visual embeddings from individual hand images. We then use a transformer encoder to learn the sequential context along time or viewing angles and generate accurate 21) hand joint locations. Then, a graph convolutional neural network with a U-Net configuration is used to convert the 2D hand joint locations to 3D poses. Our experiments show that SeTHPose performs well on both hand sequence varieties, temporal and angular. Also, SeTHPose outperforms other methods in the lield to achieve new state-of-the-art results on two public available sequential datasets, STB and MuViHand.
引用
收藏
页码:535 / 541
页数:7
相关论文
共 50 条
  • [31] Enhancing 3D hand pose estimation using SHaF: synthetic hand dataset including a forearm
    Lee, Jeongho
    Kim, Jaeyun
    Kim, Seon Ho
    Choi, Sang-Il
    APPLIED INTELLIGENCE, 2024, 54 (20) : 9565 - 9578
  • [32] Two-hand Global 3D Pose Estimation using Monocular RGB
    Lin, Fanqing
    Wilhelm, Connor
    Martinez, Tony
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2372 - 2380
  • [33] Simultaneous 3D hand detection and pose estimation using single depth images
    Zhang, Yu
    Mi, Siya
    Wu, Jianxin
    Geng, Xin
    PATTERN RECOGNITION LETTERS, 2020, 140 (140) : 43 - 48
  • [34] 3D Hand Pose Estimation Using Semantic Dynamic Hypergraph Convolutional Networks
    Wu, Yalei
    Li, Jinghua
    Kong, Dehui
    Li, Qianxing
    Yin, Baocai
    Journal of Shanghai Jiaotong University (Science), 2024,
  • [35] 3D Hand Pose Estimation from Single Depth Images with Label Distribution Learning
    Xu, Yuanfei
    Wang, Xupeng
    2020 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2020,
  • [36] 3D Hand Pose Estimation with a Single Infrared Camera via Domain Transfer Learning
    Park, Gabyong
    Kim, Tae-Kyun
    Woo, Woontack
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2020), 2020, : 588 - 599
  • [37] Dual-Path Transformer for 3D Human Pose Estimation
    Zhou, Lu
    Chen, Yingying
    Wang, Jinqiao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3260 - 3270
  • [38] DGFormer: Dynamic graph transformer for 3D human pose estimation
    Chen Z.
    Dai J.
    Bai J.
    Pan J.
    Pattern Recognition, 2024, 152
  • [39] 3D Human Pose Estimation With Adversarial Learning
    Meng, Wenming
    Hu, Tao
    Shuai, Li
    2019 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV), 2019, : 93 - 99
  • [40] HOT-Net: Non-Autoregressive Transformer for 3D Hand-Object Pose Estimation
    Huang, Lin
    Tan, Jianchao
    Meng, Jingjing
    Liu, Ji
    Yuan, Junsong
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3136 - 3145