Repeated Cross-Scale Structure-Induced Feature Fusion Network for 2D Hand Pose Estimation

被引:1
|
作者
Guan, Xin [1 ]
Shen, Huan [1 ]
Nyatega, Charles Okanda [2 ]
Li, Qiang [1 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
hand pose estimation; RGB image; self-occluded; multi-layer features; feature fusion;
D O I
10.3390/e25050724
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Recently, the use of convolutional neural networks for hand pose estimation from RGB images has dramatically improved. However, self-occluded keypoint inference in hand pose estimation is still a challenging task. We argue that these occluded keypoints cannot be readily recognized directly from traditional appearance features, and sufficient contextual information among the keypoints is especially needed to induce feature learning. Therefore, we propose a new repeated cross-scale structure-induced feature fusion network to learn about the representations of keypoints with rich information, 'informed' by the relationships between different abstraction levels of features. Our network consists of two modules: GlobalNet and RegionalNet. GlobalNet roughly locates hand joints based on a new feature pyramid structure by combining higher semantic information and more global spatial scale information. RegionalNet further refines keypoint representation learning via a four-stage cross-scale feature fusion network, which learns shallow appearance features induced by more implicit hand structure information, so that when identifying occluded keypoints, the network can use augmented features to better locate the positions. The experimental results show that our method outperforms the state-of-the-art methods for 2D hand pose estimation on two public datasets, STB and RHD.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] LCFFNet: A Lightweight Cross-scale Feature Fusion Network for human pose estimation
    Zou, Xuelian
    Bi, Xiaojun
    Neural Networks, 2025, 183
  • [2] SDFPoseGraphNet: Spatial Deep Feature Pose Graph Network for 2D Hand Pose Estimation
    Salman, Sartaj Ahmed
    Zakir, Ali
    Takahashi, Hiroki
    SENSORS, 2023, 23 (22)
  • [3] Multiscale feature fusion network for monocular complex hand pose estimation
    Zhan, Zhi
    Luo, Guang
    ELECTRONICS LETTERS, 2023, 59 (24)
  • [4] Nonparametric Structure Regularization Machine for 2D Hand Pose Estimation
    Chen, Yifei
    Ma, Haoyu
    Kong, Deying
    Yan, Xiangyi
    Wu, Jianbao
    Fan, Wei
    Xie, Xiaohui
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 370 - 379
  • [5] HandFoldingNet: A 3D Hand Pose Estimation Network Using Multiscale-Feature Guided Folding of a 2D Hand Skeleton
    Cheng, Wencan
    Park, Jae Hyun
    Ko, Jong Hwan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11240 - 11249
  • [6] Optimized convolutional pose machine for 2D hand pose estimation
    Pan, Tianhong
    Wang, Zheng
    Fan, Yuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83
  • [7] High-resolution network with an auxiliary channel for 2D hand pose estimation
    Tianhong Pan
    Zheng Wang
    Multimedia Tools and Applications, 2024, 83 : 36683 - 36694
  • [8] High-resolution network with an auxiliary channel for 2D hand pose estimation
    Pan, Tianhong
    Wang, Zheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36683 - 36694
  • [9] Cascaded Deep Graphical Convolutional Neural Network for 2D Hand Pose Estimation
    Salman, Sartaj Ahmed
    Zakir, Ali
    Takahashi, Hiroki
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [10] CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation
    Sun, Xiaojing
    Wang, Bin
    Huang, Longxiang
    Zhang, Qian
    Zhu, Sulei
    Ma, Yan
    SENSORS, 2021, 21 (18)