CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation

被引:5
|
作者
Sun, Xiaojing [1 ]
Wang, Bin [1 ]
Huang, Longxiang [2 ]
Zhang, Qian [1 ]
Zhu, Sulei [1 ]
Ma, Yan [1 ]
机构
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
[2] Shenzhen Guangjian Technol Co Ltd, Shanghai 200135, Peoples R China
关键词
hand pose estimation; convolutional neural network; RGBD fusion; 3D HAND;
D O I
10.3390/s21186095
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Despite recent successes in hand pose estimation from RGB images or depth maps, inherent challenges remain. RGB-based methods suffer from heavy self-occlusions and depth ambiguity. Depth sensors rely heavily on distance and can only be used indoors, thus there are many limitations to the practical application of depth-based methods. The aforementioned challenges have inspired us to combine the two modalities to offset the shortcomings of the other. In this paper, we propose a novel RGB and depth information fusion network to improve the accuracy of 3D hand pose estimation, which is called CrossFuNet. Specifically, the RGB image and the paired depth map are input into two different subnetworks, respectively. The feature maps are fused in the fusion module in which we propose a completely new approach to combine the information from the two modalities. Then, the common method is used to regress the 3D key-points by heatmaps. We validate our model on two public datasets and the results reveal that our model outperforms the state-of-the-art methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Repeated Cross-Scale Structure-Induced Feature Fusion Network for 2D Hand Pose Estimation
    Guan, Xin
    Shen, Huan
    Nyatega, Charles Okanda
    Li, Qiang
    ENTROPY, 2023, 25 (05)
  • [32] Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
    He, Wangyong
    Xie, Zhongzhao
    Li, Yongbo
    Wang, Xinmei
    Cai, Wendi
    SENSORS, 2019, 19 (13)
  • [33] In-Hand Pose Estimation Using Hand-Mounted RGB Cameras and Visuotactile Sensors
    Gao, Yuan
    Matsuoka, Shogo
    Wan, Weiwei
    Kiyokawa, Takuya
    Koyama, Keisuke
    Harada, Kensuke
    IEEE ACCESS, 2023, 11 : 17218 - 17232
  • [34] Efficient Hand Pose Estimation from a Single Depth Image
    Xu, Chi
    Cheng, Li
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3456 - 3462
  • [35] Real Time Hand Pose Estimation using Depth Sensors
    Keskin, Cem
    Kirac, Furkan
    Kara, Yunus Emre
    Akarun, Lale
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [36] Hand Pose Estimation from RGB Images Based on Deep Learning: A Survey
    Liu, Yang
    Jiang, Jie
    Sun, Jiahao
    2021 IEEE 7TH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY (ICVR 2021), 2021, : 82 - 89
  • [37] Review on 3D Hand Pose Estimation Based on a RGB Image
    Xiao Y.
    Liu Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (02): : 161 - 172
  • [38] Pairwise Guided Multilayer Cross-Fusion Network for Bird Image Recognition
    Lei, Jingsheng
    Jin, Yao
    Huang, Liya
    Ji, Yuan
    Yang, Shengying
    ELECTRONICS, 2023, 12 (18)
  • [39] POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition
    Zheng, Ce
    Mendieta, Matias
    Chen, Chen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3138 - 3147
  • [40] LCFFNet: A Lightweight Cross-scale Feature Fusion Network for human pose estimation
    Zou, Xuelian
    Bi, Xiaojun
    NEURAL NETWORKS, 2025, 183