CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation

Cited by: 5
Authors
Sun, Xiaojing [1]
Wang, Bin [1]
Huang, Longxiang [2]
Zhang, Qian [1]
Zhu, Sulei [1]
Ma, Yan [1]
Affiliations
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
[2] Shenzhen Guangjian Technol Co Ltd, Shanghai 200135, Peoples R China
Keywords
hand pose estimation; convolutional neural network; RGBD fusion; 3D hand
DOI
10.3390/s21186095
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Despite recent successes in hand pose estimation from RGB images or depth maps, inherent challenges remain. RGB-based methods suffer from heavy self-occlusion and depth ambiguity. Depth sensors depend heavily on distance and can only be used indoors, which limits the practical application of depth-based methods. These challenges inspired us to combine the two modalities so that each offsets the shortcomings of the other. In this paper, we propose CrossFuNet, a novel RGB and depth information fusion network that improves the accuracy of 3D hand pose estimation. Specifically, the RGB image and its paired depth map are fed into two separate subnetworks. The resulting feature maps are combined in a fusion module, for which we propose a new approach to merging information from the two modalities. The 3D keypoints are then regressed from heatmaps using a common method. We validate our model on two public datasets, and the results show that it outperforms state-of-the-art methods.
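The pipeline outlined in the abstract (two feature streams, a fusion step, then keypoint regression from heatmaps) can be illustrated with a minimal sketch. This is not the CrossFuNet implementation: the abstract does not specify the fusion module, so `fuse_placeholder` is a hypothetical stand-in (a plain weighted average), and `soft_argmax_2d` is one common way to decode a keypoint location from a heatmap, assumed here for illustration. Only the 2D location is decoded; the depth coordinate is omitted.

```python
import math


def fuse_placeholder(rgb_feat, depth_feat, w_rgb=0.5):
    """Hypothetical fusion: per-cell weighted average of two same-sized maps.

    This is a stand-in only; the paper's fusion module is not described
    in the abstract.
    """
    return [
        [w_rgb * r + (1.0 - w_rgb) * d for r, d in zip(rgb_row, depth_row)]
        for rgb_row, depth_row in zip(rgb_feat, depth_feat)
    ]


def soft_argmax_2d(heatmap, beta=50.0):
    """Return the softmax-weighted expected (x, y) position of a heatmap.

    beta sharpens the softmax; larger values approach a hard argmax.
    """
    peak = max(max(row) for row in heatmap)
    # Subtract the peak before exponentiating for numerical stability.
    weights = [[math.exp(beta * (v - peak)) for v in row] for row in heatmap]
    total = sum(sum(row) for row in weights)
    x = sum(w * col for row in weights for col, w in enumerate(row)) / total
    y = sum(w * r for r, row in enumerate(weights) for w in row) / total
    return x, y


# Toy 3x4 maps (rows x columns) that both respond at row 1, column 2.
rgb_map = [[0.0] * 4 for _ in range(3)]
depth_map = [[0.0] * 4 for _ in range(3)]
rgb_map[1][2] = 1.0
depth_map[1][2] = 0.8

fused = fuse_placeholder(rgb_map, depth_map)
keypoint_xy = soft_argmax_2d(fused)
print(keypoint_xy)
```

With both toy maps peaking at the same cell, the decoded location lands very close to column 2, row 1. A soft-argmax is chosen over a hard argmax in this sketch because it stays differentiable, which is why it is often used when heatmap decoding sits inside a trained network.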
Pages: 17