CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation

Cited by: 5
Authors
Sun, Xiaojing [1]
Wang, Bin [1]
Huang, Longxiang [2]
Zhang, Qian [1]
Zhu, Sulei [1]
Ma, Yan [1]
Affiliations
[1] Shanghai Normal University, College of Information, Mechanical and Electrical Engineering, Shanghai 200234, People's Republic of China
[2] Shenzhen Guangjian Technology Co., Ltd., Shanghai 200135, People's Republic of China
Keywords
hand pose estimation; convolutional neural network; RGBD fusion; 3D hand
DOI
10.3390/s21186095
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Despite recent successes in hand pose estimation from RGB images or depth maps, inherent challenges remain. RGB-based methods suffer from heavy self-occlusion and depth ambiguity. Depth sensors rely heavily on distance and can only be used indoors, which limits the practical application of depth-based methods. These challenges inspired us to combine the two modalities so that each offsets the shortcomings of the other. In this paper, we propose CrossFuNet, a novel RGB and depth information fusion network that improves the accuracy of 3D hand pose estimation. Specifically, the RGB image and the paired depth map are fed into two separate subnetworks. The resulting feature maps are combined in a fusion module, for which we propose a new approach to merging information from the two modalities. The 3D keypoints are then regressed from heatmaps using a common method. We validate our model on two public datasets, and the results show that it outperforms state-of-the-art methods.
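The pipeline outlined in the abstract (two feature streams, a fusion step, heatmap-based keypoint decoding) can be illustrated with a toy sketch. This is not the CrossFuNet fusion module, whose details the abstract does not give: `fuse_features` uses a plain weighted average as a stand-in, `decode_heatmap` uses a hard argmax on a single 2D map, and both function names and the `rgb_weight` parameter are hypothetical.

```python
# Illustrative sketch only. The fusion rule (weighted average) and all names
# here are assumptions, not the method described in the paper.

def fuse_features(rgb_feat, depth_feat, rgb_weight=0.5):
    """Blend two equally sized 2D feature maps element-wise (placeholder fusion)."""
    same_rows = len(rgb_feat) == len(depth_feat)
    if not same_rows or any(len(r) != len(d) for r, d in zip(rgb_feat, depth_feat)):
        raise ValueError("feature maps must have the same shape")
    return [
        [rgb_weight * r + (1.0 - rgb_weight) * d for r, d in zip(rgb_row, depth_row)]
        for rgb_row, depth_row in zip(rgb_feat, depth_feat)
    ]


def decode_heatmap(heatmap):
    """Return the (row, col) of the strongest response in a 2D heatmap."""
    best_pos = (0, 0)
    best_val = heatmap[0][0]
    for i, row in enumerate(heatmap):
        for j, value in enumerate(row):
            if value > best_val:
                best_val = value
                best_pos = (i, j)
    return best_pos


# Toy 2x2 "feature maps" standing in for the outputs of the two subnetworks.
rgb_map = [[0.0, 0.2], [0.1, 0.9]]
depth_map = [[0.0, 0.4], [0.3, 0.7]]

fused = fuse_features(rgb_map, depth_map)
keypoint = decode_heatmap(fused)
print(keypoint)
```

In a real network the fused tensor would pass through further layers that output one heatmap per joint, and the depth coordinate would be recovered separately; the sketch only shows the data flow for a single 2D map.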
Pages: 17
Related papers
50 in total
  • [1] ON THE FUSION OF RGB AND DEPTH INFORMATION FOR HAND POSE ESTIMATION
    Kazakos, Evangelos
    Nikou, Christophoros
    Kakadiaris, Ioannis A.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 868 - 872
  • [2] Lightweight Cross-Fusion Network on Human Pose Estimation for Edge Device
    Zhu, Xian
    Zeng, Xiaoqin
    Ma, Wei
    IEEE ACCESS, 2023, 11 : 134899 - 134907
  • [3] Hourglass Network for Hand Pose Estimation with RGB Images
    Wang, Qizhi
    Yang, Yonggang
    2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, : 1342 - 1347
  • [4] Two-stage cross-fusion network for stereo event-based depth estimation
    Ghosh, Dipon Kumar
    Jung, Yong Ju
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [5] Efficient Multimodal Fusion for Hand Pose Estimation With Hourglass Network
    Hoang, Dinh-Cuong
    Xuan Tan, Phan
    Pham, Duc-Long
    Pham, Hai-Nam
    Bui, Son-Anh
    Nguyen, Chi-Minh
    Phi, An-Binh
    Tran, Khanh-Duong
    Trinh, Viet-Anh
    Tran, van-Duc
    Tran, Duc-Thanh
    Duong, van-Hiep
    Phan, Khanh-Toan
    Nguyen, van-Thiep
    Vu, van-Duc
    Nguyen, Thu-Uyen
    IEEE ACCESS, 2024, 12 : 113810 - 113825
  • [6] Multi-scale RGB and NIR image Cross-fusion based on Generative Adversarial Network
    Xiang, Sen
    Hu, Zishan
    Deng, Huiping
    Wu, Jin
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4172 - 4177
  • [7] CFEINet: Cross-fusion and feature enhancement interaction network for RGB-D semantic segmentation
    Ge, Bin
    Lu, Yiming
    Xia, Chenxing
    Zhu, Xu
    Zhang, Mengge
    Gao, Mengya
    Chen, Ningjie
    DIGITAL SIGNAL PROCESSING, 2025, 160
  • [8] Improve Regression Network on Depth Hand Pose Estimation With Auxiliary Variable
    Xu, Lu
    Hu, Chen
    Tao, Jian
    Xue, Jianru
    Mei, Kuizhi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 890 - 904
  • [9] Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation
    Liu, Xingyu
    Ren, Pengfei
    Gao, Yuanyuan
    Wang, Jingyu
    Sun, Haifeng
    Qi, Qi
    Zhuang, Zirui
    Liao, Jianxin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3756 - 3764
  • [10] Multiscale feature fusion network for monocular complex hand pose estimation
    Zhan, Zhi
    Luo, Guang
    ELECTRONICS LETTERS, 2023, 59 (24)