CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation

Cited by: 5

Authors:

Sun, Xiaojing [1]

Wang, Bin [1]

Huang, Longxiang [2]

Zhang, Qian [1]

Zhu, Sulei [1]

Ma, Yan [1]

Affiliations:

[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China

[2] Shenzhen Guangjian Technol Co Ltd, Shanghai 200135, Peoples R China

Source:

SENSORS | 2021 / Vol. 21 / Issue 18

Keywords:

hand pose estimation; convolutional neural network; RGBD fusion; 3D HAND;

DOI:

10.3390/s21186095

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification codes:

070302 ; 081704 ;

Abstract:

Despite recent successes in hand pose estimation from RGB images or depth maps, inherent challenges remain. RGB-based methods suffer from heavy self-occlusion and depth ambiguity. Depth sensors depend heavily on distance and can only be used indoors, which limits the practical application of depth-based methods. These challenges inspired us to combine the two modalities so that each offsets the shortcomings of the other. In this paper, we propose CrossFuNet, a novel RGB and depth fusion network that improves the accuracy of 3D hand pose estimation. Specifically, the RGB image and the paired depth map are fed into two separate subnetworks. The resulting feature maps are combined in a fusion module, for which we propose a new approach to merging information from the two modalities. The 3D key-points are then regressed from heatmaps using a common method. We validate our model on two public datasets, and the results show that it outperforms state-of-the-art methods.
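
The pipeline outlined in the abstract (two feature streams, a fusion step, then key-point read-out from heatmaps) can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration: the function names `fuse_features` and `decode_heatmaps` are hypothetical, channel-wise concatenation is only a placeholder and is not the fusion module the paper proposes, and the decoder returns 2D peak locations only, leaving out the depth coordinate needed for full 3D key-points.

```python
import numpy as np


def fuse_features(rgb_feat, depth_feat):
    """Placeholder fusion: concatenate the two feature maps along channels.

    Assumption for illustration only; this is NOT the paper's fusion module.
    Both inputs have shape (C, H, W) with matching H and W.
    """
    return np.concatenate([rgb_feat, depth_feat], axis=0)


def decode_heatmaps(heatmaps):
    """Return the (x, y) peak location of each key-point heatmap.

    heatmaps: array of shape (K, H, W), one 2D heatmap per key-point.
    Only the 2D location is recovered; the depth coordinate is omitted.
    """
    k, h, w = heatmaps.shape
    # Index of the maximum response in each flattened heatmap.
    flat_idx = heatmaps.reshape(k, -1).argmax(axis=1)
    # Convert flat indices back to (row, column) coordinates.
    ys, xs = np.unravel_index(flat_idx, (h, w))
    return np.stack([xs, ys], axis=1)


# Toy usage with random features and two synthetic heatmaps.
rgb_feat = np.random.rand(4, 8, 8)
depth_feat = np.random.rand(2, 8, 8)
fused = fuse_features(rgb_feat, depth_feat)  # shape (6, 8, 8)

heatmaps = np.zeros((2, 8, 8))
heatmaps[0, 3, 5] = 1.0  # peak at x=5, y=3
heatmaps[1, 7, 0] = 1.0  # peak at x=0, y=7
keypoints = decode_heatmaps(heatmaps)
```

In a real network the heatmaps would be predicted from the fused features by learned layers; the sketch only shows the data flow and the peak read-out step.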

Pages: 17
