Mask-Pose Cascaded CNN for 2D Hand Pose Estimation From Single Color Image

被引:67
|
作者
Wang, Yangang [1 ,2 ]
Peng, Cong [3 ]
Liu, Yebin [4 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Sch Automat, Nanjing 211106, Jiangsu, Peoples R China
[4] Tsinghua Univ, Dept Automat, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Two dimensional displays; Pose estimation; Three-dimensional displays; Color; Image segmentation; Heating systems; Convolutional neural networks; Hand pose estimation; cascaded CNN; mask prediction;
D O I
10.1109/TCSVT.2018.2879980
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a cascaded convolutional neural network for 2D hand pose estimation from single in-the-wild RGB images. Inspired by the commonly used silhouette information in the generative pose estimation approaches, we build the cascaded network with two stages, including mask prediction stage as well as pose estimation stage. We find that the two stages network architecture for end-to-end training could benefit from each other for detecting the hand mask and 2D pose. To further improve the hand pose detection accuracy, we contribute a new RGB hand dataset named OneHand10K, which contains 10K RGB images. Each image contains one single hand. We manually obtain the segmented mask and labeled keypoints for guided learning. We hope that this dataset will be a benchmark and encourage more people to conduct research on this challenging topic. Experiments on the validation dataset have demonstrated the superior performance of the proposed cascaded convolutional neural network.
引用
收藏
页码:3258 / 3268
页数:11
相关论文
共 50 条
  • [21] CASCADED POINT NETWORK FOR 3D HAND POSE ESTIMATION
    Dou, Yikun
    Wang, Xuguang
    Zhu, Yuying
    Deng, Xiaoming
    Ma, Cuixia
    Chang, Liang
    Wang, Hongan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1982 - 1986
  • [22] Enhancement of human 3D pose estimation using a novel concept of depth prediction with pose alignment from a single 2D image?
    Kushwaha, Mohit
    Choudhary, Jaytrilok
    Singh, Dhirendra Pratap
    [J]. COMPUTERS & GRAPHICS-UK, 2022, 107 : 172 - 185
  • [23] 3D interacting hand pose and shape estimation from a single RGB image
    Gao, Chengying
    Yang, Yujia
    Li, Wensheng
    [J]. NEUROCOMPUTING, 2022, 474 : 25 - 36
  • [24] Nonparametric Structure Regularization Machine for 2D Hand Pose Estimation
    Chen, Yifei
    Ma, Haoyu
    Kong, Deying
    Yan, Xiangyi
    Wu, Jianbao
    Fan, Wei
    Xie, Xiaohui
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 370 - 379
  • [25] Efficient 2D Keypoint-based Hand Pose Estimation
    Hsiao, Shan-Chien
    Chiu, Ching-Te
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 1648 - 1652
  • [26] 3D Hand Shape and Pose Estimation based on 2D Hand Keypoints
    Drosakis, Drosakis
    Argyros, Antonis
    [J]. PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2023, 2023, : 148 - 153
  • [27] RoPose: CNN-based 2D Pose Estimation of Industrial Robots
    Gulde, Thomas
    Ludl, Dennis
    Curio, Cristobal
    [J]. 2018 IEEE 14TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2018, : 463 - 470
  • [28] Single Shot Corrective CNN for Anatomically Correct 3D Hand Pose Estimation
    Isaac, Joseph H. R.
    Manivannan, Muniyandi
    Ravindran, Balaraman
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [29] 3D Human Pose Estimation=2D Pose Estimation plus Matching
    Chen, Ching-Hang
    Ramanan, Deva
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5759 - 5767
  • [30] Hierarchical topology based hand pose estimation from a single depth image
    Yanli Ji
    Haoxin Li
    Yang Yang
    Shuying Li
    [J]. Multimedia Tools and Applications, 2018, 77 : 10553 - 10568