Mask-Pose Cascaded CNN for 2D Hand Pose Estimation From Single Color Image

Cited by: 67
Authors
Wang, Yangang [1, 2]
Peng, Cong [3]
Liu, Yebin [4]
Affiliations
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Sch Automat, Nanjing 211106, Jiangsu, Peoples R China
[4] Tsinghua Univ, Dept Automat, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Two dimensional displays; Pose estimation; Three-dimensional displays; Color; Image segmentation; Heating systems; Convolutional neural networks; Hand pose estimation; cascaded CNN; mask prediction;
DOI
10.1109/TCSVT.2018.2879980
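Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline Classification Codes
0808; 0809
Abstract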
We present a cascaded convolutional neural network for 2D hand pose estimation from single in-the-wild RGB images. Inspired by the silhouette information commonly used in generative pose estimation approaches, we build the cascaded network with two stages: a mask prediction stage and a pose estimation stage. We find that, when the two-stage architecture is trained end-to-end, the two stages benefit each other in detecting the hand mask and the 2D pose. To further improve hand pose detection accuracy, we contribute a new RGB hand dataset named OneHand10K, which contains 10K RGB images, each containing a single hand. We manually obtained the segmentation masks and labeled keypoints for guided learning. We hope that this dataset will serve as a benchmark and encourage more research on this challenging topic. Experiments on the validation dataset demonstrate the superior performance of the proposed cascaded convolutional neural network.
Pages: 3258-3268
Page count: 11
Related papers
50 in total
  • [31] Hierarchical topology based hand pose estimation from a single depth image
    Ji, Yanli
    Li, Haoxin
    Yang, Yang
    Li, Shuying
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (09) : 10553 - 10568
  • [32] 3D hand pose estimation from a single RGB image by weighting the occlusion and classification
    Mahdikhanlou, Khadijeh
    Ebrahimnezhad, Hossein
    [J]. PATTERN RECOGNITION, 2023, 136
  • [33] Occlusion-Robust 3D Hand Pose Estimation from a Single RGB Image
    Ishii, Asuka
    Nakano, Gaku
    Inoshita, Tetsuo
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021
  • [34] Recurrent 3D Hand Pose Estimation Using Cascaded Pose-Guided 3D Alignments
    Deng, Xiaoming
    Zuo, Dexin
    Zhang, Yinda
    Cui, Zhaopeng
    Cheng, Jian
    Tan, Ping
    Chang, Liang
    Pollefeys, Marc
    Fanello, Sean
    Wang, Hongan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 932 - 945
  • [35] HMTNet: 3D Hand Pose Estimation From Single Depth Image Based on Hand Morphological Topology
    Zhou, Weiguo
    Jiang, Xin
    Chen, Chen
    Mei, Sijia
    Liu, Yun-Hui
    [J]. IEEE SENSORS JOURNAL, 2020, 20 (11) : 6004 - 6011
  • [36] 6D Pose Estimation Based on Multiple Appearance Features from Single Color Image
    Pan, Wang
    Zhu, Feng
    Hao, Yingming
    Zhang, Limin
    [J]. 2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 406 - 411
  • [37] Of Mice and Pose: 2D Mouse Pose Estimation from Unlabelled Data and Synthetic Prior
    Sosa, Jose
    Perry, Sharn
    Alty, Jane
    Hogg, David
    [J]. COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 125 - 136
  • [38] 2D Human pose estimation: a survey
    Chen, Haoming
    Feng, Runyang
    Wu, Sifan
    Xu, Hao
    Zhou, Fengcheng
    Liu, Zhenguang
    [J]. MULTIMEDIA SYSTEMS, 2023, 29 : 3115 - 3138
  • [39] 2D Human pose estimation: a survey
    Chen, Haoming
    Feng, Runyang
    Wu, Sifan
    Xu, Hao
    Zhou, Fengcheng
    Liu, Zhenguang
    [J]. MULTIMEDIA SYSTEMS, 2023, 29 (05) : 3115 - 3138
  • [40] Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image
    Zhang, Baowen
    Wang, Yangang
    Deng, Xiaoming
    Zhang, Yinda
    Tan, Ping
    Ma, Cuixia
    Wang, Hongan
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11334 - 11343