Mask-Pose Cascaded CNN for 2D Hand Pose Estimation From Single Color Image

Cited by: 67
Authors
Wang, Yangang [1, 2]
Peng, Cong [3]
Liu, Yebin [4]
Affiliations
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Sch Automat, Nanjing 211106, Jiangsu, Peoples R China
[4] Tsinghua Univ, Dept Automat, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Two dimensional displays; Pose estimation; Three-dimensional displays; Color; Image segmentation; Heating systems; Convolutional neural networks; Hand pose estimation; cascaded CNN; mask prediction;
DOI
10.1109/TCSVT.2018.2879980
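Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline Classification Code
0808; 0809
Abstract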
We present a cascaded convolutional neural network for 2D hand pose estimation from single in-the-wild RGB images. Inspired by the silhouette information commonly used in generative pose estimation approaches, we build the cascaded network with two stages: a mask prediction stage and a pose estimation stage. We find that, when the network is trained end to end, the two stages benefit each other in detecting the hand mask and the 2D pose. To further improve hand pose detection accuracy, we contribute a new RGB hand dataset named OneHand10K, which contains 10K RGB images, each showing a single hand. We manually segment the hand masks and label the keypoints for guided learning. We hope that this dataset will serve as a benchmark and encourage more people to conduct research on this challenging topic. Experiments on the validation dataset demonstrate the superior performance of the proposed cascaded convolutional neural network.
Pages: 3258-3268
Number of pages: 11