A Pipeline for Hand 2-D Keypoint Localization Using Unpaired Image to Image Translation

被引:8
|
作者
Farahanipad, Farnaz [1 ]
Rezaei, Mohammad [1 ]
Dillhoff, Alex [1 ]
Kamangar, Farhad [1 ]
Athitsos, Vassilis [1 ]
机构
[1] Univ Texas Arlington, Arlington, TX 76019 USA
关键词
2-D hand pose estimation; fingertip detection and localization; generative adversarial networks; human-computer interaction; domain transfer;
D O I
10.1145/3453892.3453904
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hand pose estimation is getting a lot of attention in many areas such as Human-Computer Interaction and Sign Language Recognition. A fundamental step to accurately estimate the hand pose involves detecting and localizing fingertips in an image. Despite the progress of 2-D hand pose estimation in recent studies, accurate and robust detection and localization of fingertips still remains a challenging task due to low resolution of a fingertip in images and varying lightning condition. Inspired by the progress of the Generative Adversarial Network (GAN) and image-style transfer, we propose a two-stage pipeline to accurately localize the fingertip position even in varying lighting and severe self occlusion on depth images. The idea is to use a Cycle-consistent Generative Adversarial Network (Cycle-GAN) to apply unpaired image-to-image translation and generate a depth image with colored predictions on the fingertips, wrist, and palm given a real depth image. The model is trained in a semi-supervised manner using a collection of images from source and target domains that do not need to be related in anyway. Then, by applying color segmentation techniques, we localize the center of each colored area which results in finding the location of each fingertip along with center of the wrist and the palm. The proposed method achieves visually promising results on noisy depth images captured using the Microsoft Kinect. Experiments on the challengingNYU hand dataset have demonstrated that our approach not only generates plausible samples, but also outperforms state-of-the-art approaches on 2-D fingertip estimation by a significant margin even in the presence of severe self-occlusion and varying lighting conditions. Moreover, fingertips would be detected irrespective of user orientation using this method.
引用
收藏
页码:226 / 233
页数:8
相关论文
共 50 条
  • [31] Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation
    Chen, Ying-Cong
    Xu, Xiaogang
    Tian, Zhuotao
    Jia, Jiaya
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2403 - 2411
  • [32] UNPAIRED IMAGE-TO-IMAGE SHAPE TRANSLATION ACROSS FASHION DATA
    Wang, Kaili
    Ma, Liqian
    Oramas, Jose M.
    Van Gool, Luc
    Tuytelaars, Tinne
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 206 - 210
  • [33] Unpaired Image-to-Image Translation via Latent Energy Transport
    Zhao, Yang
    Chen, Changyou
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16413 - 16422
  • [34] Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation
    Xu, Yanwu
    Xie, Shaoan
    Wu, Wenhao
    Zhang, Kun
    Gong, Mingming
    Batmanghelich, Kayhan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18290 - 18299
  • [35] One-to-one Mapping for Unpaired Image-to-image Translation
    Shen, Zengming
    Chen, Yifan
    Huang, Thomas S.
    Zhou, S. Kevin
    Georgescu, Bogdan
    Liu, Xuqi
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1159 - 1168
  • [36] Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation
    Cai, Xiuding
    Zhu, Yaoyao
    Miao, Dong
    Fu, Linjie
    Yao, Yu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 891 - 899
  • [37] Cross-Domain Interpolation for Unpaired Image-to-Image Translation
    Lopez, Jorge
    Mauricio, Antoni
    Diaz, Jose
    Camara, Guillermo
    COMPUTER VISION SYSTEMS (ICVS 2019), 2019, 11754 : 542 - 551
  • [38] Hand Hygiene Quality Assessment Using Image-to-Image Translation
    Wang, Chaofan
    Yang, Kangning
    Jiang, Weiwei
    Wei, Jing
    Sarsenbayeva, Zhanna
    Goncalves, Jorge
    Kostakos, Vassilis
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 64 - 73
  • [39] Generating Large Labeled Data Sets for Laparoscopic Image Processing Tasks Using Unpaired Image-to-Image Translation
    Pfeiffer, Micha
    Funke, Isabel
    Robu, Maria R.
    Bodenstedt, Sebastian
    Strenger, Leon
    Engelhardt, Sandy
    Ross, Tobias
    Clarkson, Matthew J.
    Gurusamy, Kurinchi
    Davidson, Brian R.
    Maier-Hein, Lena
    Riediger, Carina
    Welsch, Thilo
    Weitz, Juergen
    Speidel, Stefanie
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT V, 2019, 11768 : 119 - 127
  • [40] Adversarial Inverse Graphics Networks: Learning 2D-to-3D Lifting and Image-to-Image Translation from Unpaired Supervision
    Tung, Hsiao-Yu Fish
    Harley, Adam W.
    Seto, William
    Fragkiadaki, Katerina
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4364 - 4372