A Pipeline for Hand 2-D Keypoint Localization Using Unpaired Image to Image Translation

Cited by: 8
Authors
Farahanipad, Farnaz [1]
Rezaei, Mohammad [1]
Dillhoff, Alex [1]
Kamangar, Farhad [1]
Athitsos, Vassilis [1]
Affiliation
[1] Univ Texas Arlington, Arlington, TX 76019 USA
Keywords
2-D hand pose estimation; fingertip detection and localization; generative adversarial networks; human-computer interaction; domain transfer;
DOI
10.1145/3453892.3453904
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Hand pose estimation is receiving growing attention in areas such as Human-Computer Interaction and Sign Language Recognition. A fundamental step in accurately estimating the hand pose is detecting and localizing fingertips in an image. Despite recent progress in 2-D hand pose estimation, accurate and robust detection and localization of fingertips remains a challenging task because fingertips appear at low resolution in images and lighting conditions vary. Inspired by the progress of the Generative Adversarial Network (GAN) and image-style transfer, we propose a two-stage pipeline that accurately localizes fingertip positions on depth images, even under varying lighting and severe self-occlusion. The idea is to use a Cycle-consistent Generative Adversarial Network (Cycle-GAN) to apply unpaired image-to-image translation: given a real depth image, it generates a depth image with colored predictions on the fingertips, wrist, and palm. The model is trained in a semi-supervised manner using a collection of images from the source and target domains that do not need to be related in any way. Then, by applying color segmentation techniques, we localize the center of each colored area, which yields the location of each fingertip along with the centers of the wrist and the palm. The proposed method achieves visually promising results on noisy depth images captured using the Microsoft Kinect. Experiments on the challenging NYU hand dataset demonstrate that our approach not only generates plausible samples but also outperforms state-of-the-art approaches on 2-D fingertip estimation by a significant margin, even in the presence of severe self-occlusion and varying lighting conditions. Moreover, this method detects fingertips irrespective of user orientation.
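The second stage described in the abstract (color segmentation followed by locating the center of each colored area) can be illustrated with a minimal sketch. This is not the authors' implementation: the marker palette `KEYPOINT_COLORS`, the function name `localize_keypoints`, and the `tol` and `min_pixels` thresholds are all hypothetical, since the abstract does not state which colors or segmentation technique are used. The sketch simply thresholds each pixel's distance to a reference color and takes the centroid of the matching pixels.

```python
import numpy as np

# Hypothetical marker colors (RGB). The abstract does not give the actual
# palette; a full setup would have one color per fingertip, wrist, and palm.
KEYPOINT_COLORS = {
    "thumb_tip": (255, 0, 0),
    "index_tip": (0, 255, 0),
    "palm": (0, 0, 255),
}


def localize_keypoints(image, colors=KEYPOINT_COLORS, tol=40.0, min_pixels=5):
    """Return {name: (row, col)} centroids of pixels close to each marker color.

    image: H x W x 3 uint8 array, assumed to be the generator's colored output.
    tol: assumed maximum Euclidean RGB distance for a pixel to match a color.
    min_pixels: assumed minimum blob size; smaller matches are ignored.
    """
    pixels = image.astype(np.float32)
    keypoints = {}
    for name, rgb in colors.items():
        # Per-pixel Euclidean distance to the reference color.
        dist = np.linalg.norm(pixels - np.array(rgb, dtype=np.float32), axis=-1)
        rows, cols = np.nonzero(dist < tol)
        if rows.size < min_pixels:
            continue  # this marker is not visible in the image
        keypoints[name] = (float(rows.mean()), float(cols.mean()))
    return keypoints


# Usage on a synthetic gray image with two colored patches.
img = np.full((64, 64, 3), 128, dtype=np.uint8)
img[10:15, 20:25] = (255, 0, 0)   # stand-in for a thumb-tip prediction
img[40:45, 30:35] = (0, 0, 255)   # stand-in for a palm prediction
result = localize_keypoints(img)
print(result)
```

A plain RGB distance threshold is only one possible choice; thresholding in HSV space or running connected-component analysis would be reasonable alternatives when the generated colors are noisy.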
Pages: 226-233
Number of pages: 8