A Pipeline for Hand 2-D Keypoint Localization Using Unpaired Image to Image Translation

被引：8

作者：

Farahanipad, Farnaz ^{[1
]}

Rezaei, Mohammad ^{[1
]}

Dillhoff, Alex ^{[1
]}

Kamangar, Farhad ^{[1
]}

Athitsos, Vassilis ^{[1
]}

机构：

[1] Univ Texas Arlington, Arlington, TX 76019 USA

来源：

THE 14TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2021 | 2021年

关键词：

2-D hand pose estimation; fingertip detection and localization; generative adversarial networks; human-computer interaction; domain transfer;

D O I：

10.1145/3453892.3453904

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Hand pose estimation is getting a lot of attention in many areas such as Human-Computer Interaction and Sign Language Recognition. A fundamental step to accurately estimate the hand pose involves detecting and localizing fingertips in an image. Despite the progress of 2-D hand pose estimation in recent studies, accurate and robust detection and localization of fingertips still remains a challenging task due to low resolution of a fingertip in images and varying lightning condition. Inspired by the progress of the Generative Adversarial Network (GAN) and image-style transfer, we propose a two-stage pipeline to accurately localize the fingertip position even in varying lighting and severe self occlusion on depth images. The idea is to use a Cycle-consistent Generative Adversarial Network (Cycle-GAN) to apply unpaired image-to-image translation and generate a depth image with colored predictions on the fingertips, wrist, and palm given a real depth image. The model is trained in a semi-supervised manner using a collection of images from source and target domains that do not need to be related in anyway. Then, by applying color segmentation techniques, we localize the center of each colored area which results in finding the location of each fingertip along with center of the wrist and the palm. The proposed method achieves visually promising results on noisy depth images captured using the Microsoft Kinect. Experiments on the challengingNYU hand dataset have demonstrated that our approach not only generates plausible samples, but also outperforms state-of-the-art approaches on 2-D fingertip estimation by a significant margin even in the presence of severe self-occlusion and varying lighting conditions. Moreover, fingertips would be detected irrespective of user orientation using this method.

引用

页码：226 / 233

页数：8

共 50 条

[41] Image compression using the 2-D wavelet transform
Lewis, A. S.
Knowles, G.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1992, 1 (02) : 244 - 250
[42] Infrared-to-Optical Image Translation for Keypoint-Based Image Registration
Elsaeidy, Mohamed
Erkol, Muhammed Emin
Gunturk, Bahadir Kiirsat
Ates, Hasan Fehmi
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
[43] AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks
Tang, Hao
Liu, Hong
Xu, Dan
Torr, Philip H. S.
Sebe, Nicu
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 1972 - 1987
[44] Mutually Improved Endoscopic Image Synthesis and Landmark Detection in Unpaired Image-to-Image Translation
Sharan, Lalith
Romano, Gabriele
Koehler, Sven
Kelm, Halvar
Karck, Matthias
De Simone, Raffaele
Engelhardt, Sandy
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (01) : 127 - 138
[45] Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation
Zheng, Ziqiang
Bin, Yi
Lv, Xiaoou
Wu, Yang
Yang, Yang
Shen, Heng Tao
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2474 - 2487
[46] Unpaired image-to-image translation with improved two-dimensional feature
Hangyao Tu
Wanliang Wang
Jiachen Chen
Fei Wu
Guoqing Li
Multimedia Tools and Applications, 2022, 81 : 43851 - 43872
[47] Trans-Cycle: Unpaired Image-to-Image Translation Network by Transformer
Tian, Kai
Pan, Mengze
Lu, Zongqing
Liao, Qingmin
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 576 - 587
[48] Enhanced Unpaired Image-to-Image Translation via Transformation in Saliency Domain
Shibasaki, Kei
Ikehara, Masaaki
IEEE ACCESS, 2023, 11 : 137495 - 137505
[49] Domain Bridge for Unpaired Image-to-Image Translation and Unsupervised Domain Adaptation
Pizzati, Fabio
de Charette, Raoul
Zaccaria, Michela
Cerri, Pietro
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2979 - 2987
[50] Multi-feature contrastive learning for unpaired image-to-image translation
Yao Gou
Min Li
Yu Song
Yujie He
Litao Wang
Complex & Intelligent Systems, 2023, 9 : 4111 - 4122

← 1 2 3 4 5 →