Image-free Domain Generalization via CLIP for 3D Hand Pose Estimation

被引：7

作者：

Lee, Seongyeong ^{[1
,2
]}

Park, Hansoo ^{[1
]}

Kim, Dong Uk ^{[1
]}

Kim, Jihyeon ^{[1
]}

Boboev, Muhammadjon ^{[1
]}

Baek, Seungryul ^{[1
]}

机构：

[1] UNIST, Ulsan, South Korea

[2] NC Soft, Seongnam, South Korea

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00295

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

RGB-based 3D hand pose estimation has been successful for decades thanks to large-scale databases and deep learning. However, the hand pose estimation network does not operate well for hand pose images whose characteristics are far different from the training data. This is caused by various factors such as illuminations, camera angles, diverse backgrounds in the input images, etc. Many existing methods tried to solve it by supplying additional large-scale unconstrained/target domain images to augment data space; however collecting such large-scale images takes a lot of labors. In this paper, we present a simple image-free domain generalization approach for the hand pose estimation framework that uses only source domain data. We try to manipulate the image features of the hand pose estimation network by adding the features from text descriptions using the CLIP (Contrastive Language-Image Pre-training) model. The manipulated image features are then exploited to train the hand pose estimation network via the contrastive learning framework. In experiments with STB and RHD datasets, our algorithm shows improved performance over the state-of-the-art domain generalization approaches.

引用

页码：2933 / 2943

页数：11

共 50 条

[41] 3D Hand Shape and Pose Estimation based on 2D Hand Keypoints
Drosakis, Drosakis
Argyros, Antonis
PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2023, 2023, : 148 - 153
[42] Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation
Moon, Gyeongsik
Choi, Hongsuk
Lee, Kyoung Mu
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2307 - 2316
[43] Domain-Translated 3D Object Pose Estimation
Papaioannidis, Christos
Mygdalis, Vasileios
Pitas, Ioannis
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9279 - 9291
[44] Unsupervised Domain Adaptation for 3D Human Pose Estimation
Zhang, Xiheng
Wong, Yongkang
Kankanhalli, Mohan S.
Geng, Weidong
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 926 - 934
[45] Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey
Ohkawa, Takehiko
Furuta, Ryosuke
Sato, Yoichi
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (12) : 3193 - 3206
[46] Estimation of 3D human hand poses with structured pose prior
Guo, Fangtai
He, Zaixing
Zhang, Shuyou
Zhao, Xinyue
IET COMPUTER VISION, 2019, 13 (08) : 683 - 690
[47] Single-Frame Indexing for 3D Hand Pose Estimation
Carley, Cassandra
Tomasi, Carlo
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 493 - 501
[48] A Normalization Strategy for Weakly Supervised 3D Hand Pose Estimation
Guo, Zizhao
Li, Jinkai
Tan, Jiyong
APPLIED SCIENCES-BASEL, 2024, 14 (09):
[49] Hand-eye 3D Pose Estimation for a Drawing Robot
Sultan, Malik Saad
Chen, Xiaopeng
Ma, Gan
Xue, Jingtao
Ni, Wencheng
Zhang, Tongtong
Zhang, Wen
2013 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2013, : 1325 - 1331
[50] AWR: Adaptive Weighting Regression for 3D Hand Pose Estimation
Huang, Weiting
Ren, Pengfei
Wang, Jingyu
Qi, Qi
Sun, Haifeng
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11061 - 11068

← 1 2 3 4 5 →