Image-free Domain Generalization via CLIP for 3D Hand Pose Estimation

被引:7
|
作者
Lee, Seongyeong [1 ,2 ]
Park, Hansoo [1 ]
Kim, Dong Uk [1 ]
Kim, Jihyeon [1 ]
Boboev, Muhammadjon [1 ]
Baek, Seungryul [1 ]
机构
[1] UNIST, Ulsan, South Korea
[2] NC Soft, Seongnam, South Korea
关键词
D O I
10.1109/WACV56688.2023.00295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGB-based 3D hand pose estimation has been successful for decades thanks to large-scale databases and deep learning. However, the hand pose estimation network does not operate well for hand pose images whose characteristics are far different from the training data. This is caused by various factors such as illuminations, camera angles, diverse backgrounds in the input images, etc. Many existing methods tried to solve it by supplying additional large-scale unconstrained/target domain images to augment data space; however collecting such large-scale images takes a lot of labors. In this paper, we present a simple image-free domain generalization approach for the hand pose estimation framework that uses only source domain data. We try to manipulate the image features of the hand pose estimation network by adding the features from text descriptions using the CLIP (Contrastive Language-Image Pre-training) model. The manipulated image features are then exploited to train the hand pose estimation network via the contrastive learning framework. In experiments with STB and RHD datasets, our algorithm shows improved performance over the state-of-the-art domain generalization approaches.
引用
收藏
页码:2933 / 2943
页数:11
相关论文
共 50 条
  • [41] 3D Hand Shape and Pose Estimation based on 2D Hand Keypoints
    Drosakis, Drosakis
    Argyros, Antonis
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2023, 2023, : 148 - 153
  • [42] Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation
    Moon, Gyeongsik
    Choi, Hongsuk
    Lee, Kyoung Mu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2307 - 2316
  • [43] Domain-Translated 3D Object Pose Estimation
    Papaioannidis, Christos
    Mygdalis, Vasileios
    Pitas, Ioannis
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9279 - 9291
  • [44] Unsupervised Domain Adaptation for 3D Human Pose Estimation
    Zhang, Xiheng
    Wong, Yongkang
    Kankanhalli, Mohan S.
    Geng, Weidong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 926 - 934
  • [45] Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey
    Ohkawa, Takehiko
    Furuta, Ryosuke
    Sato, Yoichi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (12) : 3193 - 3206
  • [46] Estimation of 3D human hand poses with structured pose prior
    Guo, Fangtai
    He, Zaixing
    Zhang, Shuyou
    Zhao, Xinyue
    IET COMPUTER VISION, 2019, 13 (08) : 683 - 690
  • [47] Single-Frame Indexing for 3D Hand Pose Estimation
    Carley, Cassandra
    Tomasi, Carlo
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 493 - 501
  • [48] A Normalization Strategy for Weakly Supervised 3D Hand Pose Estimation
    Guo, Zizhao
    Li, Jinkai
    Tan, Jiyong
    APPLIED SCIENCES-BASEL, 2024, 14 (09):
  • [49] Hand-eye 3D Pose Estimation for a Drawing Robot
    Sultan, Malik Saad
    Chen, Xiaopeng
    Ma, Gan
    Xue, Jingtao
    Ni, Wencheng
    Zhang, Tongtong
    Zhang, Wen
    2013 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2013, : 1325 - 1331
  • [50] AWR: Adaptive Weighting Regression for 3D Hand Pose Estimation
    Huang, Weiting
    Ren, Pengfei
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11061 - 11068