Cross-Domain 3D Hand Pose Estimation with Dual Modalities

被引:4
|
作者
Lin, Qiuxia [1 ]
Yang, Linlin [1 ]
Yao, Angela [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
关键词
D O I
10.1109/CVPR52729.2023.01648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in hand pose estimation have shed light on utilizing synthetic data to train neural networks, which however inevitably hinders generalization to real-world data due to domain gaps. To solve this problem, we present a framework for cross-domain semi-supervised hand pose estimation and target the challenging scenario of learning models from labelled multi-modal synthetic data and unlabelled real-world data. To that end, we propose a dual-modality network that exploits synthetic RGB and synthetic depth images. For pre-training, our network uses multi-modal contrastive learning and attention-fused supervision to learn effective representations of the RGB images. We then integrate a novel self-distillation technique during fine-tuning to reduce pseudo-label noise. Experiments show that the proposed method significantly improves 3D hand pose estimation and 2D keypoint detection on benchmarks.
引用
收藏
页码:17184 / 17193
页数:10
相关论文
共 50 条
  • [1] C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation
    Wu, Cunlin
    Xiao, Yang
    Zhang, Boshen
    Zhang, Mingyang
    Cao, Zhiguo
    Zhou, Joey Tianyi
    [J]. COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 554 - 571
  • [2] Cross-Domain Adaptation for Animal Pose Estimation
    Cao, Jinkun
    Tang, Hongyang
    Fang, Hao-Shu
    Shen, Xiaoyong
    Lu, Cewu
    Tai, Yu-Wing
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9497 - 9506
  • [4] Dense 3D Regression for Hand Pose Estimation
    Wan, Chengde
    Probst, Thomas
    Van Gool, Luc
    Yao, Angela
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5147 - 5156
  • [5] 3D Hand Pose Estimation with Neural Networks
    Antonio Serra, Jose
    Garcia-Rodriguez, Jose
    Orts-Escolano, Sergio
    Manuel Garcia-Chamizo, Juan
    Angelopoulou, Anastassia
    Psarrou, Alexandra
    Mentzelopoulos, Markos
    Montoyo-Bojo, Javier
    Dominguez, Enrique
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2013, 7903 : 504 - +
  • [6] Temporal Hints in 3D Hand Pose Estimation
    Yu, Taidong
    Cao, Zhiguo
    Xiao, Yang
    Zhang, Boshen
    Zhu, Zihao
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2042 - 2047
  • [7] SD-Pose: Semantic Decomposition for Cross-Domain 6D Object Pose Estimation
    Li, Zhigang
    Hu, Yinlin
    Salzmann, Mathieu
    Ji, Xiangyang
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2020 - 2028
  • [8] 3D Hand Pose Estimation with Disentangled Cross-Modal Latent Space
    Gu, Jiajun
    Wang, Zhiyong
    Ouyang, Wanli
    Zhang, Weichen
    Li, Jiafeng
    Zhuo, Li
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 380 - 389
  • [9] Image-free Domain Generalization via CLIP for 3D Hand Pose Estimation
    Lee, Seongyeong
    Park, Hansoo
    Kim, Dong Uk
    Kim, Jihyeon
    Boboev, Muhammadjon
    Baek, Seungryul
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2933 - 2943
  • [10] 3D Hand Pose Estimation with a Single Infrared Camera via Domain Transfer Learning
    Park, Gabyong
    Kim, Tae-Kyun
    Woo, Woontack
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2020), 2020, : 588 - 599