Synthetic-to-Real Pose Estimation with Geometric Reconstruction

被引:0
|
作者
Lin, Qiuxia [1 ]
Gu, Kerui [1 ]
Yang, Linlin [2 ,3 ]
Yao, Angela [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
[2] CUC, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
[3] CUC, Sch Informat & Commun Engn, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pose estimation is remarkably successful under supervised learning, but obtaining annotations, especially for new deployments, is costly and time-consuming. This work tackles adapting models trained on synthetic data to real-world target domains with only unlabelled data. A common approach is model fine-tuning with pseudo-labels from the target domain; yet many pseudo-labelling strategies cannot provide sufficient high-quality pose labels. This work proposes a reconstruction-based strategy as a complement to pseudo-labelling for synthetic-to-real domain adaptation. We generate the driving image by geometrically transforming a base image according to the predicted keypoints and enforce a reconstruction loss to refine the predictions. It provides a novel solution to effectively correct confident yet inaccurate keypoint locations through image reconstruction in domain adaptation. Our approach outperforms the previous state-of-the-arts by 8% for PCK on four large-scale hand and human real-world datasets. In particular, we excel on endpoints such as fingertips and head, with 7.2% and 29.9% improvements in PCK.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Global and Local Texture Randomization for Synthetic-to-Real Semantic Segmentation
    Peng, Duo
    Lei, Yinjie
    Liu, Lingqiao
    Zhang, Pingping
    Liu, Jun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 6594 - 6608
  • [22] Perspective Pose Estimation with Geometric Algebra
    Gebken, Christian
    Sommer, Gerald
    SELECTED PAPERS FROM ICNAAM 2007 AND ICCMSE 2007, 2008, 1046 : 95 - 98
  • [23] Self-Training of Handwritten Word Recognition for Synthetic-to-Real Adaptation
    Wolf, Fabian
    Fink, Gernot A.
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3885 - 3892
  • [24] N-ROD: a Neuromorphic Dataset for Synthetic-to-Real Domain Adaptation
    Cannici, Marco
    Plizzari, Chiara
    Planamente, Mirco
    Ciccone, Marco
    Bottino, Andrea
    Caputo, Barbara
    Matteucci, Matteo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1342 - 1347
  • [25] A Parallel Teacher for Synthetic-to-Real Domain Adaptation of Traffic Object Detection
    Wang, Jiangong
    Shen, Tianyu
    Tian, Yonglin
    Wang, Yutong
    Gou, Chao
    Wang, Xiao
    Yao, Fei
    Sun, Changyin
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (03): : 441 - 455
  • [26] Synthetic-to-Real Domain Adaptation for Action Recognition: A Dataset and Baseline Performances
    Reddy, Arun V.
    Shah, Ketul
    Paul, William
    Mocharla, Rohita
    Hoffman, Judy
    Katyal, Kapil D.
    Manocha, Dinesh
    de Melo, Celso M.
    Chellappa, Rama
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 11374 - 11381
  • [27] Real-time tracking and pose estimation for industrial objects using geometric features
    Yoon, Y
    DeSouza, GN
    Kak, AC
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS, 2003, : 3473 - 3478
  • [28] Deep CG2Real: Synthetic-to-Real Translation via Image Disentanglement
    Bi, Sai
    Sunkavalli, Kalyan
    Perazzi, Federico
    Shechtman, Eli
    Kim, Vladimir G.
    Ramamoorthi, Ravi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2730 - 2739
  • [29] Dual-route synthetic-to-real adaption for single image dehazing
    Qiao, Yingxu
    Huo, Zhanqiang
    Meng, Sensen
    IET IMAGE PROCESSING, 2024, 18 (04) : 926 - 938
  • [30] GeoPose: Dense Reconstruction Guided 6D Object Pose Estimation With Geometric Consistency
    Wang, Deming
    Zhou, Guangliang
    Yan, Yi
    Chen, Huiyi
    Chen, Qijun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4394 - 4408