PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation

被引:4
|
作者
Liu, Hanbing [1 ]
He, Jun-Yan [2 ]
Cheng, Zhi-Qi [3 ]
Xiang, Wangmeng [2 ]
Yang, Qize [2 ]
Chai, Wenhao [4 ]
Wang, Gaoang [5 ]
Bao, Xu [2 ]
Luo, Bin [2 ]
Geng, Yifeng [2 ]
Xie, Xuansong [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Washington, Seattle, WA 98195 USA
[5] Zhejiang Univ, Hangzhou, Peoples R China
关键词
3D human pose estimation; diffusion model; domain-adaptation; multi-hypothesis; Low-Rank adaptation;
D O I
10.1145/3581783.3612368
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current 3D human pose estimators face challenges in adapting to new datasets due to the scarcity of 2D-3D pose pairs in target domain training sets. We present the Multi-Hypothesis Pose Synthesis Domain Adaptation (PoSynDA) framework to overcome this issue without extensive target domain annotation. Utilizing a diffusion-centric structure, PoSynDA simulates the 3D pose distribution in the target domain, filling the data diversity gap. By incorporating a multi-hypothesis network, it creates diverse pose hypotheses and aligns them with the target domain. Target-specific source augmentation obtains the target domain distribution data from the source domain by decoupling the scale and position parameters. The teacher-student paradigm and low-rank adaptation further refine the process. PoSynDA demonstrates competitive performance on benchmarks, such as Human3.6M, MPI-INF-3DHP, and 3DPW, even comparable with the target-trained MixSTE model [66]. This work paves the way for the practical application of 3D human pose estimation.(1)
引用
下载
收藏
页码:5542 / 5551
页数:10
相关论文
共 50 条
  • [1] MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation
    Li, Wenhao
    Liu, Hong
    Tang, Hao
    Wang, Pichao
    Van Gool, Luc
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13137 - 13146
  • [2] Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation
    Shan, Wenkang
    Liu, Zhenhua
    Zhang, Xinfeng
    Wang, Zhao
    Han, Kai
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14715 - 14725
  • [3] Multi-hypothesis representation learning for transformer-based 3D human pose estimation
    Li, Wenhao
    Liu, Hong
    Tang, Hao
    Wang, Pichao
    PATTERN RECOGNITION, 2023, 141
  • [4] Unsupervised Domain Adaptation for 3D Human Pose Estimation
    Zhang, Xiheng
    Wong, Yongkang
    Kankanhalli, Mohan S.
    Geng, Weidong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 926 - 934
  • [5] DBMHT: A double-branch multi-hypothesis transformer for 3D human pose estimation in video
    Xiang, Xuezhi
    Li, Xiaoheng
    Bao, Weijie
    Qiaoa, Yulong
    El Saddik, Abdulmotaleb
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [6] DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion Models
    Holmquist, Karl
    Wandt, Bastian
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15931 - 15941
  • [7] Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video
    Sun, Cheng
    Thomas, Diego
    Kawasaki, Hiroshi
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5959 - 5964
  • [8] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
    Chai, Wenhao
    Jiang, Zhongyu
    Hwang, Jenq-Neng
    Wang, Gaoang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14609 - 14619
  • [9] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Liu, Shuangjun
    Sehgal, Naveen
    Ostadabbas, Sarah
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
  • [10] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Shuangjun Liu
    Naveen Sehgal
    Sarah Ostadabbas
    Applied Intelligence, 2022, 52 : 14491 - 14506