Unsupervised 3D Pose Transfer With Cross Consistency and Dual Reconstruction

被引:8
|
作者
Song, Chaoyue [1 ]
Wei, Jiacheng [2 ]
Li, Ruibo [1 ]
Liu, Fayao [3 ]
Lin, Guosheng [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, S Lab, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] ASTAR, Inst Inforcomm Res, Singapore 138632, Singapore
基金
新加坡国家研究基金会;
关键词
3D pose transfer; as-rigid-as-possible deformation; conditional normalization layer; cross consistency; optimal transport; unsupervised learning; DEFORMATION TRANSFER; TRANSPORT;
D O I
10.1109/TPAMI.2023.3259059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of 3D pose transfer is to transfer the pose from the source mesh to the target mesh while preserving the identity information (e.g., face, body shape) of the targetmesh. Deep learning-based methods improved the efficiency and performance of 3D pose transfer. However, most of them are trained under the supervision of the ground truth, whose availability is limited in real-world scenarios. In this work, we present X-DualNet, a simple yet effective approach that enables unsupervised 3D pose transfer. In X-DualNet, we introduce a generatorGwhich contains correspondence learning and pose transfer modules to achieve 3D pose transfer. We learn the shape correspondence by solving an optimal transport problem without any key point annotations and generate high-quality meshes with our elastic instance normalization (ElaIN) in the pose transfer module. With G as the basic component, we propose a cross consistency learning scheme and a dual reconstruction objective to learn the pose transfer without supervision. Besides that, we also adopt an as-rigid-as-possible deformer in the training process to fine-tune the body shape of the generated results. Extensive experiments on human and animal data demonstrate that our framework can successfully achieve comparable performance as the state-of-the-art supervised approaches.
引用
收藏
页码:10488 / 10499
页数:12
相关论文
共 50 条
  • [1] Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer
    Chen, Haoyu
    Tang, Hao
    Shi, Henglin
    Peng, Wei
    Sebe, Nicu
    Zhao, Guoying
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8610 - 8619
  • [2] Unsupervised 3D Reconstruction Networks
    Cha, Geonho
    Lee, Minsik
    Oh, Songhwai
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3848 - 3857
  • [3] LEARNING POSE-AWARE 3D RECONSTRUCTION VIA 2D-3D SELF-CONSISTENCY
    Liao, Yi-Lun
    Yang, Yao-Cheng
    Lin, Yuan-Fang
    Chen, Pin-Jung
    Kuo, Chia-Wen
    Chiu, Wei-Chen
    Wang, Yu-Chiang Frank
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3857 - 3861
  • [4] Cross-Domain 3D Hand Pose Estimation with Dual Modalities
    Lin, Qiuxia
    Yang, Linlin
    Yao, Angela
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17184 - 17193
  • [5] 3D hand pose reconstruction with ISOSOM
    Guan, HY
    Turk, M
    ADVANCES IN VISUAL COMPUTING, PROCEEDINGS, 2005, 3804 : 630 - 635
  • [6] Consistency constraints and 3D building reconstruction
    Horna, S.
    Meneveaux, D.
    Damiand, G.
    Bertrand, Y.
    COMPUTER-AIDED DESIGN, 2009, 41 (01) : 13 - 27
  • [7] Unsupervised Domain Adaptation for 3D Human Pose Estimation
    Zhang, Xiheng
    Wong, Yongkang
    Kankanhalli, Mohan S.
    Geng, Weidong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 926 - 934
  • [8] Pose Locality Constrained Representation for 3D Human Pose Reconstruction
    Fan, Xiaochuan
    Zheng, Kang
    Zhou, Youjie
    Wang, Song
    COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 174 - 188
  • [9] Unsupervised 3D Point Cloud Reconstruction via Exploring Multi-View Consistency and Complementarity
    Song, Jiahui
    Hou, Yonghong
    Peng, Bo
    Qin, Tianyi
    Huang, Qingming
    Lei, Jianjun
    IEEE TRANSACTIONS ON BROADCASTING, 2025, 71 (01) : 193 - 202
  • [10] Unsupervised Cross-Dataset Adaptation via Probabilistic Amodal 3D Human Pose Completion
    Kundu, Jogendra Nath
    Rahul, M., V
    Patravali, Jay
    Babu, R. Venkatesh
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 458 - 467