PSPDNet: Part-aware shape and pose disentanglement neural network for 3D human animating meshes

被引:0
|
作者
Li, Guiqing [1 ]
Zeng, Juncheng [1 ]
Zeng, Fanzhong [1 ]
Yao, Chenhao [1 ]
Kuang, Bixia [1 ]
Nie, Yongwei [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
关键词
Disentanglement; Body shape; Mesh autoencoder; Representation learning; AUTOENCODERS;
D O I
10.1016/j.cagd.2023.102218
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Disentangled representations of shape and pose are essential for animating human body meshes in computer animation, computer games, and virtual reality applications. While recent deep neural networks have achieved impressive effectiveness, their performance in terms of interpretability, reconstruction precision, and fine-grained control is not satisfac-tory. To address these issues, we propose the Part-aware Shape and Pose Disentanglement neural network (PSPDNet), a framework for disentangling the shape and pose of 3D hu-man meshes with the same connectivity. PSPDNet utilizes part mesh autoencoders to learn representations of different human body parts, enhancing the interpretability of the la-tent codes by corresponding them with local motions. While mesh autoencoders alone can decouple shape and pose information from animated meshes, they fail to control lo-cal motions. In addition, PSPDNet employs an additional rotation-translation module to remove global rigid motion, i.e., rotation and translation, from the sequence. Finally, we propose a novel loss function which includes disentanglement loss and alignment loss to train PSPDNet in an unsupervised manner. Our experiments show that PSPDNet greatly im-proves disentangled representation with strong interpretability, insensitivity to global rigid transformation, and locality of editing and controlling.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Context-Aware Network for 3D Human Pose Estimation from Monocular RGB Image
    Yin, Binyi
    Zhang, Dongbo
    Li, Shuai
    Hao, Aimin
    Qin, Hong
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [32] Structure-aware shape correspondence network for 3D shape synthesis
    Lang, Xufeng
    Sun, Zhengxing
    COMPUTER AIDED GEOMETRIC DESIGN, 2020, 79
  • [33] A 3D shape classifier with neural network supervision
    Liu, Zhenbao
    Mitani, Jun
    Fukui, Yukio
    Nishihara, Seiichi
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2010, 38 (1-3) : 134 - 143
  • [34] 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network
    Li, Sijin
    Chan, Antoni B.
    COMPUTER VISION - ACCV 2014, PT II, 2015, 9004 : 332 - 347
  • [35] Improving 3D Human Pose Estimation via 3D Part Affinity Fields
    Liu, Ding
    Zhao, Zixu
    Wang, Xinchao
    Hu, Yuxiao
    Zhang, Lei
    Huang, Thomas S.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1004 - 1013
  • [36] MRGAN: Multi-Rooted 3D Shape Representation Learning with Unsupervised Part Disentanglement
    Gal, Rinon
    Bermano, Amit
    Zhang, Hao
    Cohen-Or, Daniel
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2039 - 2048
  • [37] Learnable Human Mesh Triangulation for 3D Human Pose and Shape Estimation
    Chun, Sungho
    Park, Sungbum
    Chang, Ju Yong
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2849 - 2858
  • [38] Neural Template: Topology-aware Reconstruction and Disentangled Generation of 3D Meshes
    Hui, Ka-Hei
    Li, Ruihui
    Hu, Jingyu
    Fu, Chi-Wing
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18551 - 18561
  • [39] HYRE: Hybrid Regressor for 3D Human Pose and Shape Estimation
    Li, Wenhao
    Liu, Mengyuan
    Liu, Hong
    Ren, Bin
    Li, Xia
    You, Yingxuan
    Sebe, Nicu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 235 - 246
  • [40] BodyPrint: Pose Invariant 3D Shape Matching of Human Bodies
    Wang, Jiangping
    Ma, Kai
    Singh, Vivek Kumar
    Huang, Thomas
    Chen, Terrence
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1591 - 1599