DiffSurf: A Transformer-Based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose

被引:0
|
作者
Yoshiyasu, Yusuke [1 ]
Sun, Leyuan [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, 1-1-1 Umezono, Tsukuba, Ibaraki, Japan
来源
关键词
Diffusion model; 3D surface; Human mesh recovery;
D O I
10.1007/978-3-031-73007-8_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents DiffSurf, a transformer-based denoising diffusion model for generating and reconstructing 3D surfaces. Specifically, we design a diffusion transformer architecture that predicts noise from noisy 3D surface vertices and normals. With this architecture, DiffSurf is able to generate 3D surfaces in various poses and shapes, such as human bodies, hands, animals and man-made objects. Further, DiffSurf is versatile in that it can address various 3D downstream tasks including morphing, body shape variation and 3D human mesh fitting to 2D keypoints. Experimental results on 3D human model benchmarks demonstrate that DiffSurf can generate shapes with greater diversity and higher quality than previous generative models. Furthermore, when applied to the task of single-image 3D human mesh recovery, DiffSurf achieves accuracy comparable to prior techniques at a near real-time rate.
引用
收藏
页码:246 / 264
页数:19
相关论文
共 50 条
  • [1] Transformer-based weakly supervised 3D human pose estimation
    Wu, Xiao-guang
    Xie, Hu-jie
    Niu, Xiao-chen
    Wang, Chen
    Wang, Ze-lei
    Zhang, Shi-wen
    Shan, Yu-ze
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 109
  • [2] Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
    Li, Wenhao
    Liu, Mengyuan
    Liu, Hong
    Wang, Pichao
    Cai, Jialun
    Sebe, Nicu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 604 - 613
  • [3] Transformer-based 3D Human pose estimation and action achievement evaluation
    Yang, Aolei
    Zhou, Yinghong
    Yang, Banghua
    Xu, Yulin
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (04): : 136 - 144
  • [4] TransFusion: A Practical and Effective Transformer-Based Diffusion Model for 3D Human Motion Prediction
    Tian, Sibo
    Zheng, Minghui
    Liang, Xiao
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6232 - 6239
  • [5] Multi-hypothesis representation learning for transformer-based 3D human pose estimation
    Li, Wenhao
    Liu, Hong
    Tang, Hao
    Wang, Pichao
    PATTERN RECOGNITION, 2023, 141
  • [6] Synthetic CT generation from MRI using 3D transformer-based denoising diffusion model
    Pan, Shaoyan
    Abouei, Elham
    Wynne, Jacob
    Chang, Chih-Wei
    Wang, Tonghe
    Qiu, Richard L. J.
    Li, Yuheng
    Peng, Junbo
    Roper, Justin
    Patel, Pretesh
    Yu, David S.
    Mao, Hui
    Yang, Xiaofeng
    MEDICAL PHYSICS, 2024, 51 (04) : 2538 - 2548
  • [7] Generating Music Transition by Using a Transformer-Based Model
    Hsu, Jia-Lien
    Chang, Shuh-Jiun
    ELECTRONICS, 2021, 10 (18)
  • [8] THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers
    Zanfir, Mihai
    Zanfir, Andrei
    Bazavan, Eduard Gabriel
    Freeman, William T.
    Sukthankar, Rahul
    Sminchisescu, Cristian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12951 - 12960
  • [9] Generating Qualitative Descriptions of Diagrams with a Transformer-Based Language Model
    Schorlemmer, Marco
    Ballout, Mohamad
    Kuehnberger, Kai-Uwe
    DIAGRAMMATIC REPRESENTATION AND INFERENCE, DIAGRAMS 2024, 2024, 14981 : 61 - 75
  • [10] A TRANSFORMER-BASED APPROACH FOR METAL 3D PRINTING QUALITY RECOGNITION
    Zhang, Weihao
    Wang, Jiapeng
    Ma, Honglin
    Zhang, Qi
    Fan, Shuqian
    2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,