HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

被引:0
|
作者
Zhou, Zhenglin [1 ,2 ]
Ma, Fan [2 ]
Fan, Hehe [2 ]
Yang, Zongxin [2 ]
Yang, Yi [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Brain Machine Intelligence, Hangzhou, Peoples R China
[2] Zhejiang Univ, ReLER, CCAI, Hangzhou, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Head avatar animation; Text-guided generation; 3D Gaussian splatting;
D O I
10.1007/978-3-031-73411-3_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating digital avatars from textual prompts has long been a desirable yet challenging task. Despite the promising results achieved with 2D diffusion priors, current methods struggle to create high-quality and consistent animated avatars efficiently. Previous animatable head models like FLAME have difficulty in accurately representing detailed texture and geometry. Additionally, high-quality 3D static representations face challenges in semantically driving with dynamic priors. In this paper, we introduce HeadStudio, a novel framework that utilizes 3D Gaussian splatting to generate realistic and animatable avatars from text prompts. Firstly, we associate 3D Gaussians with animatable head prior model, facilitating semantic animation on high-quality 3D representations. To ensure consistent animation, we further enhance the optimization from initialization, distillation, and regularization to jointly learn the shape, texture, and animation. Extensive experiments demonstrate the efficacy of HeadStudio in generating animatable avatars from textual prompts, exhibiting appealing appearances. The avatars are capable of rendering high-quality real-time (>= 40 fps) novel views at a resolution of 1024. Moreover, These avatars can be smoothly driven by real-world speech and video. We hope that HeadStudio can enhance digital avatar creation and gain popularity in the community. Code is at: https:// github.com/ZhenglinZhou/HeadStudio.
引用
收藏
页码:145 / 163
页数:19
相关论文
共 50 条
  • [31] Impact of Data Capture Methods on 3D Reconstruction with Gaussian Splatting
    Rangelov, Dimitar
    Waanders, Sierd
    Waanders, Kars
    van Keulen, Maurice
    Miltchev, Radoslav
    JOURNAL OF IMAGING, 2025, 11 (02)
  • [32] Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration
    Liang, Zhihao
    Zhang, Qi
    Hu, Wenbo
    Zhu, Lei
    Feng, Ying
    Jia, Kui
    COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 281 - 297
  • [33] Superpixel-guided Sampling for Compact 3D Gaussian Splatting
    Kim, Myoung Gon
    Jeong, SeungWon
    Park, Seohyeon
    Han, JungHyun
    30TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, VRST 2024, 2024,
  • [34] GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces
    Jiang, Yingwenqi
    Tu, Jiadong
    Liu, Yuan
    Gao, Xifeng
    Long, Xiaoxiao
    Wang, Wenping
    Ma, Yuexin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 5322 - 5332
  • [35] A review of recent advances in 3D Gaussian Splatting for optimization and reconstruction
    Luo, Jie
    Huang, Tianlun
    Wang, Weijun
    Feng, Wei
    IMAGE AND VISION COMPUTING, 2024, 151
  • [36] Gaussian Splatting: 3D Reconstruction and Novel View Synthesis: A Review
    Dalal, Anurag
    Hagen, Daniel
    Robbersmyr, Kjell G.
    Knausgard, Kristian Muri
    IEEE ACCESS, 2024, 12 : 96797 - 96820
  • [37] 3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting
    Lyu, Xiaoyang
    Sun, Yang-Tian
    Huang, Yi-Hua
    Wu, Xiuzhe
    Yang, Ziyi
    Chen, Yilun
    Pang, Jiangmiao
    Qi, Xiaojuan
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (06):
  • [38] SynGauss: Real-Time 3D Gaussian Splatting for Audio-Driven Talking Head Synthesis
    Zhou, Zhanyi
    Feng, Quandong
    Li, Hongjun
    IEEE ACCESS, 2025, 13 : 42167 - 42177
  • [39] Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis
    Niedermayr, Simon
    Stumpfegger, Josef
    Westermann, Ruediger
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10349 - 10358
  • [40] DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction
    Ham, Yujin
    Michalkiewicz, Mateusz
    Balakrishnan, Guha
    2024 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL PHOTOGRAPHY, ICCP 2024, 2024,