HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

被引:0
|
作者
Zhou, Zhenglin [1 ,2 ]
Ma, Fan [2 ]
Fan, Hehe [2 ]
Yang, Zongxin [2 ]
Yang, Yi [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Brain Machine Intelligence, Hangzhou, Peoples R China
[2] Zhejiang Univ, ReLER, CCAI, Hangzhou, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Head avatar animation; Text-guided generation; 3D Gaussian splatting;
D O I
10.1007/978-3-031-73411-3_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating digital avatars from textual prompts has long been a desirable yet challenging task. Despite the promising results achieved with 2D diffusion priors, current methods struggle to create high-quality and consistent animated avatars efficiently. Previous animatable head models like FLAME have difficulty in accurately representing detailed texture and geometry. Additionally, high-quality 3D static representations face challenges in semantically driving with dynamic priors. In this paper, we introduce HeadStudio, a novel framework that utilizes 3D Gaussian splatting to generate realistic and animatable avatars from text prompts. Firstly, we associate 3D Gaussians with animatable head prior model, facilitating semantic animation on high-quality 3D representations. To ensure consistent animation, we further enhance the optimization from initialization, distillation, and regularization to jointly learn the shape, texture, and animation. Extensive experiments demonstrate the efficacy of HeadStudio in generating animatable avatars from textual prompts, exhibiting appealing appearances. The avatars are capable of rendering high-quality real-time (>= 40 fps) novel views at a resolution of 1024. Moreover, These avatars can be smoothly driven by real-world speech and video. We hope that HeadStudio can enhance digital avatar creation and gain popularity in the community. Code is at: https:// github.com/ZhenglinZhou/HeadStudio.
引用
收藏
页码:145 / 163
页数:19
相关论文
共 50 条
  • [21] Generalizable 3D Gaussian Splatting for novel view synthesis
    Zhao, Chuyue
    Huang, Xin
    Yang, Kun
    Wang, Xue
    Wang, Qing
    PATTERN RECOGNITION, 2025, 161
  • [22] View-Consistent 3D Editing with Gaussian Splatting
    Wang, Yuxuan
    Yi, Xuanyu
    Wu, Zike
    Zhao, Na
    Chen, Long
    Zhang, Hanwang
    COMPUTER VISION-ECCV 2024, PT XXXV, 2025, 15093 : 404 - 420
  • [23] Characterization and Analysis of the 3D Gaussian Splatting Rendering Pipeline
    Lee, Jiwon
    Lee, Yunjae
    Kwon, Youngeun
    Rhu, Minsoo
    IEEE COMPUTER ARCHITECTURE LETTERS, 2025, 24 (01) : 13 - 16
  • [24] ThermalGS: Dynamic 3D Thermal Reconstruction with Gaussian Splatting
    Liu, Yuxiang
    Chen, Xi
    Yan, Shen
    Cui, Zeyu
    Xiao, Huaxin
    Liu, Yu
    Zhang, Maojun
    REMOTE SENSING, 2025, 17 (02)
  • [25] FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization
    Zhang, Jiahui
    Zhan, Fangnen
    Xu, Muyu
    Lu, Shijian
    Xing, Eric
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21424 - 21433
  • [26] SWinGS: Sliding Windows for Dynamic 3D Gaussian Splatting
    Shaw, Richard
    Nazarczuk, Michal
    Song, Jifei
    Moreau, Arthur
    Catley-Chandar, Sibi
    Dhamo, Helisa
    Perez-Pellitero, Eduardo
    COMPUTER VISION - ECCV 2024, PT LV, 2025, 15113 : 37 - 54
  • [27] Characterizing Satellite Geometry via Accelerated 3D Gaussian Splatting
    Nguyen, Van Minh
    Sandidge, Emma
    Mahendrakar, Trupti
    White, Ryan T.
    AEROSPACE, 2024, 11 (03)
  • [28] GS-IR: 3D Gaussian Splatting for Inverse Rendering
    Liang, Zhihao
    Zhang, Qi
    Feng, Ying
    Shan, Ying
    Jia, Kui
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21644 - 21653
  • [29] On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy
    Huang, Letian
    Bai, Jiayang
    Guo, Jie
    Li, Yuanqi
    Guo, Yanwen
    COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 247 - 263
  • [30] GauLoc: 3D Gaussian Splatting-based Camera Relocalization
    Xin, Zhe
    Dai, Chengkai
    Li, Ying
    Wu, Chenming
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)