GaussianAvatar: Human avatar Gaussian splatting from monocular videos☆

被引:1
|
作者
Lin, Haian
Zhan, Yinwei [1 ]
机构
[1] Guangdong Univ Technol, Guangzhou Univ Town,West Rd, Guangzhou 510006, Peoples R China
来源
COMPUTERS & GRAPHICS-UK | 2025年 / 126卷
关键词
Neural radiance field; 3D Gaussian; Human reconstruction;
D O I
10.1016/j.cag.2024.104155
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many application fields including virtual reality and movie production demand reconstructing high-quality digital human avatars from monocular videos and real-time rendering. However, existing neural radiance field (NeRF)-based methods are costly to train and render. In this paper, we propose GaussianAvatar, a novel framework that extends 3D Gaussian to dynamic human scenes, enabling fast training and real-time rendering. The human 3D Gaussian in canonical space is initialized and transformed to posed space using Linear Blend Skinning (LBS), based on pose parameters, to learn the fine details of the human body at a very small computational cost. We design a pose parameter refinement module and a LBS weight optimization module to increase the accuracy of the pose parameter detection in the real dataset and introduce multi-resolution hash coding to accelerate the training speed. Experimental results demonstrate that our method outperforms existing methods in terms of training time, rendering speed, and reconstruction quality.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] GauHuman: Articulated Gaussian Splatting from Monocular Human Videos
    Hu, Shoukang
    Hu, Tao
    Liu, Ziwei
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 20418 - 20431
  • [2] THGS: Lifelike Talking Human Avatar Synthesis From Monocular Video Via 3D Gaussian Splatting
    Chen, Chuang
    Yu, Lingyun
    Yang, Quanwei
    Zheng, Aihua
    Xie, Hongtao
    COMPUTER GRAPHICS FORUM, 2025,
  • [3] Improving Dynamic 3D Gaussian Splatting from Monocular Videos with Object Motion Information
    Luo, Yixin
    Huang, Zhangjin
    Huang, Xudong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024, 2024, 14872 : 84 - 95
  • [4] Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos
    Jiang, Yuheng
    Shen, Zhehao
    Hong, Yu
    Guo, Chengcheng
    Wu, Yize
    Zhang, Yingliang
    Yu, Jingyi
    Xu, Lan
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (06):
  • [5] DLCA-Recon: Dynamic Loose Clothing Avatar Reconstruction from Monocular Videos
    Luo, Chunjie
    Luo, Fei
    Wang, Yuseng
    Zhao, Enxu
    Xiao, Chunxia
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3963 - 3971
  • [6] Tracking human arm from monocular videos
    Yue, HongQiang
    Li, ChengRong
    Liang, YiXiong
    Luo, YangYu
    2007 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS I-V, CONFERENCE PROCEEDINGS, 2007, : 2155 - 2159
  • [7] MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar
    Chen, Yufan
    Wang, Lizhen
    Li, Qijing
    Xiao, Hongjiang
    Zhang, Shengping
    Yao, Hongxun
    Liu, Yebin
    PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS, 2024,
  • [8] High-quality three-dimensional cartoon avatar reconstruction with Gaussian splatting
    Jang, Minhyuk
    Kim, Jong Wook
    Jang, Youngdong
    Kim, Donghyun
    Roh, Wonseok
    Hwang, Inyong
    Lin, Guang
    Kim, Sangpil
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148
  • [9] Real-time Gaussian Splatting for Dynamic Reconstruction in Stationary Monocular Cameras
    Chen, Minyu
    Xie, Weixing
    Deng, Zhiyang
    Shi, Chenglong
    Gou, Ruoxuan
    Chen, Yingxuan
    Dong, Xiao
    2024 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE, SEAI 2024, 2024, : 33 - 37
  • [10] RAM-Avatar: Real-time Photo-Realistic Avatar from Monocular Videos with Full-body Control
    Deng, Xiang
    Zheng, Zerong
    Zhang, Yuxiang
    Sun, Jingxiang
    Xu, Chao
    Yang, Xiaodong
    Wang, Lizhen
    Liu, Yebin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 1996 - 2007