GaussianAvatar: Human avatar Gaussian splatting from monocular videos☆

被引：1

作者：

Lin, Haian

Zhan, Yinwei ^{[1
]}

机构：

[1] Guangdong Univ Technol, Guangzhou Univ Town,West Rd, Guangzhou 510006, Peoples R China

来源：

COMPUTERS & GRAPHICS-UK | 2025年 / 126卷

关键词：

Neural radiance field; 3D Gaussian; Human reconstruction;

D O I：

10.1016/j.cag.2024.104155

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Many application fields including virtual reality and movie production demand reconstructing high-quality digital human avatars from monocular videos and real-time rendering. However, existing neural radiance field (NeRF)-based methods are costly to train and render. In this paper, we propose GaussianAvatar, a novel framework that extends 3D Gaussian to dynamic human scenes, enabling fast training and real-time rendering. The human 3D Gaussian in canonical space is initialized and transformed to posed space using Linear Blend Skinning (LBS), based on pose parameters, to learn the fine details of the human body at a very small computational cost. We design a pose parameter refinement module and a LBS weight optimization module to increase the accuracy of the pose parameter detection in the real dataset and introduce multi-resolution hash coding to accelerate the training speed. Experimental results demonstrate that our method outperforms existing methods in terms of training time, rendering speed, and reconstruction quality.

引用

页数：9

共 50 条

[1] GauHuman: Articulated Gaussian Splatting from Monocular Human Videos
Hu, Shoukang
Hu, Tao
Liu, Ziwei
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 20418 - 20431
[2] THGS: Lifelike Talking Human Avatar Synthesis From Monocular Video Via 3D Gaussian Splatting
Chen, Chuang
Yu, Lingyun
Yang, Quanwei
Zheng, Aihua
Xie, Hongtao
COMPUTER GRAPHICS FORUM, 2025,
[3] Improving Dynamic 3D Gaussian Splatting from Monocular Videos with Object Motion Information
Luo, Yixin
Huang, Zhangjin
Huang, Xudong
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024, 2024, 14872 : 84 - 95
[4] Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos
Jiang, Yuheng
Shen, Zhehao
Hong, Yu
Guo, Chengcheng
Wu, Yize
Zhang, Yingliang
Yu, Jingyi
Xu, Lan
ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (06):
[5] DLCA-Recon: Dynamic Loose Clothing Avatar Reconstruction from Monocular Videos
Luo, Chunjie
Luo, Fei
Wang, Yuseng
Zhao, Enxu
Xiao, Chunxia
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3963 - 3971
[6] Tracking human arm from monocular videos
Yue, HongQiang
Li, ChengRong
Liang, YiXiong
Luo, YangYu
2007 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS I-V, CONFERENCE PROCEEDINGS, 2007, : 2155 - 2159
[7] MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar
Chen, Yufan
Wang, Lizhen
Li, Qijing
Xiao, Hongjiang
Zhang, Shengping
Yao, Hongxun
Liu, Yebin
PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS, 2024,
[8] High-quality three-dimensional cartoon avatar reconstruction with Gaussian splatting
Jang, Minhyuk
Kim, Jong Wook
Jang, Youngdong
Kim, Donghyun
Roh, Wonseok
Hwang, Inyong
Lin, Guang
Kim, Sangpil
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148
[9] Real-time Gaussian Splatting for Dynamic Reconstruction in Stationary Monocular Cameras
Chen, Minyu
Xie, Weixing
Deng, Zhiyang
Shi, Chenglong
Gou, Ruoxuan
Chen, Yingxuan
Dong, Xiao
2024 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE, SEAI 2024, 2024, : 33 - 37
[10] RAM-Avatar: Real-time Photo-Realistic Avatar from Monocular Videos with Full-body Control
Deng, Xiang
Zheng, Zerong
Zhang, Yuxiang
Sun, Jingxiang
Xu, Chao
Yang, Xiaodong
Wang, Lizhen
Liu, Yebin
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 1996 - 2007

← 1 2 3 4 5 →