GaussianAvatar: Human avatar Gaussian splatting from monocular videos☆

被引:1
|
作者
Lin, Haian
Zhan, Yinwei [1 ]
机构
[1] Guangdong Univ Technol, Guangzhou Univ Town,West Rd, Guangzhou 510006, Peoples R China
来源
COMPUTERS & GRAPHICS-UK | 2025年 / 126卷
关键词
Neural radiance field; 3D Gaussian; Human reconstruction;
D O I
10.1016/j.cag.2024.104155
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many application fields including virtual reality and movie production demand reconstructing high-quality digital human avatars from monocular videos and real-time rendering. However, existing neural radiance field (NeRF)-based methods are costly to train and render. In this paper, we propose GaussianAvatar, a novel framework that extends 3D Gaussian to dynamic human scenes, enabling fast training and real-time rendering. The human 3D Gaussian in canonical space is initialized and transformed to posed space using Linear Blend Skinning (LBS), based on pose parameters, to learn the fine details of the human body at a very small computational cost. We design a pose parameter refinement module and a LBS weight optimization module to increase the accuracy of the pose parameter detection in the real dataset and introduce multi-resolution hash coding to accelerate the training speed. Experimental results demonstrate that our method outperforms existing methods in terms of training time, rendering speed, and reconstruction quality.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality
    Chen, Yu-Chih
    Saha, Avinab
    Chapiro, Alexandre
    Hane, Christian
    Bazin, Jean-Charles
    Qiu, Bo
    Zanetti, Stefano
    Katsavounidis, Ioannis
    Bovik, Alan C.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5740 - 5754
  • [32] Dynamic Gaussian Splatting from Markerless Motion Capture Reconstruct Infants Movements
    Cotton, R. James
    Peyton, Colleen
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 60 - 68
  • [33] I M Avatar: Implicit Morphable Head Avatars from Videos
    Zheng, Yufeng
    Abrevaya, Victoria Fernandez
    Buehler, Marcel C.
    Chen, Xu
    Black, Michael J.
    Hilliges, Otmar
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13535 - 13545
  • [34] Surgical Tool Pose Estimation from Monocular Endoscopic Videos
    Kumar, Suren
    Sovizi, Javad
    Narayanan, Madusudanan Sathia
    Krovi, Venkat
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 598 - 603
  • [35] Rendering Humans from Object-Occluded Monocular Videos
    Xiang, Tiange
    Sun, Adam
    Wu, Jiajun
    Adeli, Ehsan
    Fei-Fei, Li
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3216 - 3227
  • [36] Learning Depth from Monocular Videos using Direct Methods
    Wang, Chaoyang
    Miguel Buenaposada, Jose
    Zhu, Rui
    Lucey, Simon
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2022 - 2030
  • [37] NeuPhysics: Editable Neural Geometry and Physics from Monocular Videos
    Qiao, Yi-Ling
    Gao, Alexander
    Lin, Ming C.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [38] iHuman: Instant Animatable Digital Humans From Monocular Videos
    Paudel, Pramish
    Hanal, Anubhav K.
    Paudel, Danda Pani
    Tandukar, Jyoti
    Chhatkuli, Ajad
    COMPUTER VISION - ECCV 2024, PT LXXV, 2025, 15133 : 304 - 323
  • [39] Temporally Refined Graph U-Nets for Human Shape and Pose Estimation From Monocular Videos
    Zhao, Yang
    Dou, Yong
    Feng, Jiashi
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1949 - 1953
  • [40] Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian Splatting
    Ye, Sheng
    Dong, Zhen-Hui
    Hu, Yubin
    Wen, Yu-Hui
    Liu, Yong-Jin
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)