Improving Dynamic 3D Gaussian Splatting from Monocular Videos with Object Motion Information

被引:0
|
作者
Luo, Yixin [1 ,2 ]
Huang, Zhangjin [1 ,2 ]
Huang, Xudong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
基金
国家重点研发计划;
关键词
3D Gaussian Splatting; Dynamic Scene Reconstruction; Motion Segmentation;
D O I
10.1007/978-981-97-5612-4_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the significant advancements achieved by recent 3D-Gaussian-based approaches in dynamic scene reconstruction, their efficacy is markedly diminished in monocular settings, particularly under conditions of rapid object motion. This issue arises from the inherent one-to-many mapping between monocular video and the dynamic scene, i.e., discerning precise object motion states from a monocular video is challenging while varying motion states may correspond to distinct scenes. To alleviate the issue, firstly, we explicitly extract the object motion states information from the monocular video wth a pretrained video tracking model, TAM, and then separate 3D Gaussians into static and dynamic subsets based on such motion states information. Secondly, we present a three-stage training strategy to optimize 3D Gaussian across distinct motion states. Moreover, we introduce an innovative augmentation technique that provides augment views for supervising 3D Gaussians, thereby enriching the model with more multi-view information, pivotal for accurate interpretation of motion states. Our empirical evaluations on Nvidia and iPhone, two of the most challenging monocular datasets, demonstrates our method's superior reconstruction capabilities over other dynamic Gaussian models.
引用
下载
收藏
页码:84 / 95
页数:12
相关论文
共 50 条
  • [1] GaussianAvatar: Human avatar Gaussian splatting from monocular videos
    Lin, Haian
    Zhan, Yinwei
    Computers and Graphics, 2025, 126
  • [2] Monocular 3D Object Detection with Depth from Motion
    Wang, Tai
    Pang, Jiangmiao
    Lin, Dahua
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
  • [3] Deblur-GS: 3D Gaussian Splatting from Camera Motion Blurred Images
    Chen, Wenbo
    Liu, Ligang
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2024, 7 (01)
  • [4] 3D Gaussian Splatting with Deferred Reflection
    Ye, Keyang
    Hou, Qiming
    Zhou, Kun
    PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS, 2024,
  • [5] Joint 3D Human Motion Capture and Physical Analysis from Monocular Videos
    Zell, Petrissa
    Wandt, Bastian
    Rosenhahn, Bodo
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 17 - 26
  • [6] Recent advances in 3D Gaussian splatting
    Wu, Tong
    Yuan, Yu-Jie
    Zhang, Ling-Xiao
    Yang, Jie
    Cao, Yan-Pei
    Yan, Ling-Qi
    Gao, Lin
    COMPUTATIONAL VISUAL MEDIA, 2024, 10 (04) : 613 - 642
  • [7] GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting
    Yang, Chen
    Li, Sikuang
    Fang, Jiemin
    Liang, Ruofan
    Xie, Lingxi
    Zhang, Xiaopeng
    Shen, Wei
    Tian, Qi
    ACM Transactions on Graphics, 2024, 43 (06):
  • [8] Monocular 3D Object Detection With Motion Feature Distillation
    Hu, Henan
    Li, Muyu
    Zhu, Ming
    Gao, Wen
    Liu, Peiyu
    Chan, Kwok-Leung
    IEEE ACCESS, 2023, 11 : 82933 - 82945
  • [9] REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos
    Qiu, Lingteng
    Chen, Guanying
    Zhou, Jiapeng
    Xu, Mutian
    Wang, Junle
    Han, Xiaoguang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4637 - 4646
  • [10] Toward Realistic 3D Avatar Generation with Dynamic 3D Gaussian Splatting for AR/VR Communication
    Song, Hail
    2024 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS, VRW 2024, 2024, : 1124 - 1125