Improving Dynamic 3D Gaussian Splatting from Monocular Videos with Object Motion Information

被引:0
|
作者
Luo, Yixin [1 ,2 ]
Huang, Zhangjin [1 ,2 ]
Huang, Xudong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024 | 2024年 / 14872卷
基金
国家重点研发计划;
关键词
3D Gaussian Splatting; Dynamic Scene Reconstruction; Motion Segmentation;
D O I
10.1007/978-981-97-5612-4_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the significant advancements achieved by recent 3D-Gaussian-based approaches in dynamic scene reconstruction, their efficacy is markedly diminished in monocular settings, particularly under conditions of rapid object motion. This issue arises from the inherent one-to-many mapping between monocular video and the dynamic scene, i.e., discerning precise object motion states from a monocular video is challenging while varying motion states may correspond to distinct scenes. To alleviate the issue, firstly, we explicitly extract the object motion states information from the monocular video wth a pretrained video tracking model, TAM, and then separate 3D Gaussians into static and dynamic subsets based on such motion states information. Secondly, we present a three-stage training strategy to optimize 3D Gaussian across distinct motion states. Moreover, we introduce an innovative augmentation technique that provides augment views for supervising 3D Gaussians, thereby enriching the model with more multi-view information, pivotal for accurate interpretation of motion states. Our empirical evaluations on Nvidia and iPhone, two of the most challenging monocular datasets, demonstrates our method's superior reconstruction capabilities over other dynamic Gaussian models.
引用
收藏
页码:84 / 95
页数:12
相关论文
共 50 条
  • [21] GaussReg: Fast 3D Registration with Gaussian Splatting
    Change, Jiahao
    Xu, Yinglin
    Li, Yihao
    Chen, Yuantao
    Feng, Wensen
    Han, Xiaoguang
    COMPUTER VISION - ECCV 2024, PT XV, 2025, 15073 : 407 - 423
  • [22] Toward Realistic 3D Avatar Generation with Dynamic 3D Gaussian Splatting for AR/VR Communication
    Song, Hail
    2024 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS, VRW 2024, 2024, : 1124 - 1125
  • [23] Robust 3D arm tracking from monocular videos
    Guo, F
    Qian, G
    ADVANCES IN INTELLIGENT COMPUTING, PT 2, PROCEEDINGS, 2005, 3645 : 841 - 850
  • [24] Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections
    Zhang, Dongbin
    Wang, Chuming
    Wang, Weitao
    Li, Peihao
    Qin, Minghan
    Wang, Haoqian
    COMPUTER VISION - ECCV 2024, PT LXXVI, 2025, 15134 : 341 - 359
  • [25] Privacy-Preserving 3D Gaussian Splatting
    Ali, Usman
    2024 IEEE GAMING, ENTERTAINMENT, AND MEDIA CONFERENCE, GEM 2024, 2024, : 385 - 387
  • [26] Reducing the Memory Footprint of 3D Gaussian Splatting
    Papantonakis, Panagiotis
    Kopanas, Georgios
    Kerbl, Bernhard
    Lanvin, Alexandre
    Drettakis, George
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2024, 7 (01)
  • [27] An Immersive 3D Navigation System Using 3D Gaussian Splatting
    Chen, Ming-Yi
    Chang, I-Cheng
    Chen, Jin-Wei
    Yang, Bing-Hua
    Wun, Cun-Fang
    2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, : 735 - 736
  • [28] Estimating 3D Motion and Forces of Human–Object Interactions from Internet Videos
    Zongmian Li
    Jiri Sedlar
    Justin Carpentier
    Ivan Laptev
    Nicolas Mansard
    Josef Sivic
    International Journal of Computer Vision, 2022, 130 : 363 - 383
  • [29] Real-time 3D human pose and motion reconstruction from monocular RGB videos
    Yiannakides, Anastasios
    Aristidou, Andreas
    Chrysanthou, Yiorgos
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2019, 30 (3-4)
  • [30] MonoDCN: Monocular 3D object detection based on dynamic convolution
    Qu, Shenming
    Yang, Xinyu
    Gao, Yiming
    Liang, Shengbin
    PLOS ONE, 2022, 17 (10):