Improving Dynamic 3D Gaussian Splatting from Monocular Videos with Object Motion Information

被引:0
|
作者
Luo, Yixin [1 ,2 ]
Huang, Zhangjin [1 ,2 ]
Huang, Xudong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024 | 2024年 / 14872卷
基金
国家重点研发计划;
关键词
3D Gaussian Splatting; Dynamic Scene Reconstruction; Motion Segmentation;
D O I
10.1007/978-981-97-5612-4_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the significant advancements achieved by recent 3D-Gaussian-based approaches in dynamic scene reconstruction, their efficacy is markedly diminished in monocular settings, particularly under conditions of rapid object motion. This issue arises from the inherent one-to-many mapping between monocular video and the dynamic scene, i.e., discerning precise object motion states from a monocular video is challenging while varying motion states may correspond to distinct scenes. To alleviate the issue, firstly, we explicitly extract the object motion states information from the monocular video wth a pretrained video tracking model, TAM, and then separate 3D Gaussians into static and dynamic subsets based on such motion states information. Secondly, we present a three-stage training strategy to optimize 3D Gaussian across distinct motion states. Moreover, we introduce an innovative augmentation technique that provides augment views for supervising 3D Gaussians, thereby enriching the model with more multi-view information, pivotal for accurate interpretation of motion states. Our empirical evaluations on Nvidia and iPhone, two of the most challenging monocular datasets, demonstrates our method's superior reconstruction capabilities over other dynamic Gaussian models.
引用
收藏
页码:84 / 95
页数:12
相关论文
共 50 条
  • [1] GauHuman: Articulated Gaussian Splatting from Monocular Human Videos
    Hu, Shoukang
    Hu, Tao
    Liu, Ziwei
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 20418 - 20431
  • [2] GaussianAvatar: Human avatar Gaussian splatting from monocular videos☆
    Lin, Haian
    Zhan, Yinwei
    COMPUTERS & GRAPHICS-UK, 2025, 126
  • [3] Monocular 3D Object Detection with Depth from Motion
    Wang, Tai
    Pang, Jiangmiao
    Lin, Dahua
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
  • [4] ThermalGS: Dynamic 3D Thermal Reconstruction with Gaussian Splatting
    Liu, Yuxiang
    Chen, Xi
    Yan, Shen
    Cui, Zeyu
    Xiao, Huaxin
    Liu, Yu
    Zhang, Maojun
    REMOTE SENSING, 2025, 17 (02)
  • [5] SWinGS: Sliding Windows for Dynamic 3D Gaussian Splatting
    Shaw, Richard
    Nazarczuk, Michal
    Song, Jifei
    Moreau, Arthur
    Catley-Chandar, Sibi
    Dhamo, Helisa
    Perez-Pellitero, Eduardo
    COMPUTER VISION - ECCV 2024, PT LV, 2025, 15113 : 37 - 54
  • [6] Object-centric Reconstruction and Tracking of Dynamic Unknown Objects Using 3D Gaussian Splatting
    Barad, Kuldeep R.
    Richard, Antoine
    Dentler, Jan
    Olivares-Mendez, Miguel
    Martinez, Carol
    2024 INTERNATIONAL CONFERENCE ON SPACE ROBOTICS, ISPARO, 2024, : 202 - 209
  • [7] Deblurring 3D Gaussian Splatting
    Lee, Byeonghyeon
    Lee, Howoong
    Sun, Xiangyu
    Alit, Usman
    Park, Eunbyung
    COMPUTER VISION - ECCV 2024, PT LVIII, 2025, 15116 : 127 - 143
  • [8] Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal
    Wang, Yuxin
    Wu, Qianyi
    Zhang, Guofeng
    Xu, Dan
    COMPUTER VISION - ECCV 2024, PT III, 2025, 15061 : 1 - 17
  • [9] Deblur-GS: 3D Gaussian Splatting from Camera Motion Blurred Images
    Chen, Wenbo
    Liu, Ligang
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2024, 7 (01)
  • [10] THGS: Lifelike Talking Human Avatar Synthesis From Monocular Video Via 3D Gaussian Splatting
    Chen, Chuang
    Yu, Lingyun
    Yang, Quanwei
    Zheng, Aihua
    Xie, Hongtao
    COMPUTER GRAPHICS FORUM, 2025,