Improving Dynamic 3D Gaussian Splatting from Monocular Videos with Object Motion Information

Authors
Luo, Yixin [1, 2]
Huang, Zhangjin [1, 2]
Huang, Xudong [1]
Affiliations
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
3D Gaussian Splatting; Dynamic Scene Reconstruction; Motion Segmentation
DOI
10.1007/978-981-97-5612-4_8
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Despite the significant advances achieved by recent 3D-Gaussian-based approaches in dynamic scene reconstruction, their efficacy diminishes markedly in monocular settings, particularly under rapid object motion. This issue arises from the inherent one-to-many mapping between a monocular video and the dynamic scene: discerning precise object motion states from a monocular video is challenging, while different motion states may correspond to distinct scenes. To alleviate the issue, we first explicitly extract object motion state information from the monocular video with a pretrained video tracking model, TAM, and then separate the 3D Gaussians into static and dynamic subsets based on this information. Second, we present a three-stage training strategy to optimize the 3D Gaussians across distinct motion states. Moreover, we introduce an innovative augmentation technique that provides augmented views for supervising the 3D Gaussians, thereby enriching the model with more multi-view information, which is pivotal for accurate interpretation of motion states. Our empirical evaluations on Nvidia and iPhone, two of the most challenging monocular datasets, demonstrate our method's superior reconstruction capabilities over other dynamic Gaussian models.
Pages: 84-95
Page count: 12