Improving Dynamic 3D Gaussian Splatting from Monocular Videos with Object Motion Information

Cited by: 0
Authors
Luo, Yixin [1 ,2 ]
Huang, Zhangjin [1 ,2 ]
Huang, Xudong [1 ]
Affiliations
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
Funding
National Key R&D Program of China;
Keywords
3D Gaussian Splatting; Dynamic Scene Reconstruction; Motion Segmentation;
DOI
10.1007/978-981-97-5612-4_8
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Despite the significant advances achieved by recent 3D-Gaussian-based approaches to dynamic scene reconstruction, their efficacy diminishes markedly in monocular settings, particularly under rapid object motion. This issue arises from the inherent one-to-many mapping between a monocular video and the dynamic scene: discerning precise object motion states from a monocular video is challenging, while different motion states may correspond to distinct scenes. To alleviate the issue, we first explicitly extract object motion-state information from the monocular video with a pretrained video tracking model, TAM, and then separate the 3D Gaussians into static and dynamic subsets based on this information. Second, we present a three-stage training strategy to optimize the 3D Gaussians across distinct motion states. Moreover, we introduce an innovative augmentation technique that provides augmented views for supervising the 3D Gaussians, thereby enriching the model with additional multi-view information that is pivotal for accurately interpreting motion states. Our empirical evaluations on Nvidia and iPhone, two of the most challenging monocular datasets, demonstrate our method's superior reconstruction capabilities over other dynamic Gaussian models.
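The static/dynamic separation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a fixed pinhole camera, Gaussian centers already expressed in camera coordinates, and per-frame boolean motion masks such as a tracking model might produce. The names `project`, `split_static_dynamic`, and the `threshold` voting rule are hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Sequence, Tuple

Point3 = Tuple[float, float, float]
Mask = Sequence[Sequence[bool]]  # mask[row][col], True marks a moving object


def project(point: Point3, fx: float, fy: float,
            cx: float, cy: float) -> Optional[Tuple[int, int]]:
    """Pinhole projection of a camera-space point to (col, row), or None."""
    x, y, z = point
    if z <= 0.0:  # at or behind the camera plane, so not observable
        return None
    return int(round(fx * x / z + cx)), int(round(fy * y / z + cy))


def split_static_dynamic(centers: Sequence[Point3], masks: Sequence[Mask],
                         intrinsics: Tuple[float, float, float, float],
                         threshold: float = 0.5) -> Tuple[List[int], List[int]]:
    """Assumed rule: a Gaussian is dynamic if it falls inside the motion mask
    in at least `threshold` of the frames where it projects into the image."""
    fx, fy, cx, cy = intrinsics
    static_ids: List[int] = []
    dynamic_ids: List[int] = []
    for idx, center in enumerate(centers):
        hits = 0
        visible = 0
        pixel = project(center, fx, fy, cx, cy)  # fixed camera: same pixel per frame
        for mask in masks:
            if pixel is None:
                continue
            col, row = pixel
            if 0 <= row < len(mask) and 0 <= col < len(mask[0]):
                visible += 1
                hits += int(bool(mask[row][col]))
        is_dynamic = visible > 0 and hits / visible >= threshold
        (dynamic_ids if is_dynamic else static_ids).append(idx)
    return static_ids, dynamic_ids


# Toy usage: a 4x4 mask with one "moving" pixel at row 2, col 2.
mask = [[False] * 4 for _ in range(4)]
mask[2][2] = True
centers = [(0.0, 0.0, 1.0), (-1.0, -1.0, 1.0), (0.0, 0.0, -1.0)]
static_ids, dynamic_ids = split_static_dynamic(
    centers, [mask, mask], (1.0, 1.0, 2.0, 2.0))
```

In this toy setup the first center projects onto the masked pixel and is labeled dynamic, while the other two (one outside the mask, one behind the camera) fall into the static subset. A real pipeline would additionally need per-frame camera poses and a visibility test, which this sketch omits.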
Pages: 84-95
Number of pages: 12