Neural Sparse Voxel Fields

被引:0
|
作者
Liu, Lingjie [1 ]
Gu, Jiatao [2 ]
Lin, Kyaw Zaw [3 ]
Chua, Tat-Seng [3 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Facebook AI Res, Menlo Pk, CA USA
[3] Natl Univ Singapore, Singapore, Singapore
关键词
REPRESENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Photo-realistic free-viewpoint rendering of real-world scenes using classical computer graphics techniques is challenging, because it requires the difficult step of capturing detailed appearance and geometry models. Recent studies have demonstrated promising results by learning scene representations that implicitly encode both geometry and appearance without 3D supervision. However, existing approaches in practice often show blurry renderings caused by the limited network capacity or the difficulty in finding accurate intersections of camera rays with the scene geometry. Synthesizing high-resolution imagery from these representations often requires time-consuming optical ray marching. In this work, we introduce Neural Sparse Voxel Fields (NSVF), a new neural scene representation for fast and high-quality free-viewpoint rendering. NSVF defines a set of voxel-bounded implicit fields organized in a sparse voxel octree to model local properties in each cell. We progressively learn the underlying voxel structures with a diffentiable ray-marching operation from only a set of posed RGB images. With the sparse voxel octree structure, rendering novel views can be accelerated by skipping the voxels containing no relevant scene content. Our method is typically over 10 times faster than the state-of-the-art (namely, NeRF (Mildenhall et al., 2020)) at inference time while achieving higher quality results. Furthermore, by utilizing an explicit sparse voxel representation, our method can easily be applied to scene editing and scene composition. We also demonstrate several challenging tasks, including multi-scene learning, free-viewpoint rendering of a moving human, and large-scale scene rendering.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs
    Bao, Yanqi
    Li, Yuxin
    Huo, Jing
    Ding, Tianyu
    Liang, Xinyue
    Li, Wenbin
    Gao, Yang
    arXiv, 2023,
  • [22] Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs
    Bao, Yanqi
    Li, Yuxin
    Huo, Jing
    Ding, Tianyu
    Liang, Xinyue
    Li, Wenbin
    Gao, Yang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2180 - 2188
  • [23] ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields
    Somraj, Nagabhushan
    Soundararajan, Rajiv
    PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023,
  • [24] RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs
    Niemeyer, Michael
    Barron, Jonathan T.
    Mildenhall, Ben
    Sajjadi, Mehdi S. M.
    Geiger, Andreas
    Radwan, Noha
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5470 - 5480
  • [25] SSVDAGs: Symmetry-aware Sparse Voxel DAGs
    Villanueva, Alberto Jaspe
    Marton, Fabio
    Gobbetti, Enrico
    PROCEEDINGS I3D 2016: 20TH ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES, 2016, : 7 - 14
  • [26] DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets
    Wang, Haiyang
    Shi, Chen
    Shi, Shaoshuai
    Lei, Meng
    Wang, Sen
    He, Di
    Schiele, Bernt
    Wang, Liwei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13520 - 13529
  • [27] Out-of-Core Construction of Sparse Voxel Octrees
    Baert, J.
    Lagae, A.
    Dutre, Ph.
    COMPUTER GRAPHICS FORUM, 2014, 33 (06) : 220 - 227
  • [28] CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
    Zhong, Yingji
    Hong, Lanqing
    Li, Zhenguo
    Xu, Dan
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21466 - 21475
  • [29] Voxel selection and neural decoding of fMRI data based on robust sparse programming with multi-dimensional derivative constraints
    Yu, Zhuliang
    Feng, Bao
    Gu, Zhenghui
    Xue, Zhenxia
    Li, Yuanqing
    Wang, Cong
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2015, 26 (01) : 225 - 241
  • [30] Voxel selection and neural decoding of fMRI data based on robust sparse programming with multi-dimensional derivative constraints
    Zhuliang Yu
    Bao Feng
    Zhenghui Gu
    Zhenxia Xue
    Yuanqing Li
    Cong Wang
    Multidimensional Systems and Signal Processing, 2015, 26 : 225 - 241