X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360° Insufficient RGB-D Views

Cited by: 2
Authors
Zhu, Haoyi [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
Keywords
DOI
10.1109/WACV56688.2023.00572
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural Radiance Fields (NeRFs), despite their outstanding performance on novel view synthesis, often need dense input views. Many papers train one model for each scene respectively and few of them explore incorporating multimodal data into this problem. In this paper, we focus on a rarely discussed but important setting: can we train one model that can represent multiple scenes, with 360° insufficient views and RGB-D images? We refer insufficient views to few extremely sparse and almost non-overlapping views. To deal with it, X-NeRF, a fully explicit approach which learns a general scene completion process instead of a coordinate-based mapping, is proposed. Given a few insufficient RGB-D input views, X-NeRF first transforms them to a sparse point cloud tensor and then applies a 3D sparse generative Convolutional Neural Network (CNN) to complete it to an explicit radiance field whose volumetric rendering can be conducted fast without running networks during inference. To avoid overfitting, besides common rendering loss, we apply perceptual loss as well as view augmentation through random rotation on point clouds. The proposed methodology significantly out-performs previous implicit methods in our setting, indicating the great potential of proposed problem and approach. Codes and data are available at https://github.com/HaoyiZhu/XNeRF.
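The input stage named in the abstract (RGB-D views lifted to a sparse point cloud, plus random-rotation augmentation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the pinhole intrinsics `K`, the camera-to-world pose, the function names, the voxel size, and the choice of rotating about the z-axis are all assumptions made for illustration.

```python
import numpy as np

def backproject_rgbd(depth, rgb, K, cam_to_world):
    """Lift one RGB-D image to a colored world-space point cloud.

    Assumes a pinhole camera with 3x3 intrinsics K and a 4x4
    camera-to-world pose (both hypothetical inputs).
    """
    h, w = depth.shape
    v, u = np.mgrid[0:h, 0:w]          # pixel row (v) and column (u) indices
    valid = depth > 0                  # skip pixels with no depth reading
    z = depth[valid]
    x = (u[valid] - K[0, 2]) * z / K[0, 0]
    y = (v[valid] - K[1, 2]) * z / K[1, 1]
    pts_cam = np.stack([x, y, z, np.ones_like(z)], axis=1)  # (N, 4) homogeneous
    pts_world = (cam_to_world @ pts_cam.T).T[:, :3]
    return pts_world, rgb[valid]

def voxelize(points, colors, voxel_size):
    """Quantize points to integer voxel coordinates, averaging colors per voxel."""
    coords = np.floor(points / voxel_size).astype(np.int64)
    uniq, inverse = np.unique(coords, axis=0, return_inverse=True)
    inverse = inverse.reshape(-1)      # guard against NumPy-version shape differences
    feats = np.zeros((len(uniq), colors.shape[1]))
    np.add.at(feats, inverse, colors)  # sum colors falling into the same voxel
    counts = np.bincount(inverse, minlength=len(uniq))
    return uniq, feats / counts[:, None]

def random_rotation_z(points, rng):
    """Rotate a point cloud by a random angle about the z-axis (assumed axis)."""
    theta = rng.uniform(0.0, 2.0 * np.pi)
    c, s = np.cos(theta), np.sin(theta)
    R = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    return points @ R.T
```

The integer coordinates and per-voxel features returned by `voxelize` are the kind of sparse (coordinates, features) pair that a sparse 3D CNN typically consumes; the completion network and the volumetric renderer themselves are outside the scope of this sketch.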
Pages: 5755-5764
Page count: 10
Related papers
3 items in total
  • [1] RGB-D Object Discovery via Multi-Scene Analysis
    Herbst, Evan
    Ren, Xiaofeng
    Fox, Dieter
    2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011,
  • [2] NeRF-OR: neural radiance fields for operating room scene reconstruction from sparse-view RGB-D videos
    Gerats, Beerend G. A.
    Wolterink, Jelmer M.
    Broeders, Ivo A. M. J.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2025, 20 (01) : 147 - 156
  • [3] 3D Multi-scene Stylization Based on Conditional Neural Radiance Fields
    Zhang, Sijia
    Liu, Ting
    Li, Zhuoyuan
    Sun, Yi
    ADVANCES IN NEURAL NETWORKS-ISNN 2024, 2024, 14827 : 103 - 112