X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360° Insufficient RGB-D Views

Cited by: 2

Authors:

Zhu, Haoyi [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

Source:

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023

Keywords:

DOI:

10.1109/WACV56688.2023.00572

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Neural Radiance Fields (NeRFs), despite their outstanding performance on novel view synthesis, often need dense input views. Many papers train a separate model for each scene, and few explore incorporating multimodal data into this problem. In this paper, we focus on a rarely discussed but important setting: can we train one model that represents multiple scenes from 360° insufficient views and RGB-D images? By insufficient views we mean a few extremely sparse and almost non-overlapping views. To address this setting, we propose X-NeRF, a fully explicit approach that learns a general scene completion process instead of a coordinate-based mapping. Given a few insufficient RGB-D input views, X-NeRF first transforms them into a sparse point cloud tensor and then applies a 3D sparse generative Convolutional Neural Network (CNN) to complete it into an explicit radiance field, whose volumetric rendering can be performed quickly without running networks during inference. To avoid overfitting, in addition to the common rendering loss, we apply a perceptual loss as well as view augmentation through random rotation of the point clouds. The proposed method significantly outperforms previous implicit methods in our setting, indicating the great potential of the proposed problem and approach. Code and data are available at https://github.com/HaoyiZhu/XNeRF.
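
The first stage described in the abstract (turning RGB-D views into a sparse point cloud tensor, with random rotation as augmentation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the pinhole intrinsics (fx, fy, cx, cy), the choice to rotate only about the z axis, and the per-voxel color averaging are all assumptions made for illustration, and the 3D sparse generative CNN and the rendering step are omitted.

```python
import math
import random


def backproject(depth, rgb, fx, fy, cx, cy):
    """Lift an RGB-D image to (point, color) pairs in camera coordinates.

    Assumes a pinhole camera; depth[v][u] <= 0 marks an invalid pixel.
    """
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:
                continue  # no depth measurement at this pixel
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append(((x, y, z), rgb[v][u]))
    return points


def rotate_z(points, angle):
    """Rotate every point about the z axis by `angle` radians (augmentation)."""
    c, s = math.cos(angle), math.sin(angle)
    return [((c * x - s * y, s * x + c * y, z), col) for (x, y, z), col in points]


def voxelize(points, voxel_size):
    """Quantize points to integer voxel coordinates.

    Returns a dict {(i, j, k): mean_color}, i.e. a sparse tensor that stores
    only occupied voxels.
    """
    sums = {}
    for (x, y, z), col in points:
        key = (
            math.floor(x / voxel_size),
            math.floor(y / voxel_size),
            math.floor(z / voxel_size),
        )
        acc = sums.setdefault(key, [0.0, 0.0, 0.0, 0])
        for i in range(3):
            acc[i] += col[i]
        acc[3] += 1
    return {k: tuple(a / acc[3] for a in acc[:3]) for k, acc in sums.items()}


# Hypothetical 2x2 RGB-D image; one pixel has no valid depth.
depth = [[1.0, 0.0], [2.0, 1.0]]
rgb = [[(1, 0, 0), (0, 1, 0)], [(0, 0, 1), (1, 1, 1)]]

cloud = backproject(depth, rgb, fx=1.0, fy=1.0, cx=0.0, cy=0.0)
augmented = rotate_z(cloud, random.uniform(0.0, 2.0 * math.pi))
sparse = voxelize(augmented, voxel_size=0.5)
print(len(cloud), "points,", len(sparse), "occupied voxels")
```

In a sketch like this, the dictionary of occupied voxels plays the role of the sparse tensor that a sparse convolution library would consume. Applying a fresh random rotation at each training step is one plausible way to realize the view augmentation mentioned in the abstract.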


Pages: 5755-5764

Page count: 10

3 items in total

[1] RGB-D Object Discovery via Multi-Scene Analysis
Herbst, Evan
Ren, Xiaofeng
Fox, Dieter
2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011,
[2] NeRF-OR: neural radiance fields for operating room scene reconstruction from sparse-view RGB-D videos
Gerats, Beerend G. A.
Wolterink, Jelmer M.
Broeders, Ivo A. M. J.
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2025, 20 (01) : 147 - 156
[3] 3D Multi-scene Stylization Based on Conditional Neural Radiance Fields
Zhang, Sijia
Liu, Ting
Li, Zhuoyuan
Sun, Yi
ADVANCES IN NEURAL NETWORKS-ISNN 2024, 2024, 14827 : 103 - 112
