Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?

被引:0
|
作者
Zhu, Hanxin [1 ]
He, Tianyu [2 ]
Li, Xin [1 ]
Li, Bingchen [1 ]
Chen, Zhibo [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res Asia, Redmond, WA USA
关键词
D O I
10.1109/CVPR52733.2024.01918
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Radiance Field (NeRF) has achieved superior performance for novel view synthesis by modeling the scene with a Multi-Layer Perception (MLP) and a volume rendering procedure, however, when fewer known views are given (i.e., few-shot view synthesis), the model is prone to overfit the given views. To handle this issue, previous efforts have been made towards leveraging learned priors or introducing additional regularizations. In contrast, in this paper, we for the first time provide an orthogonal method from the perspective of network structure. Given the observation that trivially reducing the number of model parameters alleviates the overfitting issue, but at the cost of missing details, we propose the multi-input MLP (mi-MLP) that incorporates the inputs (i.e., location and viewing direction) of the vanilla MLP into each layer to prevent the overfitting issue without harming detailed synthesis. To further reduce the artifacts, we propose to model colors and volume density separately and present two regularization terms. Extensive experiments on multiple datasets demonstrate that: 1) although the proposed mi-MLP is easy to implement, it is surprisingly effective as it boosts the PSNR of the baseline from 14.73 to 24.23. 2) the overall framework achieves state-of- the-art results on a wide range of benchmarks.
引用
收藏
页码:20288 / 20298
页数:11
相关论文
共 50 条
  • [41] Category Decoupled Few-Shot Classification for Graph Neural Network
    Deng, Gelong
    Huang, Guoheng
    Chen, Ziyan
    Computer Engineering and Applications, 2024, 60 (02) : 129 - 136
  • [42] Local feature graph neural network for few-shot learning
    Weng P.
    Dong S.
    Ren L.
    Zou K.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (04) : 4343 - 4354
  • [43] Neural representational geometry underlies few-shot concept learning
    Sorscher, Ben
    Ganguli, Surya
    Sompolinsky, Haim
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2022, 119 (43)
  • [44] Few-shot palmprint recognition via graph neural networks
    Shao, Huikai
    Zhong, Dexing
    ELECTRONICS LETTERS, 2019, 55 (16) : 890 - 891
  • [45] MetaNODE: Prototype Optimization as a Neural ODE for Few-Shot Learning
    Zhang, Baoquan
    Li, Xutao
    Feng, Shanshan
    Ye, Yunming
    Ye, Rui
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9014 - 9021
  • [46] Few-Shot Audio Classification with Attentional Graph Neural Networks
    Zhang, Shilei
    Qin, Yong
    Sun, Kewei
    Lin, Yonghua
    INTERSPEECH 2019, 2019, : 3649 - 3653
  • [47] Graph Neural Networks With Triple Attention for Few-Shot Learning
    Cheng, Hao
    Zhou, Joey Tianyi
    Tay, Wee Peng
    Wen, Bihan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8225 - 8239
  • [48] Multi-view Interaction Learning for Few-Shot Relation Classification
    Han, Yi
    Qiao, Linbo
    Zheng, Jianming
    Kan, Zhigang
    Gao, Yifu
    Feng, Linhui
    Tang, Yu
    Zhai, Qi
    Li, Dongsheng
    Liao, Xiangke
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 649 - 658
  • [49] FeatWalk: Enhancing Few-Shot Classification through Local View Leveraging
    Chen, Dalong
    Zhang, Jianjia
    Zheng, Wei-Shi
    Wang, Ruixuan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1019 - 1027
  • [50] Few-Shot NeRF-Based View Synthesis for Viewpoint-Biased Camera Pose Estimation
    Ito, Sota
    Aizawa, Hiroaki
    Kato, Kunihito
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 308 - 319