Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?

Cited by: 0
Authors
Zhu, Hanxin [1]
He, Tianyu [2]
Li, Xin [1]
Li, Bingchen [1]
Chen, Zhibo [1]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res Asia, Redmond, WA USA
DOI
10.1109/CVPR52733.2024.01918
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural Radiance Field (NeRF) has achieved superior performance for novel view synthesis by modeling the scene with a Multi-Layer Perceptron (MLP) and a volume rendering procedure. However, when fewer known views are given (i.e., few-shot view synthesis), the model is prone to overfitting the given views. To handle this issue, previous efforts have leveraged learned priors or introduced additional regularizations. In contrast, in this paper we provide, for the first time, an orthogonal method from the perspective of network structure. Given the observation that trivially reducing the number of model parameters alleviates the overfitting issue, but at the cost of missing details, we propose the multi-input MLP (mi-MLP), which incorporates the inputs (i.e., location and viewing direction) of the vanilla MLP into each layer to prevent overfitting without harming detailed synthesis. To further reduce artifacts, we propose to model colors and volume density separately and present two regularization terms. Extensive experiments on multiple datasets demonstrate that: 1) although the proposed mi-MLP is easy to implement, it is surprisingly effective, boosting the PSNR of the baseline from 14.73 to 24.23; 2) the overall framework achieves state-of-the-art results on a wide range of benchmarks.
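The core idea stated in the abstract, feeding the raw inputs (location and viewing direction) into every layer rather than only the first, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. Concatenation as the injection mechanism, the layer sizes, the initialization, and all function names (`init_mi_mlp`, `mi_mlp_forward`, and so on) are assumptions made for illustration. Positional encoding, the separate color and density branches, and the two regularization terms are omitted.

```python
import random


def linear(x, weights, bias):
    """Dense layer: one output value per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(x):
    return [max(0.0, v) for v in x]


def init_layer(n_in, n_out, rng):
    # Illustrative Gaussian initialization (an assumption, not from the paper).
    scale = (2.0 / n_in) ** 0.5
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def init_mi_mlp(n_input, n_hidden, n_layers, n_out, seed=0):
    rng = random.Random(seed)
    layers = [init_layer(n_input, n_hidden, rng)]
    # Every later hidden layer sees the previous features plus the raw input.
    for _ in range(n_layers - 1):
        layers.append(init_layer(n_hidden + n_input, n_hidden, rng))
    head = init_layer(n_hidden + n_input, n_out, rng)
    return layers, head


def mi_mlp_forward(params, x):
    layers, head = params
    h = relu(linear(x, *layers[0]))
    for weights, bias in layers[1:]:
        # List concatenation re-injects the raw input at this layer
        # (assumed mechanism; the abstract does not specify it).
        h = relu(linear(h + x, weights, bias))
    return linear(h + x, *head)


# Hypothetical input: 3D location followed by 3D viewing direction.
x = [0.1, -0.2, 0.3, 0.0, 0.0, 1.0]
params = init_mi_mlp(n_input=6, n_hidden=16, n_layers=4, n_out=4)
out = mi_mlp_forward(params, x)
print(len(out))
```

A vanilla MLP would use `linear(h, ...)` in the loop instead. The only structural change here is that each layer after the first takes `n_hidden + n_input` features.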
Pages: 20288 - 20298
Number of pages: 11
Related papers
50 in total
  • [21] Neural Snowball for Few-Shot Relation Learning
    Gao, Tianyu
    Han, Xu
    Xie, Ruobing
    Liu, Zhiyuan
    Lin, Fen
    Lin, Leyu
    Sun, Maosong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7772 - 7779
  • [22] Few-Shot Remote Sensing Novel View Synthesis with Geometry Constraint NeRF
    Jiaming Kang
    Keyan Chen
    Zhengxia Zou
    Zhenwei Shi
    Guidance, Navigation and Control, 2024, (03) : 70 - 91
  • [23] FewarNet: An Efficient Few-Shot View Synthesis Network Based on Trend Regularization
    Song, Chenxi
    Wang, Shigang
    Wei, Jian
    Zhao, Yan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9264 - 9280
  • [24] CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency
    Zhu, Hanxin
    Chen, Zhibo
    2024 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES, VR 2024, 2024, : 960 - 968
  • [25] Convolutional Siamese neural network for few-shot multi-view face identification
    Meddad, Majdouline
    Moujahdi, Chouaib
    Mikram, Mounia
    Rziza, Mohammed
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 3135 - 3144
  • [26] Convolutional Siamese neural network for few-shot multi-view face identification
    Majdouline Meddad
    Chouaib Moujahdi
    Mounia Mikram
    Mohammed Rziza
    Signal, Image and Video Processing, 2023, 17 : 3135 - 3144
  • [27] Rethinking the Correlation in Few-Shot Segmentation: A Buoys View
    Wang, Yuan
    Sun, Rui
    Zhang, Tianzhu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7183 - 7192
  • [28] Few-Shot Partial Multi-View Learning
    Zhou Y.
    Guo Y.
    Hao S.
    Hong R.
    Luo J.
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45 (10) : 11824 - 11841
  • [29] FSGS: Real-Time Few-Shot View Synthesis Using Gaussian Splatting
    Zhu, Zehao
    Fan, Zhiwen
    Jiang, Yifan
    Wang, Zhangyang
    COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097 : 145 - 163
  • [30] Few-shot Video-to-Video Synthesis
    Wang, Ting-Chun
    Liu, Ming-Yu
    Tao, Andrew
    Liu, Guilin
    Kautz, Jan
    Catanzaro, Bryan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32