Efficient 3D View Synthesis from Single-Image Utilizing Diffusion Priors

Authors
Wen, Yifan [1]
Wang, Zitong [1]
Li, Zhuoyuan [1]
Wei, Dongxing [1]
Sun, Yi [1]
Affiliations
[1] Dalian University of Technology, School of Information and Communication Engineering, Dalian, Liaoning, People's Republic of China
Keywords
Novel View Synthesis; Single-view; Diffusion Prior; Neural Radiance Fields; Scenes
DOI
10.1007/978-981-97-4399-5_9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we introduce a framework for synthesizing novel views of an object from a single image. The method uses the latent 3D knowledge in a fine-tuned diffusion model as a prior for reconstructing the 3D scene, which makes it possible to generate high-fidelity 3D content from a single 2D viewpoint. We employ a two-stage process: first, a diffusion model is fine-tuned on the given image viewpoint; second, a neural radiance field is optimized using score distillation sampling (SDS). Our technique preserves fidelity to the original image while improving the perceptual understanding of the object in three dimensions. The method is effective for a wide range of objects, does not require training on multiple views, and is applicable to both real-world and synthetic datasets. The resulting 3D reconstructions exhibit detailed geometry and realistic textures that closely match the input images.
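The abstract names score distillation sampling (SDS) as the second-stage optimizer but gives no details. The following is a minimal toy sketch of a generic SDS update, not the authors' implementation. Everything in it is an assumption made for illustration: the "renderer" is the identity (the parameters are the image itself), the fine-tuned diffusion model is replaced by a hypothetical analytic noise predictor whose data distribution is a single target point, and the weighting w(t) = sigma is chosen for convenience. It only shows the shape of the update: add noise to the rendering, predict the noise, and step along the weighted residual (predicted noise minus injected noise) without differentiating through the predictor.

```python
import math
import random


def toy_noise_predictor(x_t, alpha, sigma, target):
    # Hypothetical stand-in for a fine-tuned diffusion model: the exact noise
    # predictor when the data distribution is the single point `target`.
    return [(xt - alpha * m) / sigma for xt, m in zip(x_t, target)]


def sds_step(theta, target, rng, lr=0.1):
    # Sample a noise level with alpha^2 + sigma^2 = 1 (variance-preserving).
    t = rng.uniform(0.02, 0.98)
    alpha, sigma = math.sqrt(1.0 - t), math.sqrt(t)
    eps = [rng.gauss(0.0, 1.0) for _ in theta]
    # Assumption: the renderer g(theta) is the identity, so dx/dtheta = I.
    x_t = [alpha * x + sigma * e for x, e in zip(theta, eps)]
    eps_hat = toy_noise_predictor(x_t, alpha, sigma, target)
    # SDS gradient: w(t) * (eps_hat - eps), with the assumed weight w(t) = sigma.
    grad = [sigma * (eh - e) for eh, e in zip(eps_hat, eps)]
    return [x - lr * g for x, g in zip(theta, grad)]


def run_sds(steps=500, seed=0):
    rng = random.Random(seed)
    target = [0.2, -0.5, 0.9]  # illustrative values only
    theta = [0.0, 0.0, 0.0]
    for _ in range(steps):
        theta = sds_step(theta, target, rng)
    return theta, target
```

Calling `run_sds()` returns the optimized parameters together with the toy target; in this simplified setting the parameters drift toward the point the stand-in predictor considers likely. In a real pipeline the identity renderer would be replaced by volume rendering of a neural radiance field from a sampled camera pose, and the analytic predictor by the diffusion model's noise estimate.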
Pages: 93-102
Page count: 10