Efficient 3D View Synthesis from Single-Image Utilizing Diffusion Priors

被引：0

作者：

Wen, Yifan ^{[1
]}

Wang, Zitong ^{[1
]}

Li, Zhuoyuan ^{[1
]}

Wei, Dongxing ^{[1
]}

Sun, Yi ^{[1
]}

机构：

[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian, Liaoning, Peoples R China

来源：

ADVANCES IN NEURAL NETWORKS-ISNN 2024 | 2024年 / 14827卷

关键词：

Novel View Synthesis; Single-view; Diffusion Prior; NEURAL RADIANCE FIELDS; SCENES;

D O I：

10.1007/978-981-97-4399-5_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we introduce a novel framework for synthesizing novel views of objects from a single image. Leveraging the capabilities of fine-tuned diffusion models, our method combines latent 3D knowledge as priors to reconstruct 3D scenes. This facilitates the generation of high-fidelity 3D content from a solitary 2D viewpoint. We employ a two-stage process, beginning with fine-tuning a diffusion model on a given image viewpoint, followed by optimizing a neural radiance field using score distillation sampling (SDS). Our technique not only ensures fidelity to the original image but also enhances the perceptual understanding of the object in three dimensions. This method is effective for a wide range of objects, irrespective of the need for training on multiple views, and is applicable to both real-world and synthetic datasets. The resultant 3D reconstructions exhibit detailed geometry and realistic textures, closely matching the input images.

引用

页码：93 / 102

页数：10

共 50 条

[31] StoDIP: Efficient 3D MRF Image Reconstruction with Deep Image Priors and Stochastic Iterations
Mayo, Perla
Cencini, Matteo
Pirkl, Carolin M.
Menzel, Marion, I
Tosetti, Michela
Menze, Bjoern H.
Golbabaee, Mohammad
MACHINE LEARNING IN MEDICAL IMAGING, PT II, MLMI 2024, 2025, 15242 : 128 - 137
[32] Bridging Implicit and Explicit Geometric Transformation for Single-Image View Synthesis
Park, Byeongjun
Go, Hyojun
Kim, Changick
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6326 - 6340
[33] Single Image 3D Without a Single 3D Image
Fouhey, David F.
Hussain, Wajahat
Gupta, Abhinav
Hebert, Martial
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1053 - 1061
[34] Recurrent Diffusion for 3D Point Cloud Generation From a Single Image
Zhou, Yan
Ye, Dewang
Zhang, Huaidong
Xu, Xuemiao
Sun, Huajie
Xu, Yewen
Liu, Xiangyu
Zhou, Yuexia
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1753 - 1765
[35] Utilizing the Neural Renderer for Accurate 3D Face Reconstruction from a Single Image
Wei, Wei
Zhang, Danni
Wang, Huichen
Duan, Xiaodong
Guo, Chen
NEURAL PROCESSING LETTERS, 2023, 55 (08) : 10535 - 10553
[36] Utilizing the Neural Renderer for Accurate 3D Face Reconstruction from a Single Image
Wei Wei
Danni Zhang
Huichen Wang
Xiaodong Duan
Chen Guo
Neural Processing Letters, 2023, 55 : 10535 - 10553
[37] Fine-grained 3D Face Reconstruction from a Single Image using Illumination Priors
Qiu, Weibin
Yu, Yao
Zhou, Yu
Du, Sidan
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 876 - 883
[38] View Generalization for Single Image Textured 3D Models
Bhattad, Anand
Dundar, Aysegul
Liu, Guilin
Tao, Andrew
Catanzaro, Bryan
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6077 - 6086
[39] A Self-Supervised Bootstrap Method for Single-Image 3D Face Reconstruction
Xing, Yifan
Tewari, Rahul
Mendonc, Paulo R. S.
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1014 - 1023
[40] Automatic, effective, and efficient 3D face reconstruction from arbitrary view image
Wang, CH
Yan, SC
Li, H
Zhang, HJ
Li, MJ
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS, 2004, 3332 : 553 - 560

← 1 2 3 4 5 →