Diffusion models for 3D generation: A survey

Authors
Wang, Chen [1]
Peng, Hao-Yang [2]
Liu, Ying-Tian [2]
Gu, Jiatao [3]
Hu, Shi-Min [2]
Affiliations
[1] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[3] Apple, Machine Learning Res, ML, New York, NY USA
Source
COMPUTATIONAL VISUAL MEDIA | 2025, Vol. 11, No. 1
Keywords
diffusion models; 3D generation; generative models; AIGC
DOI
10.26599/CVM.2025.9450452
Chinese Library Classification
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Denoising diffusion models have demonstrated tremendous success in modeling data distributions and synthesizing high-quality samples. In the 2D image domain, they have become the state-of-the-art and are capable of generating photo-realistic images with high controllability. More recently, researchers have begun to explore how to utilize diffusion models to generate 3D data, as doing so has more potential in real-world applications. This requires careful design choices in two key ways: identifying a suitable 3D representation and determining how to apply the diffusion process. In this survey, we provide the first comprehensive review of diffusion models for manipulating 3D content, including 3D generation, reconstruction, and 3D-aware image synthesis. We classify existing methods into three major categories: 2D space diffusion with pretrained models, 2D space diffusion without pretrained models, and 3D space diffusion. We also summarize popular datasets used for 3D generation with diffusion models. Along with this survey, we maintain a repository https://github.com/cwchenwang/awesome-3d-diffusion to track the latest relevant papers and codebases. Finally, we pose current challenges for diffusion models for 3D generation, and suggest future research directions.
Pages: 1 / 28
Number of pages: 28