3D Neural Field Generation using Triplane Diffusion

Cited by: 45
Authors
Shue, J. Ryan [1]
Chan, Eric Ryan [2]
Po, Ryan [2]
Ankner, Zachary [3, 4]
Wu, Jiajun [2]
Wetzstein, Gordon [2]
Affiliations
[1] Milton Acad, Milton, MA 02186 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] MIT, Cambridge, MA 02139 USA
[4] MosaicML, San Francisco, CA USA
DOI
10.1109/CVPR52729.2023.02000
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Diffusion models have emerged as the state-of-the-art for image generation, among other tasks. Here, we present an efficient diffusion-based model for 3D-aware generation of neural fields. Our approach pre-processes training data, such as ShapeNet meshes, by converting them to continuous occupancy fields and factoring them into a set of axis-aligned triplane feature representations. Thus, our 3D training scenes are all represented by 2D feature planes, and we can directly train existing 2D diffusion models on these representations to generate 3D neural fields with high quality and diversity, outperforming alternative approaches to 3D-aware generation. Our approach requires essential modifications to existing triplane factorization pipelines to make the resulting features easy to learn for the diffusion model. We demonstrate state-of-the-art results on 3D generation on several object classes from ShapeNet.
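The abstract's central idea, representing a 3D field by three axis-aligned 2D feature planes, can be illustrated with a minimal NumPy sketch. This is not the authors' code: the function names `bilinear_sample` and `query_triplane`, the plane ordering (XY, XZ, YZ), the coordinate range [-1, 1], and the choice to sum the three sampled features are all assumptions made for illustration. The decoder that would map the aggregated feature to an occupancy value is omitted.

```python
import numpy as np

def bilinear_sample(plane, u, v):
    """Bilinearly sample a (C, H, W) feature plane at coordinates u, v in [-1, 1].

    u indexes the width axis, v the height axis. Returns an (N, C) array.
    """
    _, H, W = plane.shape
    # Map [-1, 1] to continuous pixel coordinates, clamped to the plane.
    x = np.clip((u + 1.0) * 0.5 * (W - 1), 0.0, W - 1)
    y = np.clip((v + 1.0) * 0.5 * (H - 1), 0.0, H - 1)
    # Index of the top-left neighbour; capped so that +1 stays in bounds.
    x0 = np.clip(np.floor(x).astype(int), 0, W - 2)
    y0 = np.clip(np.floor(y).astype(int), 0, H - 2)
    wx, wy = x - x0, y - y0
    top = plane[:, y0, x0] * (1 - wx) + plane[:, y0, x0 + 1] * wx
    bottom = plane[:, y0 + 1, x0] * (1 - wx) + plane[:, y0 + 1, x0 + 1] * wx
    return (top * (1 - wy) + bottom * wy).T

def query_triplane(planes, points):
    """Look up features for 3D points from three axis-aligned planes.

    planes: (3, C, H, W) array, assumed order XY, XZ, YZ (hypothetical convention).
    points: (N, 3) array with coordinates in [-1, 1].
    Returns (N, C) features, aggregated by summation (an assumption).
    """
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    f_xy = bilinear_sample(planes[0], x, y)  # project onto the XY plane
    f_xz = bilinear_sample(planes[1], x, z)  # project onto the XZ plane
    f_yz = bilinear_sample(planes[2], y, z)  # project onto the YZ plane
    return f_xy + f_xz + f_yz
```

Because the three planes together form a stack of 2D feature images, a shape stored this way can be handled by a 2D diffusion model much like a multi-channel image, which is the property the abstract relies on.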
Pages: 20875-20886
Page count: 12