3D Neural Field Generation using Triplane Diffusion

被引:45
|
作者
Shue, J. Ryan [1 ]
Chan, Eric Ryan [2 ]
Po, Ryan [2 ]
Ankner, Zachary [3 ,4 ]
Wu, Jiajun [2 ]
Wetzstein, Gordon [2 ]
机构
[1] Milton Acad, Milton, MA 02186 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] MIT, Cambridge, MA 02139 USA
[4] MosaicML, San Francisco, CA USA
关键词
D O I
10.1109/CVPR52729.2023.02000
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diffusion models have emerged as the state-of-the-art for image generation, among other tasks. Here, we present an efficient diffusion-based model for 3D-aware generation of neural fields. Our approach pre-processes training data, such as ShapeNet meshes, by converting them to continuous occupancy fields and factoring them into a set of axis-aligned triplane feature representations. Thus, our 3D training scenes are all represented by 2D feature planes, and we can directly train existing 2D diffusion models on these representations to generate 3D neural fields with high quality and diversity, outperforming alternative approaches to 3D-aware generation. Our approach requires essential modifications to existing triplane factorization pipelines to make the resulting features easy to learn for the diffusion model. We demonstrate state-of-the-art results on 3D generation on several object classes from ShapeNet.
引用
收藏
页码:20875 / 20886
页数:12
相关论文
共 50 条
  • [1] Neural Wavelet-domain Diffusion for 3D Shape Generation
    Hui, Ka-Hei
    Li, Ruihui
    Hu, Jingyu
    Fu, Chi-Wing
    PROCEEDINGS SIGGRAPH ASIA 2022, 2022,
  • [2] Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation
    Kim, Kihong
    Lee, Haneol
    Park, Jihye
    Kim, Seyeon
    Lee, Kwanghee
    Kim, Seungryong
    Yoo, Jaejun
    COMPUTER VISION - ECCV 2024, PT LII, 2025, 15110 : 148 - 165
  • [3] 3D Vessel Graph Generation Using Denoising Diffusion
    Prabhakar, Chinmay
    Shit, Suprosanna
    Musio, Fabio
    Yang, Kaiyuan
    Amiranashvili, Tamaz
    Paetzold, Johannes C.
    Li, Hongwei Bran
    Menze, Bjoern
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XI, 2024, 15011 : 3 - 13
  • [4] Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation
    Schroeppel, Philipp
    Wewer, Christopher
    Lenssen, Jan Eric
    Ilg, Eddy
    Brox, Thomas
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 8785 - 8794
  • [5] Compositional 3D Scene Generation using Locally Conditioned Diffusion
    Po, Ryan
    Wetzstein, Gordon
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 651 - 663
  • [6] Diffusion models for 3D generation: A survey
    Wang, Chen
    Peng, Hao-Yang
    Liu, Ying-Tian
    Gu, Jiatao
    Hu, Shi-Min
    COMPUTATIONAL VISUAL MEDIA, 2025, 11 (01): : 1 - 28
  • [7] Equivariant Diffusion for Molecule Generation in 3D
    Hoogeboom, Emiel
    Satorras, Victor Garcia
    Vignac, Clement
    Welling, Max
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [8] Neural Wavelet-domain Diffusion for 3D Shape Generation, Inversion, and Manipulation
    Hu, Jingyu
    Hui, Ka-Hei
    Liu, Zhengzhe
    Li, Ruihui
    Fu, Chi-Wing
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (02):
  • [9] LN3DIFF: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
    Lan, Yushi
    Hong, Fangzhou
    Yang, Shuai
    Zhou, Shangchen
    Meng, Xuyi
    Dai, Bo
    Pan, Xingang
    Loy, Chen Change
    COMPUTER VISION-ECCV 2024, PT IV, 2025, 15062 : 112 - 130
  • [10] Realistic Tumor generation using 3D conditional latent diffusion model
    Hu, Rui
    Yoon, Siyeop
    Wu, Dufan
    Tivnan, Matthew
    Chen, Zhennong
    Wang, Yuang
    Luo, Jie
    Cui, Jianan
    Li, Quanzheng
    Liu, Huafeng
    Guo, Ning
    JOURNAL OF NUCLEAR MEDICINE, 2024, 65