Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation

被引:81
|
作者
Chen, Rui [1 ]
Chen, Yongwei [1 ]
Jiao, Ningxin [1 ]
Jia, Kui [1 ]
机构
[1] South China Univ Technol, Guangzhou, Guangdong, Peoples R China
关键词
D O I
10.1109/ICCV51070.2023.02033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic 3D content creation has achieved rapid progress recently due to the availability of pre-trained, large language models and image diffusion models, forming the emerging topic of text-to-3D content creation. Existing text-to-3D methods commonly use implicit scene representations, which couple the geometry and appearance via volume rendering and are suboptimal in terms of recovering finer geometries and achieving photorealistic rendering; consequently, they are less effective for generating highquality 3D assets. In this work, we propose a new method of Fantasia3D for high- quality text-to-3D content creation. Key to Fantasia3D is the disentangled modeling and learning of geometry and appearance. For geometry learning, we rely on a hybrid scene representation, and propose to encode surface normal extracted from the representation as the input of the image diffusion model. For appearance modeling, we introduce the spatially varying bidirectional reflectance distribution function (BRDF) into the text-to-3D task, and learn the surface material for photorealistic rendering of the generated surface. Our disentangled framework is more compatible with popular graphics engines, supporting relighting, editing, and physical simulation of the generated 3D assets. We conduct thorough experiments that show the advantages of our method over existing ones under different text-to-3D task settings. Project page and source codes: https://fantasia3d.github.io/.
引用
收藏
页码:22189 / 22199
页数:11
相关论文
共 50 条
  • [1] Magic3D: High-Resolution Text-to-3D Content Creation
    Lin, Chen-Hsuan
    Gao, Jun
    Tang, Luming
    Takikawa, Towaki
    Zeng, Xiaohui
    Huang, Xun
    Kreis, Karsten
    Fidler, Sanja
    Liu, Ming-Yu
    Lin, Tsung-Yi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 300 - 309
  • [2] Transformation of Text-to-3D Graphics
    Kadir, Rabiah Abdul
    Ahmad, Azlina
    Marstawi, Ali
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1085 - 1089
  • [3] Text-to-3D Shape Generation
    Lee, H.
    Savva, M.
    Chang, A. X.
    COMPUTER GRAPHICS FORUM, 2024, 43 (02)
  • [4] Instant3D: Instant Text-to-3D Generation
    Li, Ming
    Zhou, Pan
    Liu, Jia-Wei
    Keppo, Jussi
    Lin, Min
    Yan, Shuicheng
    Xu, Xiangyu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4456 - 4472
  • [5] AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose
    Zhang, Huichao
    Chen, Bowen
    Yang, Hao
    Qu, Liao
    Wang, Xu
    Chen, Li
    Long, Chao
    Zhu, Feida
    Du, Daniel
    Zheng, Min
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7124 - 7132
  • [6] Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting
    Maier, Robert
    Kim, Kihwan
    Cremers, Daniel
    Kautz, Jan
    Niessner, Matthias
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3133 - 3141
  • [7] Correction: Instant3D: Instant Text-to-3D Generation
    Ming Li
    Pan Zhou
    Jia-Wei Liu
    Jussi Keppo
    Min Lin
    Shuicheng Yan
    Xiangyu Xu
    International Journal of Computer Vision, 2025, 133 (1) : 509 - 509
  • [8] ATT3D: Amortized Text-to-3D Object Synthesis
    Lorraine, Jonathan
    Xie, Kevin
    Zeng, Xiaohui
    Lin, Chen-Hsuan
    Takikawa, Towaki
    Sharp, Nicholas
    Lin, Tsung-Yi
    Liu, Ming-Yu
    Fidler, Sanja
    Lucas, James
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17900 - 17910
  • [9] Control3D: Towards Controllable Text-to-3D Generation
    Chen, Yang
    Pan, Yingwei
    Li, Yehao
    Yao, Ting
    Mei, Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1148 - 1156
  • [10] DreamFont3D: Personalized Text-to-3D Artistic Font Generation
    Li, Xiang
    Meng, Lei
    Wu, Lei
    Li, Manyi
    Meng, Xiangxu
    PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS, 2024,