Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior

被引:1
|
作者
Wu, Yiqian [1 ]
Xu, Hao [1 ]
Tang, Xiangjun [1 ]
Chen, Xien [2 ]
Tang, Siyu [3 ]
Zhang, Zhebin [4 ]
Li, Chen [4 ]
Jin, Xiaogang [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Peoples R China
[2] Yale Univ, New Haven, CT USA
[3] Swiss Fed Inst Technol, Zurich, Switzerland
[4] OPPO US Res Ctr, Menlo Pk, CA USA
来源
ACM TRANSACTIONS ON GRAPHICS | 2024年 / 43卷 / 04期
基金
中国国家自然科学基金;
关键词
3D portrait generation; 3D-aware GANs; diffusion models;
D O I
10.1145/3658162
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Existing neural rendering-based text-to-3D-portrait generation methods typically make use of human geometry prior and diffusion models to obtain guidance. However, relying solely on geometry information introduces issues such as the Janus problem, over-saturation, and over-smoothing. We present Portrait3D, a novel neural rendering-based framework with a novel joint geometry-appearance prior to achieve text-to-3D-portrait generation that overcomes the aforementioned issues. To accomplish this, we train a 3D portrait generator, 3DPortraitGAN(sic), as a robust prior. This generator is capable of producing 360 degrees. canonical 3D portraits, serving as a starting point for the subsequent diffusion-based generation process. To mitigate the "grid-like" artifact caused by the high-frequency information in the featuremap-based 3D representation commonly used by most 3D-aware GANs, we integrate a novel pyramid tri-grid 3D representation into 3DPortraitGAN(sic). To generate 3D portraits from text, we first project a randomly generated image aligned with the given prompt into the pre-trained 3DPortraitGAN(sic) 's latent space. The resulting latent code is then used to synthesize a pyramid tri-grid. Beginning with the obtained pyramid tri-grid, we use score distillation sampling to distill the diffusion model's knowledge into the pyramid tri-grid. Following that, we utilize the diffusion model to refine the rendered images of the 3D portrait and then use these refined images as training data to further optimize the pyramid tri-grid, effectively eliminating issues with unrealistic color and unnatural artifacts. Our experimental results show that Portrait3D can produce realistic, high-quality, and canonical 3D portraits that align with the prompt.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Optimized Color Models for High-Quality 3D Scanning
    Narayan, Karthik S.
    Abbeel, Pieter
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 2503 - 2510
  • [42] 3D High-quality Textile Reconstruction with Synthesized Texture
    Hu, Pengpeng
    Komura, Taku
    Li, Duan
    Wu, Ge
    Zhong, Yueqi
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 355 - 364
  • [43] Producing High-quality 3D Maps from Lidar
    Xiong, Biao
    GIM INTERNATIONAL-THE WORLDWIDE MAGAZINE FOR GEOMATICS, 2016, 30 (02): : 34 - 35
  • [44] High-Quality Surface-Based 3D Reconstruction Using 2.5D Maps
    Song, Lingxiao
    Yu, Xiao
    Di, Huijun
    Wang, Weiran
    2022 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS (VRW 2022), 2022, : 749 - 750
  • [45] Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
    Chen, Rui
    Chen, Yongwei
    Jiao, Ningxin
    Jia, Kui
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22189 - 22199
  • [46] Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration
    Yang, Xiaofeng
    Liu, Fayao
    Xu, Yi
    Su, Hanjing
    Wu, Qingyao
    Lin, Guosheng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6549 - 6557
  • [47] Naive Mesh-to-Mesh Coloured Model Generation using 3D GANs
    Spick, Ryan J.
    Demediuk, Simon
    Walker, James Alfred
    PROCEEDINGS OF THE AUSTRALASIAN COMPUTER SCIENCE WEEK MULTICONFERENCE (ACSW 2020), 2020,
  • [48] Decorate3D: Text-Driven High-Quality Texture Generation for Mesh Decoration in theWild
    Guo, Yanhui
    Zuo, Xinxin
    Dai, Peng
    Lu, Juwei
    Wu, Xiaolin
    Cheng, Li
    Yan, Youliang
    Xu, Songcen
    Wu, Xiaofei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [49] High-Quality Depth Estimation Using an Exemplar 3D Model for Stereo Conversion
    Lee, Jungjin
    Kim, Younghui
    Lee, Sangwoo
    Kim, Bumki
    Noh, Junyong
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2015, 21 (07) : 835 - 847
  • [50] High-quality Structured-light Scanning of 3D Objects using Turntable
    Kazo, Csaba
    Hajder, Levente
    3RD IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2012), 2012, : 553 - 557