Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior

被引：1

作者：

Wu, Yiqian ^{[1
]}

Xu, Hao ^{[1
]}

Tang, Xiangjun ^{[1
]}

Chen, Xien ^{[2
]}

Tang, Siyu ^{[3
]}

Zhang, Zhebin ^{[4
]}

Li, Chen ^{[4
]}

Jin, Xiaogang ^{[1
]}

机构：

[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Peoples R China

[2] Yale Univ, New Haven, CT USA

[3] Swiss Fed Inst Technol, Zurich, Switzerland

[4] OPPO US Res Ctr, Menlo Pk, CA USA

来源：

ACM TRANSACTIONS ON GRAPHICS | 2024年 / 43卷 / 04期

基金：

中国国家自然科学基金;

关键词：

3D portrait generation; 3D-aware GANs; diffusion models;

D O I：

10.1145/3658162

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Existing neural rendering-based text-to-3D-portrait generation methods typically make use of human geometry prior and diffusion models to obtain guidance. However, relying solely on geometry information introduces issues such as the Janus problem, over-saturation, and over-smoothing. We present Portrait3D, a novel neural rendering-based framework with a novel joint geometry-appearance prior to achieve text-to-3D-portrait generation that overcomes the aforementioned issues. To accomplish this, we train a 3D portrait generator, 3DPortraitGAN(sic), as a robust prior. This generator is capable of producing 360 degrees. canonical 3D portraits, serving as a starting point for the subsequent diffusion-based generation process. To mitigate the "grid-like" artifact caused by the high-frequency information in the featuremap-based 3D representation commonly used by most 3D-aware GANs, we integrate a novel pyramid tri-grid 3D representation into 3DPortraitGAN(sic). To generate 3D portraits from text, we first project a randomly generated image aligned with the given prompt into the pre-trained 3DPortraitGAN(sic) 's latent space. The resulting latent code is then used to synthesize a pyramid tri-grid. Beginning with the obtained pyramid tri-grid, we use score distillation sampling to distill the diffusion model's knowledge into the pyramid tri-grid. Following that, we utilize the diffusion model to refine the rendered images of the 3D portrait and then use these refined images as training data to further optimize the pyramid tri-grid, effectively eliminating issues with unrealistic color and unnatural artifacts. Our experimental results show that Portrait3D can produce realistic, high-quality, and canonical 3D portraits that align with the prompt.

引用

页数：12

共 50 条

[41] Optimized Color Models for High-Quality 3D Scanning
Narayan, Karthik S.
Abbeel, Pieter
2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 2503 - 2510
[42] 3D High-quality Textile Reconstruction with Synthesized Texture
Hu, Pengpeng
Komura, Taku
Li, Duan
Wu, Ge
Zhong, Yueqi
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 355 - 364
[43] Producing High-quality 3D Maps from Lidar
Xiong, Biao
GIM INTERNATIONAL-THE WORLDWIDE MAGAZINE FOR GEOMATICS, 2016, 30 (02): : 34 - 35
[44] High-Quality Surface-Based 3D Reconstruction Using 2.5D Maps
Song, Lingxiao
Yu, Xiao
Di, Huijun
Wang, Weiran
2022 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS (VRW 2022), 2022, : 749 - 750
[45] Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Chen, Rui
Chen, Yongwei
Jiao, Ningxin
Jia, Kui
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22189 - 22199
[46] Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration
Yang, Xiaofeng
Liu, Fayao
Xu, Yi
Su, Hanjing
Wu, Qingyao
Lin, Guosheng
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6549 - 6557
[47] Naive Mesh-to-Mesh Coloured Model Generation using 3D GANs
Spick, Ryan J.
Demediuk, Simon
Walker, James Alfred
PROCEEDINGS OF THE AUSTRALASIAN COMPUTER SCIENCE WEEK MULTICONFERENCE (ACSW 2020), 2020,
[48] Decorate3D: Text-Driven High-Quality Texture Generation for Mesh Decoration in theWild
Guo, Yanhui
Zuo, Xinxin
Dai, Peng
Lu, Juwei
Wu, Xiaolin
Cheng, Li
Yan, Youliang
Xu, Songcen
Wu, Xiaofei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[49] High-Quality Depth Estimation Using an Exemplar 3D Model for Stereo Conversion
Lee, Jungjin
Kim, Younghui
Lee, Sangwoo
Kim, Bumki
Noh, Junyong
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2015, 21 (07) : 835 - 847
[50] High-quality Structured-light Scanning of 3D Objects using Turntable
Kazo, Csaba
Hajder, Levente
3RD IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2012), 2012, : 553 - 557

← 1 2 3 4 5 →