AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

被引:0
|
作者
Zhang, Huichao [1 ]
Chen, Bowen [1 ]
Yang, Hao [1 ]
Qu, Liao [1 ,2 ]
Wang, Xu [1 ]
Chen, Li [1 ]
Long, Chao [1 ]
Zhu, Feida [1 ]
Du, Daniel [1 ]
Zheng, Min [1 ]
机构
[1] ByteDance, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating expressive, diverse and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of modeling and texturing in 3D that ensure details and various styles (realistic, fictional, etc). We present AvatarVerse, a stable pipeline for generating expressive high-quality 3D avatars from nothing but text descriptions and pose guidance. In specific, we introduce a 2D diffusion model conditioned on DensePose signal to establish 3D pose control of avatars through 2D images, which enhances view consistency from partially observed scenarios. It addresses the infamous Janus Problem and significantly stablizes the generation process. Moreover, we propose a progressive high-resolution 3D synthesis strategy, which obtains substantial improvement over the quality of the created 3D avatars. To this end, the proposed AvatarVerse pipeline achieves zero-shot 3D modeling of 3D avatars that are not only more expressive, but also in higher quality and fidelity than previous works. Rigorous qualitative evaluations and user studies showcase AvatarVerse's superiority in synthesizing high-fidelity 3D avatars, leading to a new standard in high-quality and stable 3D avatar creation. Our project page is: https://avatarverse3d.github.io/
引用
收藏
页码:7124 / 7132
页数:9
相关论文
共 50 条
  • [1] An Intuitive System for 3D Avatar with High-quality
    Lee, JiHyung
    Choi, Yoon-Seok
    Koo, Bon-Ki
    Hwang, Chi Jung
    2010 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS ICCE, 2010,
  • [2] AvatarStudio: High-Fidelity and Animatable 3D Avatar Creation from Text
    Zhang, Xuanmeng
    Zhang, Jianfeng
    Zhang, Chenxu
    Liew, Jun Hao
    Zhang, Huichao
    Yang, Yi
    Feng, Jiashi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [3] MH Pose: 3D Human Pose Estimation based on High-quality Heatmap
    Zhou, Huifen
    Hong, Chaoqun
    Han, Yong
    Huang, Pengcheng
    Zhuang, Yanhui
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3215 - 3222
  • [4] HQ3DAvatar: High-quality Implicit 3D Head Avatar
    Teotia, Kartik
    Mallikarjun, B. R.
    Pan, Xingang
    Kim, Hyeongwoo
    Garrido, Pablo
    Elgharib, Mohamed
    Theobalt, Christian
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (03):
  • [5] HQ-Avatar: Towards High-Quality 3D Avatar Generation via Point-based Representation
    Zhang, Weitian
    Wu, Sijing
    Yan, Yichao
    Xue, Ben
    Zhu, Wenhan
    Yang, Xiaokang
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
  • [6] Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
    Chen, Rui
    Chen, Yongwei
    Jiao, Ningxin
    Jia, Kui
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22189 - 22199
  • [7] Producing High-quality 3D Maps from Lidar
    Xiong, Biao
    GIM INTERNATIONAL-THE WORLDWIDE MAGAZINE FOR GEOMATICS, 2016, 30 (02): : 34 - 35
  • [8] Creation of 3D Scene from Raw Text
    Dessai, Sneha N.
    Dhanaraj, Rachel
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 1466 - 1469
  • [9] High-Quality Reconstruction of 3D Model
    Liu, Xing-ming
    Cai, Tie
    Gui, Rong-zhi
    Wang, Hui-jing
    Liu, Jun-yao
    COMPUTER SCIENCE AND TECHNOLOGY (CST2016), 2017, : 946 - 954
  • [10] EucliDreamer: Fast and High-Quality Texturing for 3D Models with Stable Diffusion Depth
    Le, Cindy
    Hetang, Conrui
    Lin, Chendi
    Cao, Ang
    He, Yihui
    arXiv, 2023,