SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation

被引:0
|
作者
Zheng, Peng [1 ]
Liu, Tao [1 ]
Yi, Zili [3 ]
Ma, Rui [1 ,2 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China
[2] MOE, Engn Res Ctr Knowledge Driven Human Machine Intel, Changchun, Peoples R China
[3] Nanjing Univ, Sch Intelligence Sci & Technol, Suzhou, Peoples R China
来源
COMPUTER VISION - ECCV 2024, PT XXX | 2025年 / 15088卷
基金
中国国家自然科学基金;
关键词
Generative models; 3D-aware human image synthesis; Compositional image synthesis;
D O I
10.1007/978-3-031-73404-5_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of neural radiance fields and generative models, numerous methods have been proposed for learning 3D human generation from 2D images. These methods allow control over the pose of the generated 3D human and enable rendering from different viewpoints. However, none of these methods explore semantic disentanglement in human image synthesis, i.e., they can not disentangle the generation of different semantic parts, such as the body, tops, and bottoms. Furthermore, existing methods are limited to synthesize images at 5122 resolution due to the high computational cost of neural radiance fields. To address these limitations, we introduce SemanticHuman-HD, the first method to achieve semantic disentangled human image synthesis. Notably, SemanticHuman-HD is also the first method to achieve 3D-aware image synthesis at 10242 resolution, benefiting from our proposed 3D-aware super-resolution module. By leveraging the depth maps and semantic masks as guidance for the 3D-aware super-resolution, we significantly reduce the number of sampling points during volume rendering, thereby reducing the computational cost. Our comparative experiments demonstrate the superiority of our method. The effectiveness of each proposed component is also verified through ablation studies. Moreover, our method opens up exciting possibilities for various applications, including 3D garment generation, semantic-aware image synthesis, controllable image synthesis, and out-of-distribution image synthesis. Our project page is at https://pengzheng0707.github.io/SemanticHuman- HD/.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 50 条
  • [31] 3D resolution of urban sites by high-resolution aerial stereoscopy
    Oriot, Hélène
    Michel, Alain
    Bulletin - Societe Francaise de Photogrammetrie et de Teledetection, 2002, (166): : 19 - 26
  • [32] Fast High-Resolution 3D Radar Imaging Based on 3D FISTA
    Ren, Jiaying
    Lu, Xinfei
    Guan, Jian
    Yin, Zhiping
    Ehen, Weidong
    2017 IEEE RADAR CONFERENCE (RADARCONF), 2017, : 815 - 818
  • [33] nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding
    Zhu, Benjin
    Wang, Zhe
    Li, Hongsheng
    COMPUTER VISION - ECCV 2024, PT V, 2025, 15063 : 125 - 141
  • [34] Experience with fully 3D PET and implications for future high-resolution 3D tomographs
    Bailey, DL
    Miller, MP
    Spinks, TJ
    Bloomfield, PM
    Livieratos, L
    Young, HE
    Jones, T
    PHYSICS IN MEDICINE AND BIOLOGY, 1998, 43 (04): : 777 - 786
  • [36] High-resolution 3D coherent laser radar imaging
    Krause, Brian
    Gatt, Philip
    Embry, Carl
    Buck, Joe
    LASER RADAR TECHNOLOGY AND APPLICATIONS XI, 2006, 6214
  • [37] Rapid High-Resolution Visible Light 3D Printing
    Ahn, Dowon
    Stevens, Lynn M.
    Zhou, Kevin
    Page, Zachariah A.
    ACS CENTRAL SCIENCE, 2020, 6 (09) : 1555 - 1563
  • [38] Connecting high-resolution 3D chromatin organization with epigenomics
    Feng, Fan
    Yao, Yuan
    Wang, Xue Qing David
    Zhang, Xiaotian
    Liu, Jie
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [40] HIGH-RESOLUTION 3D DIGITAL MODELS - CH SURVEYOR
    Ene, D.
    Radvan, R.
    ROMANIAN REPORTS IN PHYSICS, 2010, 62 (03) : 660 - 670