TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling

被引:0
|
作者
Huo, Dong [1 ,3 ]
Guo, Zixin [2 ]
Zuo, Xinxin [3 ]
Shi, Zhihao [3 ]
Lu, Juwei [3 ]
Dai, Peng [3 ]
Xu, Songcen [3 ]
Cheng, Li [1 ]
Yang, Yee-Hong [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Univ Toronto, Toronto, ON, Canada
[3] Huawei Noahs Ark Lab, Montreal, PQ, Canada
来源
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1007/978-3-031-72920-1_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a 3D mesh, we aim to synthesize 3D textures that correspond to arbitrary textual descriptions. Current methods for generating and assembling textures from sampled views often result in prominent seams or excessive smoothing. To tackle these issues, we present TexGen, a novel multi-view sampling and resampling framework for texture generation leveraging a pre-trained text-to-image diffusion model. For view consistent sampling, first of all we maintain a texture map in RGB space that is parameterized by the denoising step and updated after each sampling step of the diffusion model to progressively reduce the view discrepancy. An attention-guided multi-view sampling strategy is exploited to broadcast the appearance information across views. To preserve texture details, we develop a noise resampling technique that aids in the estimation of noise, generating inputs for subsequent denoising steps, as directed by the text prompt and current texture map. Through an extensive amount of qualitative and quantitative evaluations, we demonstrate that our proposed method produces significantly better texture quality for diverse 3D objects with a high degree of view consistency and rich appearance details, outperforming current state-of-the-art methods. Furthermore, our proposed texture generation technique can also be applied to texture editing while preserving the original identity. More experimental results are available at https://dong-huo.github.io/TexGen/.
引用
收藏
页码:352 / 368
页数:17
相关论文
共 50 条
  • [41] Text-Guided Graph Neural Networks for Referring 3D Instance Segmentation
    Huang, Pin-Hao
    Lee, Han-Hung
    Chen, Hwann-Tzong
    Liu, Tyng-Luh
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1610 - 1618
  • [42] Vox-E: Text-guided Voxel Editing of 3D Objects
    Sella, Etai
    Fiebelman, Gal
    Hedman, Peter
    Averbuch-Elor, Hadar
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 430 - 440
  • [43] AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation
    Wang, Xinzhou
    Wang, Yikai
    Yee, Junliang
    Sung, Fuchun
    Wang, Zhengyi
    Wang, Ling
    Liu, Pengkun
    Sung, Kai
    Wan, Xintong
    Xie, Wende
    Liu, Fangfu
    He, Bin
    COMPUTER VISION - ECCV 2024, PT XXV, 2025, 15083 : 321 - 339
  • [44] SOGDet: Semantic-Occupancy Guided Multi-View 3D Object Detection
    Zhou, Qiu
    Cao, Jinming
    Leng, Hanchao
    Yin, Yifang
    Kun, Yu
    Zimmermann, Roger
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7668 - 7676
  • [45] Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
    Yu, Cuican
    Lu, Guansong
    Zeng, Yihan
    Sun, Jian
    Liang, Xiaodan
    Li, Huibin
    Xu, Zongben
    Xu, Songcen
    Zhang, Wei
    Xu, Hang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15280 - 15291
  • [46] Physically Guided Generative Adversarial Network for Holographic 3D Content Generation From Multi-View Light Field
    Zeng, Yunhui
    Long, Zhenwei
    Qiu, Yawen
    Wang, Shiyi
    Wei, Junjie
    Jin, Xin
    Cao, Hongkun
    Li, Zhiheng
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2024, 14 (02) : 286 - 298
  • [47] 3D imaging for glasses free multi-view 3D displays
    Gurbuz, Sabri
    Kawakita, Masahiro
    Yano, Sumio
    Iwasawa, Shoichiro
    Ando, Hiroshi
    STEREOSCOPIC DISPLAYS AND APPLICATIONS XXII, 2011, 7863
  • [48] A multi-view VR interface for 3D GIS
    Germs, R
    Van Maren, G
    Verbree, E
    Jansen, FW
    COMPUTERS & GRAPHICS-UK, 1999, 23 (04): : 497 - 506
  • [49] Multi-view 3D Seismic Facies classifier
    Trindade, Elton Alves
    Roisenberg, Mauro
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 1003 - 1011
  • [50] Multi-view 3D display using waveguides
    Lee, Byoungho
    Lee, Chang-Kun
    INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING (ICOPEN 2015), 2015, 9524