Advances in text-guided 3D editing: a survey

Cited by: 0
Authors
Lu, Lihua [1]
Li, Ruyang [1]
Zhang, Xiaohui [1]
Wei, Hui [1]
Du, Guoguang [1]
Wang, Binqiang [1]
Affiliations
[1] Shandong Massive Information Technology Research Institute, Jinan, People's Republic of China
Keywords
Text prompts; Text-guided 3D editing; Editing capacity; Neural radiance fields
DOI
10.1007/s10462-024-10937-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In 3D Artificial Intelligence Generated Content (AIGC), editing existing 3D assets to satisfy user prompts, rather than generating them from scratch, allows diverse and high-quality 3D assets to be created in a time- and labor-saving manner. Text-guided 3D editing, which modifies 3D assets according to text prompts, is user-friendly and practical, and has recently prompted a surge of research in this field. In this survey, we comprehensively investigate recent literature on text-guided 3D editing in an attempt to answer two questions: What are the methodologies of existing text-guided 3D editing? How far has text-guided 3D editing progressed? Specifically, we focus on text-guided 3D editing methods published in the past 4 years, delving deeply into their frameworks and principles. We then present a fundamental taxonomy in terms of the editing strategy, optimization scheme, and 3D representation. Based on the taxonomy, we review recent advances in this field, considering factors such as editing scale, type, granularity, and perspective. In addition, we highlight four applications of text-guided 3D editing (texturing, style transfer, local editing of scenes, and insertion editing) to further explore 3D editing capacities through in-depth comparisons and discussions. Building on the insights gained from this survey, we discuss open challenges and future research directions. We hope this survey will help readers gain a deeper understanding of this exciting field and foster further advancements in text-guided 3D editing.
Pages: 61
Related papers
50 in total
  • [1] A Survey of Text-guided 3D Face Reconstruction
    Cen, Mengyue
    Shen, Haoran
    Zhao, Wangyan
    Pan, Dingcheng
    Feng, Xiaoyi
    2024 3RD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MEDIA COMPUTING, ICIPMC 2024, 2024, : 82 - 87
  • [2] TECA: Text-Guided Generation and Editing of Compositional 3D Avatars
    Zhang, Hao
    Feng, Yao
    Kulits, Peter
    Wen, Yandong
    Thies, Justus
    Black, Michael J.
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1520 - 1530
  • [3] ClipFace: Text-guided Editing of Textured 3D Morphable Models
    Aneja, Shivangi
    Thies, Justus
    Dai, Angela
    Niessner, Matthias
    PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023
  • [4] Vox-E: Text-guided Voxel Editing of 3D Objects
    Sella, Etai
    Fiebelman, Gal
    Hedman, Peter
    Averbuch-Elor, Hadar
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 430 - 440
  • [5] TEXTure: Text-Guided Texturing of 3D Shapes
    Richardson, Elad
    Metzer, Gal
    Alaluf, Yuval
    Giryes, Raja
    Cohen-Or, Daniel
    PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023
  • [6] Towards Implicit Text-Guided 3D Shape Generation
    Liu, Zhengzhe
    Wang, Yi
    Qi, Xiaojuan
    Fu, Chi-Wing
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17875 - 17885
  • [7] WordRobe: Text-Guided Generation of Textured 3D Garments
    Srivastava, Astitva
    Manu, Pranav
    Raj, Amit
    Jampani, Varun
    Sharma, Avinash
    COMPUTER VISION-ECCV 2024, PT I, 2025, 15059 : 458 - 475
  • [8] DREAMCRAFT: Text-Guided Generation of Functional 3D Environments in Minecraft
    Earle, Sam
    Kokkinos, Filippos
    Nie, Yuhe
    Togelius, Julian
    Raileanu, Roberta
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2024, 2024
  • [9] HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks
    Chen, Zhuo
    Xu, Xudong
    Yan, Yichao
    Pan, Ye
    Zhu, Wenhan
    Wu, Wayne
    Dai, Bo
    Yang, Xiaokang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9997 - 10010
  • [10] Text-guided 3D Human Generation from 2D Collections
    Fu, Tsu-Jui
    Xiong, Wenhan
    Nie, Yixin
    Liu, Jingyu
    Oguz, Barlas
    Wang, William Yang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 4508 - 4520