DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

被引:0
|
作者
He, Kai [1 ,2 ]
Yao, Kaixin [1 ,3 ]
Zhang, Qixuan [1 ,2 ]
Liu, Lingjie [4 ]
Yu, Jingyi [1 ]
Xu, Lan [1 ]
机构
[1] ShanghaiTech Univ, Shanghai, Peoples R China
[2] Deemos Technol Co Ltd, Shanghai, Peoples R China
[3] NeuDim Technol Co Ltd, Shanghai, Peoples R China
[4] Univ Penn, Philadelphia, PA 19104 USA
来源
ACM TRANSACTIONS ON GRAPHICS | 2024年 / 43卷 / 04期
基金
国家重点研发计划;
关键词
Garment Generation; Sewing Patterns; Autoregressive Model;
D O I
10.1145/3658147
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Apparel's significant role in human appearance underscores the importance of garment digitalization for digital human creation. Recent advances in 3D content creation are pivotal for digital human creation. Nonetheless, garment generation from text guidance is still nascent. We introduce a text-driven 3D garment generation framework, DressCode, which aims to democratize design for novices and offer immense potential in fashion design, virtual try-on, and digital human creation. We first introduce SewingGPT, a GPT-based architecture integrating cross-attention with text-conditioned embedding to generate sewing patterns with text guidance. We then tailor a pre-trained Stable Diffusion to generate tile-based Physically-based Rendering (PBR) textures for the garments. By leveraging a large language model, our framework generates CG-friendly garments through natural language interaction. It also facilitates pattern completion and texture editing, streamlining the design process through user-friendly interaction. This framework fosters innovation by allowing creators to freely experiment with designs and incorporate unique elements into their work. With comprehensive evaluations and comparisons with other state-of-the-art methods, our method showcases superior quality and alignment with input prompts. User studies further validate our high-quality rendering results, highlighting its practical utility and potential in production settings. Our project page is https://IHe-KaiI.github.io/DressCode/.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Set to Ordered Text: Generating Discharge Instructions from Medical Billing Codes
    Kurisinkel, Litton J.
    Chen, Nancy F.
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6165 - 6175
  • [32] A NOVEL METHOD FOR AUTOMATICALLY GENERATING MULTI-MODAL DIALOGUE FROM TEXT
    Prendinger, Helmut
    Piwek, Paul
    Ishizuka, Mitsuru
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2007, 1 (03) : 319 - 334
  • [33] WAV2GLOSS: Generating Interlinear Glossed Text from Speech
    He, Taiqi
    Choi, Kwanghee
    Tjuatja, Lindia
    Robinson, Nathaniel R.
    Shi, Jiatong
    Neubig, Graham
    Mortensen, David R.
    Levin, Lori
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 568 - 582
  • [34] Towards automatically generating supply chain maps from natural language text
    Wichmann, Pascal
    Brintrup, Alexandra
    Baker, Simon
    Woodall, Philip
    McFarlane, Duncan
    IFAC PAPERSONLINE, 2018, 51 (11): : 1726 - 1731
  • [35] An ontology-based procedure for generating object model from text description
    Vongdoiwang, Waralak
    Batanov, Dencho N.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 10 (01) : 93 - 108
  • [36] Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem
    Schumann, Raphael
    Riezler, Stefan
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 489 - 502
  • [37] CLIPSwarm: Generating Drone Shows from Text Prompts with Vision-Language Models
    Pueyo, Pablo
    Montijano, Eduardo
    Murillo, Ana C.
    Mac Schwager
    2024 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2024), 2024, : 11917 - 11923
  • [38] GENERATING TEXT FROM COMPRESSED INPUT - AN INTELLIGENT INTERFACE FOR PEOPLE WITH SEVERE MOTOR IMPAIRMENTS
    DEMASCO, PW
    MCCOY, KF
    COMMUNICATIONS OF THE ACM, 1992, 35 (05) : 68 - 78
  • [39] Text me the data: Generating Ground Pressure Sequence from Textual Descriptions for HAR
    Ray, Lala Shakti Swarup
    Zhou, Bo
    Suh, Sungho
    Krupp, Lars
    Rey, Vitor Fortes
    Lukowicz, Paul
    2024 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS, PERCOM WORKSHOPS, 2024, : 461 - 464
  • [40] Generating Audio-Visual Slideshows from Text Articles Using Word Concreteness
    Leake, Mackenzie
    Shin, Hijung Valentina
    Kim, Joy O.
    Agrawala, Maneesh
    PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,