M&M VTO: Multi-Garment Virtual Try-On and Editing

被引:0
|
作者
Zhu, Luyang [1 ,2 ,3 ]
Li, Yingwei [1 ,3 ]
Liu, Nan [1 ,3 ]
Peng, Hao [1 ,3 ]
Yang, Dawei [1 ,3 ]
Kemelmacher-Shlizerman, Ira [1 ,2 ,3 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Univ Washington, Seattle, WA 98195 USA
[3] Google, Mountain View, CA 94043 USA
关键词
D O I
10.1109/CVPR52733.2024.00134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present M&M VTO-a mix and match virtual try-on method that takes as input multiple garment images, text description for garment layout and an image of a person. An example input includes: an image of a shirt, an image of a pair of pants, "rolled sleeves, shirt tucked in", and an image of a person. The output is a visualization of how those garments (in the desired layout) would look like on the given person. Key contributions of our method are: 1) a single stage diffusion based model, with no super resolution cascading, that allows to mix and match multiple garments at 1024x512 resolution preserving and warping intricate garment details, 2) architecture design (VTO UNet Diffusion Transformer) to disentangle denoising from person specific features, allowing for a highly effective finetuning strategy for identity preservation (6MB model per individual vs 4GB achieved with, e.g., dreambooth finetuning); solving a common identity loss problem in current virtual try-on methods, 3) layout control for multiple garments via text inputs finetuned over PaLI-3 [8] for virtual try-on task. Experimental results indicate that M&M VTO achieves state-of-the-art performance both qualitatively and quantitatively, as well as opens up new opportunities for virtual try-on via language-guided and multi-garment try-on.
引用
收藏
页码:1346 / 1356
页数:11
相关论文
共 50 条
  • [21] Psychophysical testing of garment size variation using three-dimensional virtual try-on technology
    Kim, Dong-Eun
    TEXTILE RESEARCH JOURNAL, 2016, 86 (04) : 365 - 379
  • [22] Me or just like me? The role of virtual try-on and physical appearance in apparel M-retailing
    Plotkina, Daria
    Saurel, Helene
    JOURNAL OF RETAILING AND CONSUMER SERVICES, 2019, 51 : 362 - 377
  • [23] Leveraging personalization and customization affordances of virtual try-on apps for a new model in apparel m-shopping
    Tawira, Letwin
    Ivanov, Alex
    ASIA PACIFIC JOURNAL OF MARKETING AND LOGISTICS, 2023, 35 (02) : 451 - 471
  • [24] Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing
    Cui, Aiyu
    McKee, Daniel
    Lazebnik, Svetlana
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14618 - 14627
  • [25] Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing
    Cui, Aiyu
    McKee, Daniel
    Lazebnik, Svetlana
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3935 - 3940
  • [26] A Multi-Level Consistency Network for High-Fidelity Virtual Try-On
    Wei, Hao
    Chen, Rui
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (05)
  • [27] Virtual try-on technologies in the clothing industry. Part 1: investigation of distance ease between body and garment
    Lage, Agne
    Ancutiene, Kristina
    JOURNAL OF THE TEXTILE INSTITUTE, 2017, 108 (10) : 1787 - 1793
  • [28] Dress Code: High-Resolution Multi-Category Virtual Try-On
    Morelli, Davide
    Fincato, Matteo
    Cornia, Marcella
    Landi, Federico
    Cesari, Fabio
    Cucchiara, Rita
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2230 - 2234
  • [29] Dress Code: High-Resolution Multi-category Virtual Try-On
    Morelli, Davide
    Fincato, Matteo
    Cornia, Marcella
    Landi, Federico
    Cesari, Fabio
    Cucchiara, Rita
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 345 - 362
  • [30] StyleVTON: A multi-pose virtual try-on with identity and clothing detail preservation
    Islam, Tasin
    Miron, Alina
    Liu, Xiaohui
    Li, Yongmin
    NEUROCOMPUTING, 2024, 594