M&M VTO: Multi-Garment Virtual Try-On and Editing

被引：0

作者：

Zhu, Luyang ^{[1
,2
,3
]}

Li, Yingwei ^{[1
,3
]}

Liu, Nan ^{[1
,3
]}

Peng, Hao ^{[1
,3
]}

Yang, Dawei ^{[1
,3
]}

Kemelmacher-Shlizerman, Ira ^{[1
,2
,3
]}

机构：

[1] Google Res, Mountain View, CA 94043 USA

[2] Univ Washington, Seattle, WA 98195 USA

[3] Google, Mountain View, CA 94043 USA

来源：

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024 | 2024年

关键词：

D O I：

10.1109/CVPR52733.2024.00134

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present M&M VTO-a mix and match virtual try-on method that takes as input multiple garment images, text description for garment layout and an image of a person. An example input includes: an image of a shirt, an image of a pair of pants, "rolled sleeves, shirt tucked in", and an image of a person. The output is a visualization of how those garments (in the desired layout) would look like on the given person. Key contributions of our method are: 1) a single stage diffusion based model, with no super resolution cascading, that allows to mix and match multiple garments at 1024x512 resolution preserving and warping intricate garment details, 2) architecture design (VTO UNet Diffusion Transformer) to disentangle denoising from person specific features, allowing for a highly effective finetuning strategy for identity preservation (6MB model per individual vs 4GB achieved with, e.g., dreambooth finetuning); solving a common identity loss problem in current virtual try-on methods, 3) layout control for multiple garments via text inputs finetuned over PaLI-3 [8] for virtual try-on task. Experimental results indicate that M&M VTO achieves state-of-the-art performance both qualitatively and quantitatively, as well as opens up new opportunities for virtual try-on via language-guided and multi-garment try-on.

引用

页码：1346 / 1356

页数：11

共 50 条

[21] Psychophysical testing of garment size variation using three-dimensional virtual try-on technology
Kim, Dong-Eun
TEXTILE RESEARCH JOURNAL, 2016, 86 (04) : 365 - 379
[22] Me or just like me? The role of virtual try-on and physical appearance in apparel M-retailing
Plotkina, Daria
Saurel, Helene
JOURNAL OF RETAILING AND CONSUMER SERVICES, 2019, 51 : 362 - 377
[23] Leveraging personalization and customization affordances of virtual try-on apps for a new model in apparel m-shopping
Tawira, Letwin
Ivanov, Alex
ASIA PACIFIC JOURNAL OF MARKETING AND LOGISTICS, 2023, 35 (02) : 451 - 471
[24] Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing
Cui, Aiyu
McKee, Daniel
Lazebnik, Svetlana
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14618 - 14627
[25] Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing
Cui, Aiyu
McKee, Daniel
Lazebnik, Svetlana
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3935 - 3940
[26] A Multi-Level Consistency Network for High-Fidelity Virtual Try-On
Wei, Hao
Chen, Rui
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (05)
[27] Virtual try-on technologies in the clothing industry. Part 1: investigation of distance ease between body and garment
Lage, Agne
Ancutiene, Kristina
JOURNAL OF THE TEXTILE INSTITUTE, 2017, 108 (10) : 1787 - 1793
[28] Dress Code: High-Resolution Multi-Category Virtual Try-On
Morelli, Davide
Fincato, Matteo
Cornia, Marcella
Landi, Federico
Cesari, Fabio
Cucchiara, Rita
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2230 - 2234
[29] Dress Code: High-Resolution Multi-category Virtual Try-On
Morelli, Davide
Fincato, Matteo
Cornia, Marcella
Landi, Federico
Cesari, Fabio
Cucchiara, Rita
COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 345 - 362
[30] StyleVTON: A multi-pose virtual try-on with identity and clothing detail preservation
Islam, Tasin
Miron, Alina
Liu, Xiaohui
Li, Yongmin
NEUROCOMPUTING, 2024, 594

← 1 2 3 4 5 →