A Flow-Based Generative Network for Photo-Realistic Virtual Try-on

Cited by: 2
Authors
Wang, Tao [1]
Gu, Xiaoling [1]
Zhu, Junkai [1]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, Hangzhou 310005, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Clothing; Strain; Semantics; Three-dimensional displays; Estimation; Shape; Computational modeling; Image-based virtual try-on; image synthesis; appearance flow; OPTICAL-FLOW;
DOI
10.1109/ACCESS.2022.3167509
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Image-based virtual try-on systems aim at transferring the try-on clothes onto a target person. Despite making considerable progress recently, such systems are still highly challenging for real-world applications because of occlusion and drastic spatial deformation. To address the issues, we propose a novel Flow-based Virtual Try-on Network (FVTN). It consists of three modules. Firstly, the Parsing Alignment Module (PAM) aligns the source clothing to the target person at the semantic level by predicting a semantic parsing map. Secondly, the Flow Estimation Module (FEM) learns a robust clothing deformation model by estimating multi-scale dense flow fields in an unsupervised fashion. Thirdly, the Fusion and Rendering Module (FRM) synthesizes the final try-on image by effectively integrating the warped clothing features and human body features. Extensive experiments on a public fashion dataset demonstrate that our FVTN qualitatively and quantitatively outperforms the state-of-the-art approaches. The source code and trained models are available at https://github.com/gxl-groups/FVNT.
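The abstract's central operation, deforming the clothing with a dense flow field, can be illustrated with a minimal sketch. This is not the authors' FEM: the function name `warp_with_flow` is hypothetical, and the sketch assumes a single scale, a grayscale image stored as nested lists, backward warping with bilinear interpolation, and border clamping. A practical implementation would more likely use a tensor library's grid sampling (for example `torch.nn.functional.grid_sample`) on feature maps at several scales.

```python
import math


def warp_with_flow(image, flow):
    """Backward-warp a 2D grayscale image with a dense flow field.

    image: H x W nested list of floats.
    flow:  H x W nested list of (dx, dy) offsets. Output pixel (x, y)
           samples the source at (x + dx, y + dy) with bilinear
           interpolation. Sampling locations outside the image are
           clamped to the border (an assumption of this sketch).
    """
    height, width = len(image), len(image[0])
    warped = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            dx, dy = flow[y][x]
            # Clamp the sampling location to the valid image area.
            sx = min(max(x + dx, 0.0), width - 1.0)
            sy = min(max(y + dy, 0.0), height - 1.0)
            # Integer corners surrounding the sampling location.
            x0, y0 = int(math.floor(sx)), int(math.floor(sy))
            x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
            # Fractional distances used as interpolation weights.
            wx, wy = sx - x0, sy - y0
            top = (1 - wx) * image[y0][x0] + wx * image[y0][x1]
            bottom = (1 - wx) * image[y1][x0] + wx * image[y1][x1]
            warped[y][x] = (1 - wy) * top + wy * bottom
    return warped


# Toy example: shift the content one pixel to the left.
cloth = [[0.0, 1.0, 2.0],
         [3.0, 4.0, 5.0],
         [6.0, 7.0, 8.0]]
shift_left = [[(1.0, 0.0)] * 3 for _ in range(3)]
shifted = warp_with_flow(cloth, shift_left)
```

Because every output pixel reads from the source through its own offset, the same routine covers both rigid shifts and the non-rigid deformations a learned flow field would describe.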
Pages: 40899-40909
Page count: 11