A Markov Chain approach for video-based virtual try-on with denoising diffusion generative adversarial network

被引:0
|
作者
Hou, Jue [1 ,2 ]
Lu, Yinwen [1 ,2 ]
Wang, Mingjie [3 ]
Ouyang, Wenbing [4 ]
Yang, Yang [1 ,2 ]
Zou, Fengyuan [1 ,2 ]
Gu, Bingfei [1 ,2 ]
Liu, Zheng [2 ,5 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Fash Design & Engn, CN-310018 Hangzhou, Zhejiang, Peoples R China
[2] Minist Culture & Tourism, Key Lab Silk Culture Heritage & Prod Design Digita, CN-310018 Hangzhou, Zhejiang, Peoples R China
[3] Zhejiang Sci Tech Univ, Sch Sci, Dept Math, CN-310018 Hangzhou, Zhejiang, Peoples R China
[4] Amazon Inc, 410 Terry Ave N, Seattle, WA 98109 USA
[5] Zhejiang Sci Tech Univ, Sch Int Educ, CN-310018 Hangzhou, Zhejiang, Peoples R China
关键词
Markov Chain; Diffusion model; Video synthesis; Virtual try -on;
D O I
10.1016/j.knosys.2024.112233
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based virtual try-ons have attracted unprecedented attention owing to the development of e-commerce. However, this problem is very challenging because of the arbitrary poses of persons and the demand for temporary consistency of frames, particularly when attempting to synthesize high-quality virtual try-on videos using single images. Specifically, there are two key challenges. 1) The existing video-based virtual try-on methods are based on generative adversarial networks (GAN), which are limited by unstable training and a lack of realism in generated details. 2) The explicit building of stronger constraints of generated frames, which aims to increase the coherence of generated videos. To address these challenges, this study proposed a novel framework, Extended Markov Chain Based Denoising Diffusion Generative Adversarial Network (EMC-DDGAN), which was derived from a denoising diffusion GAN, which is a diffusion model with efficient sampling. Moreover, we proposed an extended Markov chain that used a diffusion model to synthesize frames via sequential generation. With a carefully designed network and learning objects, the proposed approach achieved outstanding performance on public datasets. Rigorous experiments demonstrated that EMC-DDGAN could synthesize higher-quality videos compared to other state-of-the-art methods and validated the effectiveness of the proposed approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Generative-Adversarial-Network-Based Virtual Try-On Method
    Zhang S.
    Wang Q.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2021, 54 (09): : 925 - 933
  • [2] Fast and robust virtual try-on based on parser-free generative adversarial network
    Rohil, Mukesh Kumar
    Parikh, Arpan
    VIRTUAL REALITY, 2024, 28 (01)
  • [3] Fast and robust virtual try-on based on parser-free generative adversarial network
    Mukesh Kumar Rohil
    Arpan Parikh
    Virtual Reality, 2024, 28
  • [4] ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on
    Kuppa, Gaurav
    Jong, Andrew
    Liu, Xin
    Liu, Ziwei
    Moh, Teng-Sheng
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2021), 2021, : 191 - 200
  • [5] Attention-based Video Virtual Try-On
    Tsai, Wen-Jiin
    Tien, Yi-Cheng
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 209 - 216
  • [6] A Flow-Based Generative Network for Photo-Realistic Virtual Try-on
    Wang, Tao
    Gu, Xiaoling
    Zhu, Junkai
    IEEE ACCESS, 2022, 10 : 40899 - 40909
  • [7] Enhancing consistency in virtual try-on: A novel diffusion-based approach
    Zhou, Chenglin
    Zhang, Wei
    Lian, Zhichao
    IMAGE AND VISION COMPUTING, 2024, 148
  • [8] MV-TON: Memory-based Video Virtual Try-on network
    Zhong, Xiaojing
    Wu, Zhonghua
    Tan, Taizhe
    Lin, Guosheng
    Wu, Qingyao
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 908 - 916
  • [9] An Approach for EEG Denoising Based on Wasserstein Generative Adversarial Network
    Dong, Yuanzhe
    Tang, Xi
    Li, Qingge
    Wang, Yingying
    Jiang, Naifu
    Tian, Lan
    Zheng, Yue
    Li, Xiangxin
    Zhao, Shaofeng
    Li, Guanglin
    Fang, Peng
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 3524 - 3534
  • [10] VITON: An Image-based Virtual Try-on Network
    Han, Xintong
    Wu, Zuxuan
    Wu, Zhe
    Yu, Ruichi
    Davis, Larry S.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7543 - 7552