Multi-Scale Correspondence Learning for Person Image Generation

被引:0
|
作者
Shen, Shi-Long [1 ]
Wu, Ai-Guo [1 ]
Xu, Yong [2 ]
机构
[1] Harbin Inst Technol Shenzhen, Shenzhen, Peoples R China
[2] Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen, Peoples R China
关键词
generative models; generative adversarial networks; person image generation;
D O I
10.1587/transinf.2022DLP0058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A generative model is presented for two types of person image generation in this paper. First, this model is applied to pose-guided person image generation, i.e., converting the pose of a source person im-age to the target pose while preserving the texture of that source person image. Second, this model is also used for clothing-guided person image generation, i.e., changing the clothing texture of a source person image to the desired clothing texture. The core idea of the proposed model is to establish the multi-scale correspondence, which can effectively address the misalignment introduced by transferring pose, thereby preserving richer in-formation on appearance. Specifically, the proposed model consists of two stages: 1) It first generates the target semantic map imposed on the target pose to provide more accurate guidance during the generation process. 2) After obtaining the multi-scale feature map by the encoder, the multi-scale correspondence is established, which is useful for a fine-grained genera-tion. Experimental results show the proposed method is superior to state-of-the-art methods in pose-guided person image generation and show its effectiveness in clothing-guided person image generation.
引用
收藏
页码:804 / 812
页数:9
相关论文
共 50 条
  • [1] Multi-scale cross-domain alignment for person image generation
    Ma, Liyuan
    Gao, Tingwei
    Shen, Haibin
    Huang, Kejie
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (02) : 374 - 387
  • [2] MsCGAN: Multi-scale Conditional Generative Adversarial Networks for Person Image Generation
    Tang, Wei
    Li, Gui
    Bao, Xinyuan
    Nian, Fudong
    Li, Tong
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 1440 - 1445
  • [3] Exploiting appearance transfer and multi-scale context for efficient person image generation
    Shen, Chengkang
    Wang, Peiyan
    Tang, Wei
    PATTERN RECOGNITION, 2022, 124
  • [4] MANet: Multi-Scale Attention Network for Correspondence Learning
    Chen, Yukai
    Zheng, Linxin
    Liu, Xin
    Xiao, Guobao
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1978 - 1982
  • [5] MANet: Multi-Scale Attention Network for Correspondence Learning
    Chen, Yukai
    Zheng, Linxin
    Liu, Xin
    Xiao, Guobao
    Xiao, Guobao (gbx@mju.edu.cn), 1978, Institute of Electrical and Electronics Engineers Inc. (28): : 1978 - 1982
  • [6] A novel Multi-scale architecture driven by decoupled semantic attention transfer for person image generation
    Wang, Meng
    Chen, Jiaxing
    Liu, Haipeng
    MATERIALS LETTERS, 2023, 336 : 24 - 36
  • [7] Multi-scale joint learning for person re-identification
    Xie P.
    Xu X.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (03): : 613 - 622
  • [8] A multi-scale unsupervised learning for deformable image registration
    Shuwei Shao
    Zhongcai Pei
    Weihai Chen
    Wentao Zhu
    Xingming Wu
    Baochang Zhang
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 157 - 166
  • [9] A multi-scale unsupervised learning for deformable image registration
    Shao, Shuwei
    Pei, Zhongcai
    Chen, Weihai
    Zhu, Wentao
    Wu, Xingming
    Zhang, Baochang
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (01) : 157 - 166
  • [10] Multi-Scale Ensemble Learning for Thermal Image Enhancement
    Ban, Yuseok
    Lee, Kyungjae
    APPLIED SCIENCES-BASEL, 2021, 11 (06):