Multi-Scale Correspondence Learning for Person Image Generation

被引:0
|
作者
Shen, Shi-Long [1 ]
Wu, Ai-Guo [1 ]
Xu, Yong [2 ]
机构
[1] Harbin Inst Technol Shenzhen, Shenzhen, Peoples R China
[2] Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen, Peoples R China
关键词
generative models; generative adversarial networks; person image generation;
D O I
10.1587/transinf.2022DLP0058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A generative model is presented for two types of person image generation in this paper. First, this model is applied to pose-guided person image generation, i.e., converting the pose of a source person im-age to the target pose while preserving the texture of that source person image. Second, this model is also used for clothing-guided person image generation, i.e., changing the clothing texture of a source person image to the desired clothing texture. The core idea of the proposed model is to establish the multi-scale correspondence, which can effectively address the misalignment introduced by transferring pose, thereby preserving richer in-formation on appearance. Specifically, the proposed model consists of two stages: 1) It first generates the target semantic map imposed on the target pose to provide more accurate guidance during the generation process. 2) After obtaining the multi-scale feature map by the encoder, the multi-scale correspondence is established, which is useful for a fine-grained genera-tion. Experimental results show the proposed method is superior to state-of-the-art methods in pose-guided person image generation and show its effectiveness in clothing-guided person image generation.
引用
收藏
页码:804 / 812
页数:9
相关论文
共 50 条
  • [41] A multi-scale kernel learning method and its application in image classification
    Bao, Jian
    Chen, Yangyang
    Yu, Li
    Chen, Chunwei
    NEUROCOMPUTING, 2017, 257 : 16 - 23
  • [42] Learning multi-scale synergic discriminative features for prostate image segmentation
    Jia, Haozhe
    Cai, Weidong
    Huang, Heng
    Xia, Yong
    PATTERN RECOGNITION, 2022, 126
  • [43] Image manipulation detection and localization using multi-scale contrastive learning
    Bai, Ruyi
    APPLIED SOFT COMPUTING, 2024, 163
  • [44] Prompt learning and multi-scale attention for infrared and visible image fusion
    Li, Yanan
    Ji, Qingtao
    Jiao, Shaokang
    INFRARED PHYSICS & TECHNOLOGY, 2025, 145
  • [45] Multi-scale GAN with residual image learning for removing heterogeneous blur
    Khan, Rayyan Azam
    Luo, Yigang
    Wu, Fang-Xiang
    IET IMAGE PROCESSING, 2022, 16 (09) : 2412 - 2431
  • [46] Multi-scale microstructure binary pattern extraction and learning for image representation
    Zhang, Dongbo
    Yi, Lianglin
    Tang, Hongzhong
    Zhang, Ying
    Xu, Haixia
    IET IMAGE PROCESSING, 2019, 13 (13) : 2507 - 2515
  • [47] Multi-Scale Detail-Noise Complementary Learning for Image Denoising
    Cui, Yan
    Shi, Mingyue
    Jiang, Jielin
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [48] MHDNet: A Multi-Scale Hybrid Deep Learning Model for Person Re-Identification
    Wang, Jinghui
    Wang, Jun
    ELECTRONICS, 2024, 13 (08)
  • [49] Learning multi-level and multi-scale deep representations for privacy image classification
    Han, Yahui
    Huang, Yonggang
    Pan, Lei
    Zheng, Yunbo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (02) : 2259 - 2274
  • [50] Direct multi-scale ordination with canonical correspondence analysis
    Wagner, HH
    ECOLOGY, 2004, 85 (02) : 342 - 351