Multi-Scale Correspondence Learning for Person Image Generation

Authors
Shen, Shi-Long [1]
Wu, Ai-Guo [1]
Xu, Yong [2]
Affiliations
[1] Harbin Inst Technol Shenzhen, Shenzhen, Peoples R China
[2] Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen, Peoples R China
Keywords
generative models; generative adversarial networks; person image generation;
DOI
10.1587/transinf.2022DLP0058
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper presents a generative model for two types of person image generation. First, the model is applied to pose-guided person image generation, i.e., converting the pose of a source person image to a target pose while preserving the texture of the source image. Second, the model is used for clothing-guided person image generation, i.e., changing the clothing texture of a source person image to a desired clothing texture. The core idea of the proposed model is to establish multi-scale correspondence, which effectively addresses the misalignment introduced by pose transfer and thereby preserves richer appearance information. Specifically, the proposed model consists of two stages: 1) it first generates the target semantic map corresponding to the target pose, which provides more accurate guidance during generation; 2) after the encoder produces multi-scale feature maps, multi-scale correspondence is established, which supports fine-grained generation. Experimental results show that the proposed method is superior to state-of-the-art methods in pose-guided person image generation and demonstrate its effectiveness in clothing-guided person image generation.
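The idea of establishing correspondence at several scales can be illustrated with a minimal sketch. The abstract does not specify how the correspondence is computed, so everything here is an assumption made for illustration: cosine similarity between target-side and source-side features, a softmax with a hypothetical `temperature` parameter, and the helper names `warp_by_correspondence` and `multi_scale_warp`. It is not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)


def softmax(scores, temperature=0.1):
    """Numerically stable softmax; a low temperature sharpens the matching."""
    peak = max(scores)
    exps = [math.exp((s - peak) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def warp_by_correspondence(target_feats, source_feats, source_values,
                           temperature=0.1):
    """For each target location, blend source values by feature similarity.

    target_feats:  list of feature vectors, one per target location
    source_feats:  list of feature vectors, one per source location
    source_values: list of appearance vectors aligned with source_feats
    """
    dim = len(source_values[0])
    warped = []
    for t in target_feats:
        weights = softmax([cosine(t, s) for s in source_feats], temperature)
        warped.append([
            sum(w * v[d] for w, v in zip(weights, source_values))
            for d in range(dim)
        ])
    return warped


def multi_scale_warp(target_pyramid, source_pyramid, value_pyramid):
    """Apply the same correspondence step independently at every scale."""
    return [
        warp_by_correspondence(t, s, v)
        for t, s, v in zip(target_pyramid, source_pyramid, value_pyramid)
    ]


# Toy input with a single scale and two locations. The target features are
# the source features in swapped order, mimicking a change of pose.
source_feats = [[1.0, 0.0], [0.0, 1.0]]
source_values = [[10.0], [20.0]]
target_feats = [[0.0, 1.0], [1.0, 0.0]]

warped = multi_scale_warp([target_feats], [source_feats], [source_values])
```

In this toy case each target location draws its appearance almost entirely from the source location with the matching feature, so the warped values follow the swapped order. A real model would operate on learned feature maps with many more locations per scale.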
Pages: 804-812
Number of pages: 9