Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation

被引:0
|
作者
Tang, Jilin [1 ]
Yuan, Yi [1 ]
Shao, Tianjia [2 ]
Liu, Yong [3 ]
Wang, Mengmeng [3 ]
Zhou, Kun [2 ]
机构
[1] NetEase Fuxi AI Lab, Beijing, Peoples R China
[2] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Peoples R China
[3] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we tackle the problem of pose guided person image generation, which aims to transfer a person image from the source pose to a novel target pose while maintaining the source appearance. Given the inefficiency of standard CNNs in handling large spatial transformation, we propose a structure-aware flow based method for high-quality person image generation. Specifically, instead of learning the complex overall pose changes of human body, we decompose the human body into different semantic parts (e.g., head, torso, and legs) and apply different networks to predict the flow fields for these parts separately. Moreover, we carefully design the network modules to effectively capture the local and global semantic correlations of features within and among the human parts respectively. Extensive experimental results show that our method can generate high-quality results under large pose discrepancy and outperforms state-of-the-art methods in both qualitative and quantitative comparisons.
引用
收藏
页码:2656 / 2664
页数:9
相关论文
共 50 条
  • [1] A Structure-Aware Method for Direct Pose Estimation
    Blanton, Hunter
    Workman, Scott
    Jacobs, Nathan
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 205 - 214
  • [2] Pose-Aware Disentangled Multiscale Transformer for Pose Guided Person Image Generation
    Shibasaki, Kei
    Ikehara, Masaaki
    IEEE ACCESS, 2023, 11 : 146054 - 146064
  • [3] Learning structure-aware semantic segmentation with image-level supervision
    Liu, Jiawei
    Zhang, Jing
    Hong, Yicong
    Barnes, Nick
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Structure-aware image fusion
    Li, Wen
    Xie, Yuange
    Zhou, Haole
    Han, Ying
    Zhan, Kun
    OPTIK, 2018, 172 : 1 - 11
  • [5] Structure-Aware Procedural Text Generation From an Image Sequence
    Nishimura, Taichi
    Hashimoto, Atsushi
    Ushiku, Yoshitaka
    Kameko, Hirotaka
    Yamakata, Yoko
    Mori, Shinsuke
    IEEE ACCESS, 2021, 9 : 2125 - 2141
  • [6] An Illumination Insensitive and Structure-Aware Image Color Layer Decomposition Method
    Cheng, Wengang
    Dou, Pengli
    Zhou, Dengwen
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 163 - 175
  • [7] STRUCTURE-AWARE GENERATIVE ADVERSARIAL NETWORK FOR TEXT-TO-IMAGE GENERATION
    Chen, Wenjie
    Ni, Zhangkai
    Wang, Hanli
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2075 - 2079
  • [8] Lightweight Texture Correlation Network for Pose Guided Person Image Generation
    Zhang, Pengze
    Yang, Lingxiao
    Xie, Xiaohua
    Lai, Jianhuang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4584 - 4598
  • [9] Image compression with structure-aware inpainting
    Wang, Chen
    Sun, Xiaoyan
    Wu, Feng
    Xiong, Hongkai
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 1816 - 1819
  • [10] Pose Guided Person Image Generation
    Ma, Liqian
    Jia, Xu
    Sun, Qianru
    Schiele, Bernt
    Tuytelaars, Tinne
    Van Gool, Luc
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30