Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation

Cited by: 0
Authors
Tang, Jilin [1]
Yuan, Yi [1]
Shao, Tianjia [2]
Liu, Yong [3]
Wang, Mengmeng [3]
Zhou, Kun [2]
Affiliations
[1] NetEase Fuxi AI Lab, Beijing, Peoples R China
[2] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Peoples R China
[3] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou, Peoples R China
Source
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021 / Vol. 35
Funding
National Key Research and Development Program of China;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
In this paper, we tackle the problem of pose-guided person image generation, which aims to transfer a person image from a source pose to a novel target pose while preserving the source appearance. Because standard CNNs handle large spatial transformations inefficiently, we propose a structure-aware, flow-based method for high-quality person image generation. Specifically, instead of learning the complex overall pose changes of the human body, we decompose the body into different semantic parts (e.g., head, torso, and legs) and apply different networks to predict the flow fields for these parts separately. Moreover, we carefully design the network modules to capture the local semantic correlations of features within each part and the global correlations among parts. Extensive experimental results show that our method can generate high-quality results under large pose discrepancy and outperforms state-of-the-art methods in both qualitative and quantitative comparisons.
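The part-wise flow idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: in the paper the flow fields are predicted by networks, whereas here they are passed in as plain arrays, and the per-part masks, the bilinear sampler, and the names `warp_bilinear` and `compose_parts` are assumptions made only for illustration.

```python
import numpy as np


def warp_bilinear(feat, flow):
    """Backward-warp feat (H, W, C) with a flow field (H, W, 2).

    flow[y, x] = (dx, dy) means output pixel (x, y) samples the source
    at (x + dx, y + dy). Sample positions are clamped to the image border.
    """
    h, w, _ = feat.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    sx = np.clip(xs + flow[..., 0], 0, w - 1)
    sy = np.clip(ys + flow[..., 1], 0, h - 1)
    x0 = np.floor(sx).astype(int)
    y0 = np.floor(sy).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = (sx - x0)[..., None]  # horizontal interpolation weight
    wy = (sy - y0)[..., None]  # vertical interpolation weight
    top = feat[y0, x0] * (1 - wx) + feat[y0, x1] * wx
    bottom = feat[y1, x0] * (1 - wx) + feat[y1, x1] * wx
    return top * (1 - wy) + bottom * wy


def compose_parts(feat, part_flows, part_masks):
    """Warp the source once per body part and blend with per-part masks.

    part_masks are assumed to be (H, W) weights in the target pose;
    how parts are actually merged in the paper is not stated in the abstract.
    """
    out = np.zeros(feat.shape, dtype=float)
    for flow, mask in zip(part_flows, part_masks):
        out += mask[..., None] * warp_bilinear(feat, flow)
    return out


# Toy example: the upper "part" shifts by one pixel, the lower "part" stays put.
feat = np.arange(16, dtype=float).reshape(4, 4, 1)
flow_upper = np.zeros((4, 4, 2))
flow_upper[..., 0] = 1.0  # sample one pixel to the right
flow_lower = np.zeros((4, 4, 2))
mask_upper = np.zeros((4, 4))
mask_upper[:2] = 1.0
mask_lower = 1.0 - mask_upper
warped = compose_parts(feat, [flow_upper, flow_lower], [mask_upper, mask_lower])
```

Giving each part its own flow field keeps every individual field simple (here, a constant shift or the identity), which is the motivation the abstract gives for decomposing the body rather than learning one overall pose change.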
Pages: 2656-2664
Page count: 9