Person image generation with attention-based injection network

Cited by: 8
Authors
Liu, Meichen [1]
Wang, Kejun [1]
Ji, Ruihang [2]
Ge, Shuzhi Sam [3]
Chen, Jing [1]
Affiliations
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Coll Control Sci & Engn, Harbin 150001, Peoples R China
[3] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore
Funding
National Natural Science Foundation of China;
Keywords
Image generation; Semantic parsing; Attention mechanism; Person re-identification;
DOI
10.1016/j.neucom.2021.06.077
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Person image generation is a challenging problem because of content ambiguity and style inconsistency. In this paper, we propose a novel Attention-based Injection Network (AIN) to address this issue. Instead of directly learning the relationship between the source and target images, we decompose the process into two accessible modules, namely the Semantic-guided Attention Network (SAN) and the Pose-guided Attention Network (PAN). SAN captures semantic information and embeds human attributes into the latent space via the semantic layout. PAN enables a natural re-coupling of pose and appearance, selectively integrating features to complete the human pose transformation. Additionally, a semantic layout loss is proposed to focus on the semantic content similarity between the source and generated images. Compared with other methods, our network enforces consistency of local textures and styles between the source and generated images. Experiments show superior qualitative and quantitative results on the Market-1501 and DeepFashion datasets. Building on AIN, our network can further provide data augmentation for person re-identification (Re-ID), dramatically improving person Re-ID accuracy. (c) 2021 Elsevier B.V. All rights reserved.
Pages: 345-359
Number of pages: 15
Related papers
50 in total
  • [21] Hashtag Recommendation with Attention-Based Neural Image Hashtagging Network
    Wu, Gaosheng
    Li, Yuhua
    Yan, Wenjin
    Li, Ruixuan
    Gu, Xiwu
    Yang, Qi
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 52 - 63
  • [22] Baggage Image Retrieval with Attention-Based Network for Security Checks
    Huang, Gan
    Yang, Li
    Zhang, Ding
    Wang, Xiaofeng
    Wang, Yanfu
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (09)
  • [23] ATTENTION-BASED NETWORK FOR LOW-LIGHT IMAGE ENHANCEMENT
    Zhang, Cheng
    Yan, Qingsen
    Zhu, Yu
    Li, Xianjun
    Sun, Jinqiu
    Zhang, Yanning
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [24] A Hierarchical Multimodal Attention-based Neural Network for Image Captioning
    Cheng, Yong
    Huang, Fei
    Zhou, Lian
    Jin, Cheng
    Zhang, Yuejie
    Zhang, Tao
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 889 - 892
  • [25] Attention-based mechanism and feature fusion network for person re-identification
    An, Mingshou
    He, Yunchuan
    Lim, Hye-Youn
    Kang, Dae-Seong
    [J]. INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2024, 20 (01)
  • [26] Attention-based Natural Language Person Retrieval
    Zhou, Tao
    Chen, Muhao
    Yu, Jie
    Terzopoulos, Demetri
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 27 - 34
  • [27] Attention-based Fusion for Multi-source Human Image Generation
    Lathuiliere, Stephane
    Sangineto, Enver
    Siarohin, Aliaksandr
    Sebe, Nicu
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 428 - 437
  • [28] Attention-Based Multistage Fusion Network for Remote Sensing Image Pansharpening
    Zhang, Wanwan
    Li, Jinjiang
    Hua, Zhen
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [29] GridDehazeNet: Attention-Based Multi-Scale Network for Image Dehazing
    Liu, Xiaohong
    Ma, Yongrui
    Shi, Zhihao
    Chen, Jun
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7313 - 7322
  • [30] ADRN: ATTENTION-BASED DEEP RESIDUAL NETWORK FOR HYPERSPECTRAL IMAGE DENOISING
    Zhao, Yongsen
    Zhai, Deming
    Jiang, Junjun
    Liu, Xianming
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2668 - 2672