Dual Attention GANs for Semantic Image Synthesis

被引:34
|
作者
Tang, Hao [1 ]
Bai, Song [2 ]
Sebe, Nicu [1 ,3 ]
机构
[1] Univ Trento, DISI, Trento, Italy
[2] Univ Oxford, Dept Engn Sci, Oxford, England
[3] Huawei Res Ireland, Dublin, Ireland
关键词
Generative Adversarial Networks (GANs); Semantic Image Synthesis; Spatial Attention; Channel Attention;
D O I
10.1145/3394171.3416270
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on the semantic image synthesis task that aims at transferring semantic label maps to photo-realistic images. Existing methods lack effective semantic constraints to preserve the semantic information and ignore the structural correlations in both spatial and channel dimensions, leading to unsatisfactory blurry and artifact-prone results. To address these limitations, we propose a novel Dual Attention GAN (DAGAN) to synthesize photo-realistic and semantically-consistent images with fine details from the input layouts without imposing extra training overhead or modifying the network architectures of existing methods. We also propose two novel modules, i.e., position-wise Spatial Attention Module (SAM) and scale-wise Channel Attention Module (CAM), to capture semantic structure attention in spatial and channel dimensions, respectively. Specifically, SAM selectively correlates the pixels at each position by a spatial attention map, leading to pixels with the same semantic label being related to each other regardless of their spatial distances. Meanwhile, CAM selectively emphasizes the scalewise features at each channel by a channel attention map, which integrates associated features among all channel maps regardless of their scales. We finally sum the outputs of SAM and CAM to further improve feature representation. Extensive experiments on four challenging datasets show that DAGAN achieves remarkably better results than state-of-the-art methods, while using fewer model parameters. The source code and trained models are available at https://github.com/Ha0Tang/DAGAN.
引用
收藏
页码:1994 / 2002
页数:9
相关论文
共 50 条
  • [1] Dual image and mask synthesis with GANs for semantic segmentation in optical coherence tomography
    Kugelman, Jason
    Alonso-Caneiro, David
    Read, Scott A.
    Vincent, Stephen J.
    Chen, Fred K.
    Collins, Michael J.
    [J]. 2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,
  • [2] Dual conditional GAN based on external attention for semantic image synthesis
    Liu, Gang
    Zhou, Qijun
    Xie, Xiaoxiao
    Yu, Qingchen
    [J]. CONNECTION SCIENCE, 2023, 35 (01)
  • [3] Collaging Class-specific GANs for Semantic Image Synthesis
    Li, Yuheng
    Li, Yijun
    Lu, Jingwan
    Shechtman, Eli
    Lee, Yong Jae
    Singh, Krishna Kumar
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14398 - 14407
  • [4] Dual Contrastive Loss and Attention for GANs
    Yu, Ning
    Liu, Guilin
    Dundar, Aysegul
    Tao, Andrew
    Catanzaro, Bryan
    Davis, Larry
    Fritz, Mario
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6711 - 6722
  • [5] High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
    Wang, Ting-Chun
    Liu, Ming-Yu
    Zhu, Jun-Yan
    Tao, Andrew
    Kautz, Jan
    Catanzaro, Bryan
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8798 - 8807
  • [6] Cross-view panorama image synthesis with progressive attention GANs
    Wu, Songsong
    Tang, Hao
    Jing, Xiao-Yuan
    Qian, Jianjun
    Sebe, Nicu
    Yan, Yan
    Zhang, Qinghua
    [J]. PATTERN RECOGNITION, 2022, 131
  • [7] GANs for Biological Image Synthesis
    Osokin, Anton
    Chessel, Anatole
    Salas, Rafael E. Carazo
    Vaggi, Federico
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2252 - 2261
  • [8] Hiding image into image with hybrid attention mechanism based on GANs
    Zhu, Yuling
    Dong, Yunyun
    Song, Bingbing
    Yao, Shaowen
    [J]. IET IMAGE PROCESSING, 2024, 18 (10) : 2679 - 2689
  • [9] Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis
    Tang, Hao
    Sun, Guolei
    Sebe, Nicu
    Van Gool, Luc
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14435 - 14452
  • [10] Dual Semantic Relationship Attention Network for Image-Text Matching
    Wen, Keyu
    Gu, Xiaodong
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,