Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis

被引:2
|
作者
Tang, Hao [1 ]
Sun, Guolei [1 ]
Sebe, Nicu [2 ]
Van Gool, Luc [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland
[2] Univ Trento, Dept Informat Engn & Comp Sci DISI, I-38123 Trento, Italy
基金
欧盟地平线“2020”;
关键词
Contrastive learning; edge guided; GANs; multi-scale; semantic image synthesis; TRANSLATION;
D O I
10.1109/TPAMI.2023.3298721
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel edge guided generative adversarial network with contrastive learning (ECGAN) for the challenging semantic image synthesis task. Although considerable improvements have been achieved by the community in the recent period, the quality of synthesized images is far from satisfactory due to three largely unresolved challenges. 1) The semantic labels do not provide detailed structural information, making it challenging to synthesize local details and structures; 2) The widely adopted CNN operations such as convolution, down-sampling, and normalization usually cause spatial resolution loss and thus cannot fully preserve the original semantic information, leading to semantically inconsistent results (e.g., missing small objects); 3) Existing semantic image synthesis methods focus on modeling "local" semantic information from a single input semantic layout. However, they ignore "global" semantic information of multiple input semantic layouts, i.e., semantic cross-relations between pixels across different input layouts. To tackle 1), we propose to use the edge as an intermediate representation which is further adopted to guide image generation via a proposed attention guided edge transfer module. Edge information is produced by a convolutional generator and introduces detailed structure information. To tackle 2), we design an effective module to selectively highlight class-dependent feature maps according to the original semantic layout to preserve the semantic information. To tackle 3), inspired by current methods in contrastive learning, we propose a novel contrastive learning method, which aims to enforce pixel embeddings belonging to the same semantic class to generate more similar image content than those from different classes. We further propose a novel multi-scale contrastive learning method that aims to push same-class features from different scales closer together being able to capture more semantic relations by explicitly exploring the structures of labeled pixels from multiple input semantic layouts from different scales. Experiments on three challenging datasets show that our methods achieve significantly better results than state-of-the-art approaches.
引用
收藏
页码:14435 / 14452
页数:18
相关论文
共 50 条
  • [1] Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation
    Pissas, Theodoros
    Ravasio, Claudio S.
    Da Cruz, Lyndon
    Bergeles, Christos
    [J]. COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 413 - 429
  • [2] Multi-scale Contrastive Learning with Attention for Histopathology Image Classification
    Tan, Jing Wei
    Khoa Tuan Nguyen
    Lee, Kyoungbun
    Jeong, Won-Ki
    [J]. MEDICAL IMAGING 2023, 2023, 12471
  • [3] Multi-scale contrastive learning method for PolSAR image classification
    Hua, Wenqiang
    Wang, Chen
    Sun, Nan
    Liu, Lin
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (01)
  • [4] Multi-Scale Subgraph Contrastive Learning
    Liu, Yanbei
    Zhao, Yu
    Wang, Xiao
    Geng, Lei
    Xiao, Zhitao
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 2215 - 2223
  • [5] Image manipulation detection and localization using multi-scale contrastive learning
    Bai, Ruyi
    [J]. APPLIED SOFT COMPUTING, 2024, 163
  • [6] Multi-scale semantic image inpainting with residual learning and GAN
    Jiao, Libin
    Wu, Hao
    Wang, Haodi
    Bie, Rongfang
    [J]. NEUROCOMPUTING, 2019, 331 : 199 - 212
  • [7] Multi-scale multi-instance contrastive learning for whole slide image classification
    Zhang, Jianan
    Hao, Fang
    Liu, Xueyu
    Yao, Shupei
    Wu, Yongfei
    Li, Ming
    Zheng, Wen
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [8] Multi-scale Contrastive Learning for Gastroenteroscopy Classification
    Li, Dan
    Li, Xuechen
    Peng, Zhibin
    Chen, Wenting
    Shen, Linlin
    Wu, Guangyao
    [J]. 2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 852 - +
  • [9] Deep image clustering with contrastive learning and multi-scale graph convolutional networks
    Xu, Yuankun
    Huang, Dong
    Wang, Chang-Dong
    Lai, Jian-Huang
    [J]. PATTERN RECOGNITION, 2024, 146
  • [10] Edge-Guided Image Gap Interpolation Using Multi-Scale Transformation
    Langari, Bahareh
    Vaseghi, Saeed
    Prochazka, Ales
    Vaziri, Babak
    Aria, Farzad Tahmasebi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (09) : 4394 - 4405