Retrieval-based Spatially Adaptive Normalization for Semantic Image Synthesis

Cited by: 13
Authors
Shi, Yupeng [1]
Liu, Xiao [1]
Wei, Yuxiang [2]
Wu, Zhongqin [1]
Zuo, Wangmeng [2, 3]
Affiliations
[1] Tomorrow Advancing Life, Beijing, People's Republic of China
[2] Harbin Institute of Technology, Harbin, People's Republic of China
[3] Peng Cheng Laboratory, Shenzhen, People's Republic of China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
DOI
10.1109/CVPR52688.2022.01094
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic image synthesis is a challenging task with many practical applications. Although remarkable progress has been made in semantic image synthesis with spatially-adaptive normalization, existing methods usually normalize the feature activations under coarse-level guidance (e.g., semantic class). However, different parts of a semantic object (e.g., the wheel and window of a car) differ considerably in structure and texture, so blurry synthesis results are usually inevitable when fine-grained guidance is missing. In this paper, we propose a novel normalization module, termed REtrieval-based Spatially Adaptive normaLization (RESAIL), for introducing pixel-level fine-grained guidance to the normalization architecture. Specifically, we first present a retrieval paradigm that, for each test semantic mask, finds a content patch of the same semantic class from the training set with the most similar shape. Then, the retrieved patches are composited into retrieval-based guidance, which RESAIL can use for pixel-level fine-grained modulation of feature activations, thereby greatly mitigating blurry synthesis results. Moreover, distorted ground-truth images are also used as alternatives to retrieval-based guidance for feature normalization, further benefiting model training and improving the visual quality of generated images. Experiments on several challenging datasets show that RESAIL performs favorably against state-of-the-art methods in terms of quantitative metrics, visual quality, and subjective evaluation.
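The retrieval step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how shape similarity is measured, so intersection-over-union (IoU) between binary masks is an assumption here, and the names `mask_iou`, `retrieve_most_similar`, and the `(class, mask, patch_id)` bank layout are hypothetical. Masks are assumed to be already cropped and resized to a common size.

```python
from typing import List, Optional, Sequence, Tuple

Mask = List[List[int]]  # binary mask: 1 = pixel belongs to the object


def mask_iou(a: Mask, b: Mask) -> float:
    """Intersection-over-union of two equally sized binary masks.

    IoU is an assumed stand-in for the shape-similarity measure.
    """
    inter = 0
    union = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            if pa and pb:
                inter += 1
            if pa or pb:
                union += 1
    return inter / union if union else 0.0


def retrieve_most_similar(
    query: Mask,
    query_class: str,
    bank: Sequence[Tuple[str, Mask, str]],
) -> Optional[str]:
    """Return the id of the same-class training patch whose mask best matches `query`.

    `bank` holds hypothetical (class_label, mask, patch_id) entries built
    from the training set. Returns None if no entry shares the class.
    """
    best_id: Optional[str] = None
    best_score = -1.0
    for label, mask, patch_id in bank:
        if label != query_class:
            continue  # only patches of the same semantic class are candidates
        score = mask_iou(query, mask)
        if score > best_score:
            best_id, best_score = patch_id, score
    return best_id


# Toy example: a 2x2 "car" mask queried against three training entries.
query = [[1, 1], [0, 0]]
bank = [
    ("car", [[1, 0], [1, 0]], "car_patch_b"),
    ("car", [[1, 1], [0, 0]], "car_patch_a"),
    ("tree", [[1, 1], [0, 0]], "tree_patch"),
]
best = retrieve_most_similar(query, "car", bank)
```

In this toy setting the retrieved patch ids would then be looked up and pasted into the region of each mask to form the guidance image; that compositing step and the normalization itself are not shown.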
Pages: 11214-11223
Page count: 10