HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

被引:3
|
作者
Zhou, Peng [1 ]
Xie, Lingxi [2 ]
Ni, Bingbing [1 ]
Liu, Lin [3 ]
Tian, Qi [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Huawei Cloud BU, Guangdong518129, Shenzhen, Peoples R China
[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;
D O I
10.1109/TCSVT.2022.3222456
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper delivers an important message that algorithm details are crucial for achieving satisfying performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features which are robust to scale. Integrating these designs derives the proposed framework, HRInversion, that achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.
引用
收藏
页码:2147 / 2161
页数:15
相关论文
共 50 条
  • [1] Cross-Domain Face Synthesis using a Controllable GAN
    Mokhayeri, Fania
    Kamali, Kaveh
    Granger, Eric
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 241 - 249
  • [2] SEMANTIC GAN: APPLICATION FOR CROSS-DOMAIN IMAGE STYLE TRANSFER
    Li, Pengfei
    Yang, Meng
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 910 - 915
  • [3] DHNet: High-resolution and hierarchical network for cross-domain OCT speckle noise reduction
    Zhou, Yi
    Li, Jiang
    Wang, Meng
    Peng, Yuanyuan
    Chen, Zhongyue
    Zhu, Weifang
    Shi, Fei
    Wang, Lianyu
    Wang, Tingting
    Yao, Chenpu
    Chen, Xinjian
    MEDICAL PHYSICS, 2022, 49 (09) : 5914 - 5928
  • [4] Deep Adversarial Domain Adaptation Method for Cross-Domain Classification in High-Resolution Remote Sensing Images
    Teng Wenxiu
    Wang Ni
    Chen Taisheng
    Wang Benlin
    Chen Menglin
    Shi Huihui
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (11)
  • [5] DiCyc: GAN-based deformation invariant cross-domain information fusion for medical image synthesis
    Wang, Chengjia
    Yang, Guang
    Papanastasiou, Giorgos
    Tsaftaris, Sotirios A.
    Newby, David E.
    Gray, Calum
    Macnaught, Gillian
    MacGillivray, Tom J.
    INFORMATION FUSION, 2021, 67 : 147 - 160
  • [6] CF-GAN: cross-domain feature fusion generative adversarial network for text-to-image synthesis
    Zhang, Yubo
    Han, Shuang
    Zhang, Zhongxin
    Wang, Jianyang
    Bi, Hongbo
    VISUAL COMPUTER, 2023, 39 (04): : 1283 - 1293
  • [7] CF-GAN: cross-domain feature fusion generative adversarial network for text-to-image synthesis
    Yubo Zhang
    Shuang Han
    Zhongxin Zhang
    Jianyang Wang
    Hongbo Bi
    The Visual Computer, 2023, 39 : 1283 - 1293
  • [8] Joint Distribution Adaptive-Alignment for Cross-Domain Segmentation of High-Resolution Remote Sensing Images
    Huang, Haitao
    Li, Baopu
    Zhang, Yuchen
    Chen, Tao
    Wang, Bin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [9] Cross-domain Web Image Annotation
    Si, Si
    Tao, Dacheng
    Chan, Kwok-Ping
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 184 - +
  • [10] Cross-domain fashion image retrieval
    Gajic, Bojana
    Baldrich, Ramon
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1950 - 1952