HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

被引:3
|
作者
Zhou, Peng [1 ]
Xie, Lingxi [2 ]
Ni, Bingbing [1 ]
Liu, Lin [3 ]
Tian, Qi [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Huawei Cloud BU, Guangdong518129, Shenzhen, Peoples R China
[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;
D O I
10.1109/TCSVT.2022.3222456
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper delivers an important message that algorithm details are crucial for achieving satisfying performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features which are robust to scale. Integrating these designs derives the proposed framework, HRInversion, that achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.
引用
收藏
页码:2147 / 2161
页数:15
相关论文
共 50 条
  • [41] Cross-Domain Interpolation for Unpaired Image-to-Image Translation
    Lopez, Jorge
    Mauricio, Antoni
    Diaz, Jose
    Camara, Guillermo
    COMPUTER VISION SYSTEMS (ICVS 2019), 2019, 11754 : 542 - 551
  • [42] GP-GAN: Towards Realistic High-Resolution Image Blending
    Wu, Huikai
    Zheng, Shuai
    Zhang, Junge
    Huang, Kaiqi
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2487 - 2495
  • [43] StyleSwin: Transformer-based GAN for High-resolution Image Generation
    Zhang, Bowen
    Gu, Shuyang
    Zhang, Bo
    Bao, Jianmin
    Chen, Dong
    Wen, Fang
    Wang, Yong
    Guo, Baining
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11294 - 11304
  • [44] THEOREM FOR HIGH-RESOLUTION HIGH-CONTRAST IMAGE SYNTHESIS
    BUCKLEW, JA
    SALEH, BEA
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (08): : 1233 - 1236
  • [45] Self-Ensembling GAN for Cross-Domain Semantic Segmentation
    Xu, Yonghao
    He, Fengxiang
    Du, Bo
    Tao, Dacheng
    Zhang, Liangpei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7837 - 7850
  • [46] Cross-Domain Sentiment Classification with Attention-Assisted GAN
    Li, Yi-Fan
    Lin, Yu
    Gao, Yang
    Khan, Latifur
    2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 88 - 95
  • [47] CFFT-GAN: Cross-Domain Feature Fusion Transformer for Exemplar-Based Image Translation
    Ma, Tianxiang
    Li, Bingchuan
    Liu, Wei
    Hua, Miao
    Dong, Jing
    Tan, Tieniu
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1887 - 1895
  • [48] CMRFusion: A cross-domain multi-resolution fusion method for infrared and visible image fusion
    Xiong, Zhang
    Cao, Yuanjia
    Zhang, Xiaohui
    Hu, Qingping
    Han, Hongwei
    OPTICS AND LASERS IN ENGINEERING, 2023, 170
  • [49] High-Resolution Image Synthesis with Latent Diffusion Models
    Rombach, Robin
    Blattmann, Andreas
    Lorenz, Dominik
    Esser, Patrick
    Ommer, Bjoern
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10674 - 10685
  • [50] Cross-domain collaborative learning for single image deraining
    Pan, Zaiyu
    Wang, Jun
    Shen, Zhengwen
    Han, Shuyu
    Zhu, Jihong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211