Adaptive Multi-scale Information Fusion Based on Dynamic Receptive Field for Image-to-image Translation

被引:2
|
作者
Yin Mengxiao [1 ,2 ]
Lin Zhenfeng [1 ]
Yang Feng [1 ,2 ]
机构
[1] Guangxi Univ, Sch Comp & Elect Informat, Nanning 530004, Peoples R China
[2] Guangxi Univ, Guangxi Key Lab Multimedia Commun & Network Techn, Nanning 530004, Peoples R China
基金
中国国家自然科学基金;
关键词
Image translation; Multi-scale information; Dynamic receptive field; Adaptive feature selection;
D O I
10.11999/JEIT200675
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In order to improve the quality of the generated images by the image translation model, the generator in the translation model to obtain high-quality generated images is improved, the diversified image translation is explored and the generation ability of the translation model is expanded. In terms of generator improvement, the dynamic receptive field mechanism of Selective Kernel Block (SKBlock) is used to obtain and fuse the multi-scale information of each up sampling feature in the generator. With the help of multi-scale information of features and dynamic receptive field, the Selective Kernel Generative Adversarial Network (SK-GAN) is constructed. Compared with the traditional generator, SK-GAN improves the quality of the generated image by using dynamic receptive field to obtain multi-scale information. In terms of diversified image translation, the Selective Kernel Generative Adversarial Network with Guide (GSK-GAN) is proposed based on SK-GAN in sketch synthesis realistic image task. GSK-GAN uses the guided image to guide the source image translation and extracts the guide image features through the guided image encoder. Then transmits information of the guided image features to the generator by Parameter Generator (PG) and Feature Transformation (FT). In addition, a dual branch guided image encoder is proposed to improve the editing ability of the translation model. The random style image generation is realized by using the latent variable distribution of the guide image. The experimental results show that the improved generator is helpful to improve the quality of the generated images, and SK-GAN can obtain reasonable results in multiple datasets. GSK-GAN no only ensures the quality of the generated images, but also generates more styles of images
引用
收藏
页码:2386 / 2394
页数:9
相关论文
共 22 条
  • [1] Guided Image-to-Image Translation with Bi-Directional Feature Transformation
    AlBahar, Badour
    Huang, Jia-Bin
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9015 - 9024
  • [2] [Anonymous], 2017, P 31 ADV NEUR INF PR
  • [3] Photographic Image Synthesis with Cascaded Refinement Networks
    Chen, Qifeng
    Koltun, Vladlen
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1520 - 1529
  • [4] SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis
    Chen, Wengling
    Hays, James
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9416 - 9425
  • [5] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [6] Goodfellow I., 2020, ADV NEUR IN, V63, P139, DOI [DOI 10.1145/3422622, 10.1145/3422622]
  • [7] Image-to-Image Translation with Conditional Adversarial Networks
    Isola, Phillip
    Zhu, Jun-Yan
    Zhou, Tinghui
    Efros, Alexei A.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5967 - 5976
  • [8] Kingma D. P., 2013, ARXIV13126114
  • [9] Multi-scale Residual Network for Image Super-Resolution
    Li, Juncheng
    Fang, Faming
    Mei, Kangfu
    Zhang, Guixu
    [J]. COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 527 - 542
  • [10] Selective Kernel Networks
    Li, Xiang
    Wang, Wenhai
    Hu, Xiaolin
    Yang, Jian
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 510 - 519