Face sketch-to-photo transformation with multi-scale self-attention GAN

被引:12
|
作者
Lei, Yingtao [1 ]
Du, Weiwei [2 ]
Hu, Qinghua [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] Kyoto Inst Technol, Informat & Human Sci, Kyoto 6068585, Japan
基金
中国国家自然科学基金;
关键词
Image transformation; Sketch-to-photo; Divide and conquer; Multi-scale; Attention mechanism; Generative adversarial network;
D O I
10.1016/j.neucom.2020.02.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we investigate the sketch-to-photo problem, which currently poses a significant challenge in the field of computer vision. A large number of GAN-based encoder-decoder methods have been proposed for image transformation, inspired by the pix2pix model; however, these methods do not produce satisfactory results for photo generation, due to the fact that (1) they miss detailed information of input images because of a single-scale convolution operator in the shallow encoder layers, and (2) they fail to learn long-range dependencies in the deep encoder layers. To better handle these challenges, we present an approach that follows a "divide and conquer" strategy. Our method combines the advantages of a multi-scale convolutional neural network and an attention mechanism and applies these two modules to different encoder layers. Additionally, by optimizing a well-designed loss function, the complex correlations between the sketch and the photo can be calculated. Experimental results show that our method is able to generate high-quality photos from sketch images, and qualitative and quantitative analysis demonstrates its effectiveness and superiority over state-of-the-art models. This work paves a path to replace the traditional encoder structure with the "divide and conquer" strategy to handle image transformation tasks. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:13 / 23
页数:11
相关论文
共 50 条
  • [1] Multi-Scale Gradients Self-Attention Residual Learning for Face Photo-Sketch Transformation
    Duan, Shuchao
    Chen, Zhenxue
    Wu, Q. M. Jonathan
    Cai, Lei
    Lu, Dan
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 1218 - 1230
  • [2] Child Face Age Progression and Regression using Self-Attention Multi-Scale Patch GAN
    Chandaliya, Praveen Kumar
    Nain, Neeta
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
  • [3] Forensic sketch-to-photo transformation with improved Generative Adversarial Network (GAN)
    Khatoon, Shadma
    Umar, Mohammad Sarosh
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2022,
  • [4] FRAN: feature-filtered residual attention network for realistic face sketch-to-photo transformation
    Wan, Weiguo
    Yang, Yong
    Huang, Shuying
    Gan, Lixin
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 15946 - 15956
  • [5] FRAN: feature-filtered residual attention network for realistic face sketch-to-photo transformation
    Weiguo Wan
    Yong Yang
    Shuying Huang
    Lixin Gan
    [J]. Applied Intelligence, 2023, 53 : 15946 - 15956
  • [6] Multi-Scale Self-Attention for Text Classification
    Guo, Qipeng
    Qiu, Xipeng
    Liu, Pengfei
    Xue, Xiangyang
    Zhang, Zheng
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7847 - 7854
  • [7] Multi-scale self-attention mixup for graph classification *
    Kong, Youyong
    Li, Jiaxing
    Zhang, Ke
    Wu, Jiasong
    [J]. PATTERN RECOGNITION LETTERS, 2023, 168 : 100 - 106
  • [8] Multi-Scale Self-Attention Network for Denoising Medical Images
    Lee, Kyungsu
    Lee, Haeyun
    Lee, Moon Hwan
    Chang, Jin Ho
    Kuo, C. -C. Jay
    Oh, Seung-June
    Woo, Jonghye
    Hwang, Jae Youn
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (05) : 1 - 26
  • [9] An Efficient Transformer Based on Global and Local Self-Attention for Face Photo-Sketch Synthesis
    Yu, Wangbo
    Zhu, Mingrui
    Wang, Nannan
    Wang, Xiaoyu
    Gao, Xinbo
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 483 - 495
  • [10] A MULTI-SCALE SELF-ATTENTION NETWORK TO DISCRIMINATE PULMONARY NODULES
    Moreno, Alejandra
    Rueda, Andrea
    Martinez, Fabio
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,