Face sketch-to-photo transformation with multi-scale self-attention GAN

被引:12
|
作者
Lei, Yingtao [1 ]
Du, Weiwei [2 ]
Hu, Qinghua [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] Kyoto Inst Technol, Informat & Human Sci, Kyoto 6068585, Japan
基金
中国国家自然科学基金;
关键词
Image transformation; Sketch-to-photo; Divide and conquer; Multi-scale; Attention mechanism; Generative adversarial network;
D O I
10.1016/j.neucom.2020.02.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we investigate the sketch-to-photo problem, which currently poses a significant challenge in the field of computer vision. A large number of GAN-based encoder-decoder methods have been proposed for image transformation, inspired by the pix2pix model; however, these methods do not produce satisfactory results for photo generation, due to the fact that (1) they miss detailed information of input images because of a single-scale convolution operator in the shallow encoder layers, and (2) they fail to learn long-range dependencies in the deep encoder layers. To better handle these challenges, we present an approach that follows a "divide and conquer" strategy. Our method combines the advantages of a multi-scale convolutional neural network and an attention mechanism and applies these two modules to different encoder layers. Additionally, by optimizing a well-designed loss function, the complex correlations between the sketch and the photo can be calculated. Experimental results show that our method is able to generate high-quality photos from sketch images, and qualitative and quantitative analysis demonstrates its effectiveness and superiority over state-of-the-art models. This work paves a path to replace the traditional encoder structure with the "divide and conquer" strategy to handle image transformation tasks. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:13 / 23
页数:11
相关论文
共 50 条
  • [21] DEEPCHORUS: A HYBRID MODEL OF MULTI-SCALE CONVOLUTION AND SELF-ATTENTION FOR CHORUS DETECTION
    He, Qiqi
    Sun, Xiaoheng
    Yu, Yi
    Li, Wei
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 411 - 415
  • [22] Multi-Scale Aggregation with Self-Attention Network for Modeling Electrical Motor Dynamics
    Huang, Kuan-Chih
    Yang, Hao-Hsiang
    Chen, Wei-Ting
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 7097 - 7103
  • [23] Multi-scale self-attention generative adversarial network for pathology image restoration
    Meiyan Liang
    Qiannan Zhang
    Guogang Wang
    Na Xu
    Lin Wang
    Haishun Liu
    Cunlin Zhang
    [J]. The Visual Computer, 2023, 39 : 4305 - 4321
  • [24] Clothing Parsing Based on Multi-Scale Fusion and Improved Self-Attention Mechanism
    陈诺
    王绍宇
    陆然
    李文萱
    覃志东
    石秀金
    [J]. Journal of Donghua University(English Edition), 2023, 40 (06) : 661 - 666
  • [25] A fuzzy rule based multimodal framework for face sketch-to-photo retrieval
    Khan, Mohd Aamir
    Jalal, Anand Singh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 134 : 138 - 152
  • [26] Unsupervised self-attention lightweight photo-to-sketch synthesis with feature maps
    Zhong, Kunru
    Chen, Zhenxue
    Liu, Chengyun
    Wu, Q. M. Jonathan
    Duan, Shuchao
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [27] Recognizing Food Places in Egocentric Photo-Streams Using Multi-Scale Atrous Convolutional Networks and Self-Attention Mechanism
    Sarker, Md Mostafa Kamal
    Rashwan, Hatem A.
    Akram, Farhan
    Talavera, Estefania
    Banu, Syeda Furruka
    Radeva, Petia
    Puig, Domenec
    [J]. IEEE ACCESS, 2019, 7 : 39069 - 39082
  • [28] Footprint Pressure Image Retrieval Algorithm Based on Multi-scale Self-attention Convolution
    Zhu M.
    Wang T.
    Wang N.
    Tang J.
    Lu X.
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (12): : 1097 - 1103
  • [29] A Serial-Parallel Self-Attention Network Joint With Multi-Scale Dilated Convolution
    Gaihua, Wang
    Tianlun, Zhang
    Yingying, Dai
    Jinheng, Lin
    Lei, Cheng
    [J]. IEEE ACCESS, 2021, 9 : 71909 - 71919
  • [30] A self-attention multi-scale convolutional neural network method for SAR image despeckling
    Wen, Zhiqing
    He, Yi
    Yao, Sheng
    Yang, Wang
    Zhang, Lifeng
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (03) : 902 - 923