Swin-CasUNet: Cascaded U-Net with Swin Transformer for Masked Face Restoration

被引:1
|
作者
Zeng, Chengbin [1 ]
Liu, Yi [1 ]
Song, Chunli [1 ]
机构
[1] Guizhou Inst Technol, Sch Big Data, Guiyang 550003, Guizhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICPR56361.2022.9956183
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Masked face restoration is one of the most valuable challenges in the computer vision community. With the in-depth study of u-shaped architectures, also known as U-Net, great progress has been achieved in the development of masked face restoration during the past few years. However, previous restoration methods fail to fully model the long-range dependency due to the locality of convolution layers of the U-Net. To address this problem, we propose a shifted windows Transformer (Swin Transformer) based cascaded U-Net framework called Swin-CasUNet, which incorporates the long-range dependency merit of Transformer into the cascaded U-Net architecture to effectively enhance the functionality and generalization of Ushaped architecture. Specifically, we design a two-stage cascaded U-Net architecture to implement the coarse-to-fine restoration of the masked face. Swin Transformers is adopted to extract global self-attention contexts for the feature map produced by the encoder part of the U-Net. An improved face structure loss is proposed to supervise structure learning. To evaluate the robustness of our masked face restoration model, we collect 3800 pairs of full face images and corresponding masked face images from the real-world and web. Experiments on the datasets demonstrate that our proposed method can generate high quality restoration results. In order to quantitatively compare with previous face restoration methods, we modify the input of our system by manually adding regular and irregular white masks on CelebA face datasets, and then retrain our network. Experiments show that our Swin-CasUNet outperforms previous methods on benchmark datasets.
引用
收藏
页码:386 / 392
页数:7
相关论文
共 50 条
  • [1] PARSE CHALLENGE 2022: PULMONARY ARTERIES SEGMENTATION USING SWIN U-NET TRANSFORMER(SWIN UNETR) AND U-NET
    Padhy, Rohan
    Maurya, Akansh
    Patil, Kunal Dasharath
    Ramakrishna, Kalluri
    Krishnamurthi, Ganapathy
    [J]. 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [2] Cascaded transformer U-net for image restoration
    Yan, Longbin
    Zhao, Min
    Liu, Shumin
    Shi, Shuaikai
    Chen, Jie
    [J]. SIGNAL PROCESSING, 2023, 206
  • [3] Swin Deformable Attention U-Net Transformer (SDAUT) for Explainable Fast MRI
    Huang, Jiahao
    Xing, Xiaodan
    Gao, Zhifan
    Yang, Guang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI, 2022, 13436 : 538 - 548
  • [4] Adaptive enhanced swin transformer with U-net for remote sensing image segmentation*
    Gu, Xingjian
    Li, Sizhe
    Ren, Shougang
    Zheng, Hengbiao
    Fan, Chengcheng
    Xu, Huanliang
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [5] Crack _PSTU: Crack detection based on the U-Net framework combined with Swin Transformer
    Lu, Weizhong
    Qian, Meiling
    Xia, Yiyi
    Lu, Yiming
    Shen, Jiyun
    Fu, Qiming
    Lu, You
    [J]. STRUCTURES, 2024, 62
  • [6] DS-TransUNet: Dual Swin Transformer U-Net for Medical Image Segmentation
    Lin, Ailiang
    Chen, Bingzhi
    Xu, Jiayu
    Zhang, Zheng
    Lu, Guangming
    Zhang, David
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [7] SwinVI:3D Swin Transformer Model with U-net for Video Inpainting
    Zhang, Wei
    Cao, Yang
    Zhai, Junhai
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] MeltPondNet: A Swin Transformer U-Net for Detection of Melt Ponds on Arctic Sea Ice
    Sudakow, Ivan
    Asari, Vijayan K.
    Liu, Ruixu
    Demchev, Denis
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8776 - 8784
  • [9] CST-UNet: Cross Swin Transformer Enhanced U-Net with Masked Bottleneck for Single-Channel Speech Enhancement
    Zhang, Zipeng
    Chen, Wei
    Guo, Weiwei
    Liu, Yiming
    Yang, Jianhua
    Liu, Houguang
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (09) : 5989 - 6010
  • [10] STU3Net: An Improved U-Net With Swin Transformer Fusion for Thyroid Nodule Segmentation
    Deng, Xiangyu
    Dang, Zhiyan
    Pan, Lihao
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (05)