Deep Learning for Image/Video Restoration and Super-resolution

Cited by: 0
Author
Tekalp, A. Murat [1]
Affiliation
[1] Koc Univ, Dept Elect & Elect Engn, Istanbul, Turkey
Keywords
IMAGE QUALITY ASSESSMENT; VIDEO SUPERRESOLUTION; FIDELITY-CRITERION; INFORMATION; NETWORK; BACKPROPAGATION; MODEL;
DOI
10.1561/0600000100
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Recent advances in neural signal processing have led to significant improvements in the performance of learned image/video restoration and super-resolution (SR). An important benefit of data-driven deep learning approaches to image processing is that neural models can be optimized for any differentiable loss function, including perceptual loss functions, leading to perceptual image/video restoration and SR, which cannot be easily handled by traditional model-based methods. We start with a brief problem statement and a short discussion on traditional vs. data-driven solutions. We next review recent advances in neural architectures, such as residual blocks, dense connections, residual-in-residual dense blocks, residual blocks with generative neurons, self-attention, and visual transformers. We then discuss loss functions and evaluation (assessment) criteria for image/video restoration and SR, including fidelity (distortion) and perceptual criteria, and the relation between them, where we briefly review the perception-distortion trade-off. We can consider learned image/video restoration and SR as learning either a nonlinear regressive mapping from degraded to ideal images based on the universal approximation theorem, or a generative model that captures the probability distribution of ideal images. We first review regressive inference via residual and/or dense convolutional networks (ConvNets). We also show that a new architecture with residual blocks based on a generative neuron model can outperform classical residual ConvNets in peak signal-to-noise ratio (PSNR). We next discuss generative inference based on adversarial training, such as SRGAN and ESRGAN, which can reproduce realistic textures, or based on normalizing flows, such as SRFlow, by optimizing log-likelihood. We then discuss problems in applying supervised training to real-life restoration and SR, including overfitting image priors and overfitting the degradation model seen in the training set. We introduce multiple-model SR and real-world SR (from unpaired training data) formulations to overcome these problems. Integration of traditional model-based methods and deep learning for non-blind restoration/SR is introduced as another solution to model overfitting in supervised learning. In learned video restoration and SR (VSR), we first discuss how to best exploit temporal correlations in video, including sliding temporal window vs. recurrent architectures for propagation, and aligning frames in the pixel domain using optical flow vs. in the feature space using deformable convolutions. We next introduce early fusion with feature-space alignment, employed by the EDVR network, which obtains excellent PSNR performance. However, it is well known that videos with the highest PSNR may not be the most appealing to humans, since minimizing the mean-square error may result in blurring of details. We then address perceptual optimization of VSR models to obtain natural texture and motion. Although the perception-distortion trade-off has been well studied for images, few works address perceptual VSR. In addition to using perceptual losses, such as MS-SSIM, LPIPS, and/or adversarial training, we also discuss explicit loss functions/criteria to enforce and evaluate temporal consistency. We conclude with a discussion of open problems.
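The abstract ties PSNR directly to the mean-square error, which is why minimizing MSE maximizes PSNR. As a minimal sketch of that relation (not code from the monograph), the standard definition PSNR = 10 log10(peak^2 / MSE) can be written in plain Python. The function names `mse` and `psnr`, the flat-list image representation, and the default peak value of 255 (8-bit images) are assumptions made for illustration.

```python
import math


def mse(reference, restored):
    """Mean-square error between two images given as flat sequences of pixel values."""
    if len(reference) != len(restored) or len(reference) == 0:
        raise ValueError("images must be non-empty and have the same number of pixels")
    return sum((r - s) ** 2 for r, s in zip(reference, restored)) / len(reference)


def psnr(reference, restored, peak=255.0):
    """Peak signal-to-noise ratio in dB; `peak` is the maximum pixel value (assumed 255)."""
    err = mse(reference, restored)
    if err == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(peak ** 2 / err)


if __name__ == "__main__":
    # Hypothetical 4-pixel "images", used only to exercise the functions.
    ideal = [10, 20, 30, 40]
    restored = [11, 21, 31, 41]
    print(f"MSE = {mse(ideal, restored):.3f}, PSNR = {psnr(ideal, restored):.2f} dB")
```

Because PSNR is a monotonically decreasing function of MSE, a model trained with an MSE loss is implicitly trained for PSNR, and the score says nothing directly about texture realism or temporal consistency.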
Pages: 1 / 110
Number of pages: 110