Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion

被引:29
|
作者
Maggioni, Matteo [1 ]
Huang, Yibin [1 ]
Li, Cheng [1 ]
Xiao, Shuai [1 ]
Fu, Zhongqian [1 ]
Song, Fenglong [1 ]
机构
[1] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR46437.2021.00347
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, denoising methods based on deep learning have achieved unparalleled performance at the cost of large computational complexity. In this work, we propose an Efficient Multi-stage Video Denoising algorithm, called EMVD, to drastically reduce the complexity while maintaining or even improving the performance. First, a fusion stage reduces the noise through a recursive combination of all past frames in the video. Then, a denoising stage removes the noise in the fused frame. Finally, a refinement stage restores the missing high frequency in the denoised frame. All stages operate on a transform-domain representation obtained by learnable and invertible linear operators which simultaneously increase accuracy and decrease complexity of the model. A single loss on the final output is sufficient for successful convergence, hence making EMVD easy to train. Experiments on real raw data demonstrate that EMVD outperforms the state of the art when complexity is constrained, and even remains competitive against methods whose complexities are several orders of magnitude higher. Further, the low complexity and memory requirements of EMVD enable real-time video denoising on commercial SoC in mobile devices.
引用
收藏
页码:3465 / 3474
页数:10
相关论文
共 50 条
  • [31] Adaptive and Recursive Based Spatio-Temporal Filtering for Video Denoising with RWT Transformation
    Shylaja, S. L.
    Kohir, Vinayadatta V.
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [32] Spatio-temporal Sampling for Video
    Shankar, Mohan
    Pitsiauis, Nikos P.
    Brady, David
    [J]. IMAGE RECONSTRUCTION FROM INCOMPLETE DATA V, 2008, 7076
  • [33] Efficient spatio-temporal decomposition for perceptual processing of video sequences
    Lindh, P
    Lambrecht, CJVB
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL III, 1996, : 331 - 334
  • [34] Spatio-Temporal feature based VLAD for efficient Video retrieval
    Reddy, Mopuri K.
    Arora, Sahil
    Babu, R. Venkatesh
    [J]. 2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
  • [35] Efficient Motion Weighted Spatio-Temporal Video SSIM Index
    Moorthy, Anush K.
    Bovik, Alan C.
    [J]. HUMAN VISION AND ELECTRONIC IMAGING XV, 2010, 7527
  • [36] Efficient Online Spatio-Temporal Filtering for Video Event Detection
    Yan, Xinchen
    Yuan, Junsong
    Liang, Hui
    [J]. COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 769 - 785
  • [37] Deep Spatio-Temporal Random Fields for Efficient Video Segmentation
    Chandra, Siddhartha
    Couprie, Camille
    Kokkinos, Iasonas
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8915 - 8924
  • [38] An Empirical Investigation of Efficient Spatio-Temporal Modeling in Video Restoration
    Fan, Yuchen
    Yu, Jiahui
    Liu, Ding
    Huang, Thomas S.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2159 - 2168
  • [39] Multi-Stage Raw Video Denoising with Adversarial Loss and Gradient Mask
    Paliwal, Avinash
    Zeng, Libing
    Kalantari, Nima Khademi
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL PHOTOGRAPHY (ICCP), 2021,
  • [40] Video Waterdrop Removal via Spatio-Temporal Fusion in Driving Scenes
    Wen, Qiang
    Wu, Yue
    Chen, Qifeng
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10003 - 10009