Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion

被引:29
|
作者
Maggioni, Matteo [1 ]
Huang, Yibin [1 ]
Li, Cheng [1 ]
Xiao, Shuai [1 ]
Fu, Zhongqian [1 ]
Song, Fenglong [1 ]
机构
[1] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR46437.2021.00347
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, denoising methods based on deep learning have achieved unparalleled performance at the cost of large computational complexity. In this work, we propose an Efficient Multi-stage Video Denoising algorithm, called EMVD, to drastically reduce the complexity while maintaining or even improving the performance. First, a fusion stage reduces the noise through a recursive combination of all past frames in the video. Then, a denoising stage removes the noise in the fused frame. Finally, a refinement stage restores the missing high frequency in the denoised frame. All stages operate on a transform-domain representation obtained by learnable and invertible linear operators which simultaneously increase accuracy and decrease complexity of the model. A single loss on the final output is sufficient for successful convergence, hence making EMVD easy to train. Experiments on real raw data demonstrate that EMVD outperforms the state of the art when complexity is constrained, and even remains competitive against methods whose complexities are several orders of magnitude higher. Further, the low complexity and memory requirements of EMVD enable real-time video denoising on commercial SoC in mobile devices.
引用
收藏
页码:3465 / 3474
页数:10
相关论文
共 50 条
  • [1] Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement
    Liu, Jing
    Fan, Zhiwei
    Yang, Ziwen
    Su, Yuting
    Yang, Xiaokang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2444 - 2455
  • [2] User-Ranking Video Summarization With Multi-Stage Spatio-Temporal Representation
    Huang, Siyu
    Li, Xi
    Zhang, Zhongfei
    Wu, Fei
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) : 2654 - 2664
  • [3] A multi-stage spatio-temporal adaptive network for video super-resolution
    Zhang, Yuhang
    Chen, Zhenzhong
    Liu, Shan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [4] DeepVideoMVS: Multi-View Stereo on Video with Recurrent Spatio-Temporal Fusion
    Duzceker, Arda
    Galliani, Silvano
    Vogel, Christoph
    Speciale, Pablo
    Dusmanu, Mihai
    Pollefeys, Marc
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15319 - 15328
  • [5] Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-Identification
    Tang, Ziyi
    Zhang, Ruimao
    Peng, Zhanglin
    Chen, Jinrui
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7917 - 7929
  • [6] MFDGCN: Multi-Stage Spatio-Temporal Fusion Diffusion Graph Convolutional Network for Traffic Prediction
    Cui, Zhengyan
    Zhang, Junjun
    Noh, Giseop
    Park, Hyun Jun
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [7] Spatio-Temporal Two-stage Fusion for video question answering
    Xu, Feifei
    Zhu, Yitao
    Wang, Chun
    Cao, Yangze
    Zhong, Zheng
    Li, Xiongmin
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 237
  • [8] Spatio-temporal Markov random field for video denoising
    Chen, Jia
    Tang, Chi-Keung
    [J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2232 - +
  • [9] Spatio-Temporal Video Denoising Based on Attention Mechanism
    Ji, Kai
    Lei, Weimin
    Zhang, Wei
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (06)
  • [10] Adaptive Video Denoising Based on Spatio-temporal Combination
    Di, Hongwei
    Zhang, Kaihan
    Gao, Hui
    [J]. MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 1230 - 1233