Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring

被引:2
|
作者
Zhang, Huicong [1 ]
Xie, Haozhe [2 ]
Yao, Hongxun [1 ]
机构
[1] Harbin Inst Technol, Harbin, Peoples R China
[2] Nanyang Technol Univ, S Lab, Singapore, Singapore
来源
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024 | 2024年
关键词
D O I
10.1109/CVPR52733.2024.00258
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video deblurring relies on leveraging information from other frames in the video sequence to restore the blurred regions in the current frame. Mainstream approaches employ bidirectional feature propagation, spatio-temporal transformers, or a combination of both to extract information from the video sequence. However, limitations in memory and computational resources constraints the temporal window length of the spatio-temporal transformer, preventing the extraction of longer temporal contextual information from the video sequence. Additionally, bidirectional feature propagation is highly sensitive to inaccurate optical flow in blurry frames, leading to error accumulation during the propagation process. To address these issues, we propose BSSTNet, Blur-aware Spatio-temporal Sparse Transformer Network. It introduces the blur map, which converts the originally dense attention into a sparse form, enabling a more extensive utilization of information throughout the entire video sequence. Specifically, BSSTNet (1) uses a longer temporal window in the transformer, leveraging information from more distant frames to restore the blurry pixels in the current frame. (2) introduces bidirectional feature propagation guided by blur maps, which reduces error accumulation caused by the blur frame. The experimental results demonstrate the proposed BSSTNet outperforms the state-of-the-art methods on the GoPro and DVD datasets.
引用
收藏
页码:2673 / 2681
页数:9
相关论文
共 50 条
  • [31] Spatio-temporal Sampling for Video
    Shankar, Mohan
    Pitsiauis, Nikos P.
    Brady, David
    IMAGE RECONSTRUCTION FROM INCOMPLETE DATA V, 2008, 7076
  • [32] Interframe motion deblurring using spatio-temporal regularization
    Tsubaki, Ikuko
    Komatsu, Takashi
    Saito, Takahiro
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 2105 - +
  • [33] Spatio-Temporal Learning for Video Deblurring based on Two-Stream Generative Adversarial Network
    Song, Liyao
    Wang, Quan
    Lie, Haiwei
    Fan, Jiancun
    Hu, Bingliang
    NEURAL PROCESSING LETTERS, 2021, 53 (04) : 2701 - 2714
  • [34] Spatio-Temporal Learning for Video Deblurring based on Two-Stream Generative Adversarial Network
    Liyao Song
    Quan Wang
    Haiwei Li
    Jiancun Fan
    Bingliang Hu
    Neural Processing Letters, 2021, 53 : 2701 - 2714
  • [35] Spatio-Temporal Catcher: a Self-Supervised Transformer for Deepfake Video Detection
    Li, Maosen
    Li, Xurong
    Yu, Kun
    Deng, Cheng
    Huang, Heng
    Mao, Feng
    Xue, Hui
    Li, Minghao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 8707 - 8718
  • [36] Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding
    Dai, Wenrui
    Shen, Yangmei
    Tang, Xin
    Zou, Junni
    Xiong, Hongkai
    Chen, Chang Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (10) : 4580 - 4595
  • [37] MODELING SPARSE SPATIO-TEMPORAL REPRESENTATIONS FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT
    Shabeer, Muhammed P.
    Bhati, Saurabhchand
    Channappayya, Sumohana S.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1220 - 1224
  • [38] Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection
    Li, Guoqiu
    Cai, Guanxiong
    Zeng, Xingyu
    Zhao, Rui
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 333 - 350
  • [39] Video Segmentation with Spatio-Temporal Tubes
    Trichet, Remi
    Nevatia, Ramakant
    2013 10TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2013), 2013, : 330 - 335
  • [40] Spatio-temporal segmentation for video surveillance
    Sun, HZ
    Tan, TN
    ELECTRONICS LETTERS, 2001, 37 (01) : 20 - 21