A New Dataset and Transformer for Stereoscopic Video Super-Resolution

被引:10
|
作者
Imani, Hassan [1 ]
Islam, Md Baharul [1 ,2 ]
Wong, Lai-Kuan [3 ]
机构
[1] Bahcesehir Univ, Istanbul, Turkey
[2] Amer Univ Malta, Cospicua, Malta
[3] Multimedia Univ, Cyberjaya, Malaysia
关键词
ENHANCEMENT; ATTENTION;
D O I
10.1109/CVPRW56347.2022.00086
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stereo video super-resolution (SVSR) aims to enhance the spatial resolution of the low-resolution video by reconstructing the high-resolution video. The key challenges in SVSR are preserving the stereo-consistency and temporal-consistency, without which viewers may experience 3D fatigue. There are several notable works on stereoscopic image super-resolution, but there is little research on stereo video super-resolution. In this paper, we propose a novel Transformer-based model for SVSR, namely Trans-SVSR. Trans-SVSR comprises two key novel components: a spatio-temporal convolutional self-attention layer and an optical flow-based feed-forward layer that discovers the correlation across different video frames and aligns the features. The parallax attention mechanism (PAM) that uses the cross-view information to consider the significant disparities is used to fuse the stereo views. Due to the lack of a benchmark dataset suitable for the SVSR task, we collected a new stereoscopic video dataset, SVSR-Set, containing 71 full high-definition (HD) stereo videos captured using a professional stereo camera. Extensive experiments on the collected dataset, along with two other datasets, demonstrate that the Trans-SVSR can achieve competitive performance compared to the state-of-the-art methods. Project code and additional results are available at https://github.com/H-deep/Trans-SVSR/.
引用
收藏
页码:705 / 714
页数:10
相关论文
共 50 条
  • [1] Simultaneously retargeting and super-resolution for stereoscopic video
    Kai Kang
    Yang Cao
    Zengfu Wang
    [J]. Multimedia Tools and Applications, 2017, 76 : 11081 - 11095
  • [2] Simultaneously Retargeting and Super-Resolution for Stereoscopic Video
    Kang, Kai
    Zhang, Jing
    Cao, Yang
    Wang, Zeng-Fu
    [J]. COMPUTER VISION, CCCV 2015, PT II, 2015, 547 : 31 - 40
  • [3] Simultaneously retargeting and super-resolution for stereoscopic video
    Kang, Kai
    Cao, Yang
    Wang, Zengfu
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (08) : 11081 - 11095
  • [4] Deformable transformer for endoscopic video super-resolution
    Song, Xiaowei
    Tang, Hui
    Yang, Chunfeng
    Zhou, Guangquan
    Wang, Yangang
    Huang, Xinjun
    Hua, Jie
    Coatrieux, Gouenou
    He, Xiaopu
    Chen, Yang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 77
  • [5] Self-guided Transformer for Video Super-Resolution
    Xue, Tong
    Wang, Qianrui
    Huang, Xinyi
    Li, Dengshi
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2024, 14434 LNCS : 186 - 198
  • [6] Self-guided Transformer for Video Super-Resolution
    Xue, Tong
    Wang, Qianrui
    Huang, Xinyi
    Li, Dengshi
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 186 - 198
  • [7] Learning Trajectory-Aware Transformer for Video Super-Resolution
    Liu, Chengxu
    Yang, Huan
    Fu, Jianlong
    Qian, Xueming
    [J]. arXiv, 2022,
  • [8] Learning Trajectory-Aware Transformer for Video Super-Resolution
    Liu, Chengxu
    Yang, Huan
    Fu, Jianlong
    Qian, Xueming
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5677 - 5686
  • [9] Learning Trajectory-Aware Transformer for Video Super-Resolution
    Liu, Chengxu
    Yang, Huan
    Fu, Jianlong
    Qian, Xueming
    [J]. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, 2022-June : 5677 - 5686
  • [10] NTIRE 2019 Challenge on Video Deblurring and Super-Resolution: Dataset and Study
    Nah, Seungjun
    Baik, Sungyong
    Hong, Seokil
    Moon, Gyeongsik
    Son, Sanghyun
    Timofte, Radu
    Lee, Kyoung Mu
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1996 - 2005