Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution

Cited by: 57
Authors
Cong, Ruixuan [1 ,2 ,3 ]
Sheng, Hao [1 ,2 ,3 ]
Yang, Da [1 ,2 ,3 ]
Cui, Zhenglong [1 ,2 ,3 ]
Chen, Rongshan [1 ,2 ,3 ]
Affiliations
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Beihang Hangzhou Innovat Inst Yuhang, Hangzhou 310023, Peoples R China
[3] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China
Keywords
Transformers; Computational modeling; Superresolution; Spatial resolution; Feature extraction; Light fields; Convolution; Light field; transformer; super-resolution; sub-sampling spatial modeling; multi-scale angular modeling; NETWORK;
DOI
10.1109/TMM.2023.3282465
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Global context information is particularly important for comprehensive scene understanding. It helps resolve local ambiguities and smooth predictions, yielding fine-grained and coherent results. However, most existing light field (LF) processing methods use convolution layers to model spatial and angular information, and their limited receptive field prevents them from learning long-range dependencies in the LF structure. In this article, we propose a novel network based on deep efficient transformers (LF-DET) for LF spatial super-resolution. It develops a spatial-angular separable transformer encoder with two modeling strategies, termed sub-sampling spatial modeling and multi-scale angular modeling, for global context interaction. Specifically, the former uses a sub-sampling convolution layer to reduce the huge computational cost of capturing spatial information within each sub-aperture image. In this way, our model can cascade more transformers to continuously enhance feature representation with limited resources. The latter processes multi-scale macro-pixel regions to extract and aggregate angular features that focus on different disparity ranges, so that the model adapts well to disparity variations. In addition, we capture strong similarities among surrounding pixels with dynamic positional encodings, compensating for the transformers' lack of local information interaction. Experimental results on both real-world and synthetic LF datasets confirm that LF-DET achieves a significant performance improvement over state-of-the-art methods. Furthermore, LF-DET shows high robustness to disparity variations thanks to the proposed multi-scale angular modeling.
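To make the cost argument in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes that sub-sampling shrinks only the key/value grid of a self-attention layer by a stride in each spatial dimension, which the abstract does not specify. The function name `attention_pairs` and the 32 x 32 feature size are hypothetical, chosen only for illustration.

```python
def attention_pairs(height: int, width: int, stride: int = 1) -> int:
    """Count query-key pairs for self-attention over one sub-aperture image.

    Assumption (not stated in the abstract): queries keep the full
    height x width grid, while keys/values are sub-sampled by `stride`
    along each spatial dimension.
    """
    queries = height * width
    keys = (height // stride) * (width // stride)
    return queries * keys


if __name__ == "__main__":
    full = attention_pairs(32, 32)               # no sub-sampling
    reduced = attention_pairs(32, 32, stride=2)  # stride-2 sub-sampling
    print(full, reduced, full // reduced)        # 1048576 262144 4
```

Under this assumption, a stride of s cuts the number of attention scores by a factor of s squared, which is one plausible reading of how the model could afford to cascade more transformer blocks within a fixed budget.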
Pages: 1421-1435
Number of pages: 15
Related papers
50 in total
  • [1] Exploiting Spatial and Angular Correlations with Deep Efficient Transformers for Light Field Image Super-Resolution
    Cong, Ruixuan
    Sheng, Hao
    Yang, Da
    Cui, Zhenglong
    Chen, Rongshan
    IEEE Transactions on Multimedia, 2024, 26 : 1421 - 1435
  • [2] Light Field Image Super-Resolution With Transformers
    Liang, Zhengyu
    Wang, Yingqian
    Wang, Longguang
    Yang, Jungang
    Zhou, Shilin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 563 - 567
  • [3] Light Field Spatial Super-Resolution Using Deep Efficient Spatial-Angular Separable Convolution
    Yeung, Henry Wing Fung
    Hou, Junhui
    Chen, Xiaoming
    Chen, Jie
    Chen, Zhibo
    Chung, Yuk Ying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2319 - 2330
  • [4] LF-SAET: Cascaded Spatial-Angular-EPI Transformers for Light Field Image Super-Resolution
    Zhang, Hao
    Yu, Junle
    Wu, Chenyu
    Meng, Jiahan
    Zhou, Wenhui
    PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 525 - 538
  • [5] Joint Light Field Spatial and Angular Super-Resolution From a Single Image
    Ivan, Andre
    Williem
    Park, In Kyu
    IEEE ACCESS, 2020, 8 : 112562 - 112573
  • [6] Deep Spatial-Angular Regularization for Light Field Imaging, Denoising, and Super-Resolution
    Guo, Mantang
    Hou, Junhui
    Jin, Jing
    Chen, Jie
    Chau, Lap-Pui
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6094 - 6110
  • [7] Hierarchical spatial-angular integration for lightweight light field image super-resolution
    Li, Meng
    Ma, Bo
    Wang, Shunzhou
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [8] Light Field Super-Resolution Based on Spatial and Angular Attention
    Li, Donglin
    Yang, Da
    Wang, Sizhe
    Sheng, Hao
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 314 - 325
  • [9] Spatial-angular-epipolar transformer for light field spatial and angular super-resolution
    Wang, Sizhe
    Sheng, Hao
    Chen, Rongshan
    Yang, Da
    Cui, Zhenglong
    Cong, Ruixuan
    Xiong, Zhang
    DISPLAYS, 2024, 85
  • [10] Light field image super-resolution based on raw data with transformers
    Guo, Xiao
    Sang, Xinzhu
    Yan, Binbin
    Chen, Duo
    Wang, Peng
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2022, 39 (12) : 2131 - 2141