Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution

Cited by: 57
Authors
Cong, Ruixuan [1, 2, 3]
Sheng, Hao [1, 2, 3]
Yang, Da [1, 2, 3]
Cui, Zhenglong [1, 2, 3]
Chen, Rongshan [1, 2, 3]
Affiliations
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Beihang Hangzhou Innovat Inst Yuhang, Hangzhou 310023, Peoples R China
[3] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China
Keywords
Transformers; Computational modeling; Superresolution; Spatial resolution; Feature extraction; Light fields; Convolution; Light field; transformer; super-resolution; sub-sampling spatial modeling; multi-scale angular modeling; NETWORK
DOI
10.1109/TMM.2023.3282465
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Global context information is particularly important for comprehensive scene understanding: it helps resolve local ambiguities and smooth predictions, yielding fine-grained and coherent results. However, most existing light field (LF) processing methods rely on convolution layers to model spatial and angular information, and their limited receptive field prevents them from learning long-range dependencies in the LF structure. In this article, we propose a novel network based on deep efficient transformers (LF-DET) for LF spatial super-resolution. It develops a spatial-angular separable transformer encoder with two modeling strategies, termed sub-sampling spatial modeling and multi-scale angular modeling, for global context interaction. Specifically, the former uses a sub-sampling convolution layer to alleviate the huge computational cost of capturing spatial information within each sub-aperture image. In this way, our model can cascade more transformers to continuously enhance the feature representation with limited resources. The latter processes multi-scale macro-pixel regions to extract and aggregate angular features that focus on different disparity ranges, so that the model adapts well to disparity variations. In addition, we capture strong similarities among surrounding pixels with dynamic positional encodings, compensating for the transformers' lack of local information interaction. Experimental results on both real-world and synthetic LF datasets confirm that LF-DET achieves a significant performance improvement over state-of-the-art methods. Furthermore, LF-DET shows high robustness to disparity variations thanks to the proposed multi-scale angular modeling.
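To make the cost argument behind sub-sampling spatial modeling concrete, the following is a minimal back-of-the-envelope sketch, not the LF-DET implementation. It assumes, hypothetically, that sub-sampling is applied only to the keys and values of self-attention within one sub-aperture image while queries stay at full resolution; the abstract does not state where the sub-sampling convolution sits. The function name `attention_score_count` and the parameter `kv_stride` are illustrative, not taken from the paper.

```python
def attention_score_count(height, width, kv_stride=1):
    """Count query-key score entries for self-attention over one sub-aperture image.

    Assumption (not stated in the abstract): queries keep full resolution,
    while keys and values are sub-sampled by a strided layer with stride
    `kv_stride` along each spatial axis.
    """
    if kv_stride < 1:
        raise ValueError("kv_stride must be a positive integer")
    num_queries = height * width
    # Ceiling division: a partially covered border still yields one token.
    kv_height = -(-height // kv_stride)
    kv_width = -(-width // kv_stride)
    return num_queries * kv_height * kv_width


# A 32 x 32 feature map: full attention versus stride-4 sub-sampled keys/values.
full = attention_score_count(32, 32)
reduced = attention_score_count(32, 32, kv_stride=4)
print(full, reduced, full // reduced)  # 1048576 65536 16
```

Under these assumptions the number of attention scores shrinks by the square of the stride, which is one way a model could afford to cascade more transformer layers within a fixed budget.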
Pages: 1421-1435
Page count: 15