Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution

被引：57

作者：

Cong, Ruixuan ^{[1
,2
,3
]}

Sheng, Hao ^{[1
,2
,3
]}

Yang, Da ^{[1
,2
,3
]}

Cui, Zhenglong ^{[1
,2
,3
]}

Chen, Rongshan ^{[1
,2
,3
]}

机构：

[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Beihang Hangzhou Innovat Inst Yuhang, Hangzhou 310023, Peoples R China

[3] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

关键词：

Transformers; Computational modeling; Superresolution; Spatial resolution; Feature extraction; Light fields; Convolution; Light field; transformer; super-resolution; sub-sampling spatial modeling; multi-scale angular modeling; NETWORK;

D O I：

10.1109/TMM.2023.3282465

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Global context information is particularly important for comprehensive scene understanding. It helps clarify local confusions and smooth predictions to achieve fine-grained and coherent results. However, most existing light field processing methods leverage convolution layers to model spatial and angular information. The limited receptive field restricts them to learn long-range dependency in LF structure. In this article, we propose a novel network based on deep efficient transformers (i.e., LF-DET) for LF spatial super-resolution. It develops a spatial-angular separable transformer encoder with two modeling strategies termed as sub-sampling spatial modeling and multi-scale angular modeling for global context interaction. Specifically, the former utilizes a sub-sampling convolution layer to alleviate the problem of huge computational cost when capturing spatial information within each sub-aperture image. In this way, our model can cascade more transformers to continuously enhance feature representation with limited resources. The latter processes multi-scale macro-pixel regions to extract and aggregate angular features focusing on different disparity ranges to well adapt to disparity variations. Besides, we capture strong similarities among surrounding pixels by dynamic positional encodings to fill the gap of transformers that lack of local information interaction. The experimental results on both real-world and synthetic LF datasets confirm our LF-DET achieves a significant performance improvement compared with state-of-the-art methods. Furthermore, our LF-DET shows high robustness to disparity variations through the proposed multi-scale angular modeling.

引用

页码：1421 / 1435

页数：15

共 50 条

[21] LFC-SASR: LIGHT FIELD CODING USING SPATIAL AND ANGULAR SUPER-RESOLUTION
Cetinkaya, Ekrem
Amirpour, Hadi
Timmerer, Christian
2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,
[22] Light Field Angular Super-Resolution Network Based on Convolutional Transformer and Deep Deblurring
Liu, Deyang
Mao, Yifan
Zuo, Yifan
An, Ping
Fang, Yuming
IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2024, 10 : 1736 - 1748
[23] A Deep Learning Based Spatial Super-Resolution Approach for Light Field Content
Wafa, Abrar
Pourazad, Mahsa T.
Nasiopoulos, Panos
IEEE ACCESS, 2021, 9 : 2080 - 2092
[24] Learning a Deep Convolutional Network for Light-Field Image Super-Resolution
Yoon, Youngjin
Jeon, Hae-Gon
Yoo, Donggeun
Lee, Joon-Young
Kweon, In So
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 57 - 65
[25] FLEXIBLE SPATIAL AND ANGULAR LIGHT FIELD SUPER RESOLUTION
Ma, Dizhi
Lumsdaine, Andrew
Zhou, Wenhui
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2970 - 2974
[26] Spatial-angular enhanced network for light-field image super-resolution with geometry-assisted upsampling
Liu, Deyang
Li, Shizheng
Chen, Yiren
Zhang, Peng
Zhou, Xiaofei
Zha, Hainie
JOURNAL OF ELECTRONIC IMAGING, 2025, 34 (01)
[27] Residual Networks for Light Field Image Super-Resolution
Zhang, Shuo
Lin, Youfang
Sheng, Hao
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11038 - 11047
[28] Light Field Angular Super-Resolution via Spatial-Angular Correlation Extracted by Deformable Convolutional Network
Li, Daichuan
Zhong, Rui
Yang, Yungang
SENSORS, 2025, 25 (04)
[29] Learning from EPI-Volume-Stack for Light Field image angular super-resolution
Liu, Deyang
Wu, Qiang
Huang, Yan
Huang, Xinpeng
An, Ping
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 97
[30] Light Field Super-Resolution By Jointly Exploiting Internal and External Similarities
Cheng, Zhen
Xiong, Zhiwei
Liu, Dong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (08) : 2604 - 2616

← 1 2 3 4 5 →