Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution

Cited by: 57

Authors:

Cong, Ruixuan ^{[1,2,3]}

Sheng, Hao ^{[1,2,3]}

Yang, Da ^{[1,2,3]}

Cui, Zhenglong ^{[1,2,3]}

Chen, Rongshan ^{[1,2,3]}

Affiliations:

[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Beihang Hangzhou Innovat Inst Yuhang, Hangzhou 310023, Peoples R China

[3] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2024 / Vol. 26

Keywords:

Transformers; Computational modeling; Superresolution; Spatial resolution; Feature extraction; Light fields; Convolution; Light field; transformer; super-resolution; sub-sampling spatial modeling; multi-scale angular modeling; NETWORK;

DOI:

10.1109/TMM.2023.3282465

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Global context information is particularly important for comprehensive scene understanding. It helps resolve local ambiguities and smooth predictions, yielding fine-grained and coherent results. However, most existing light field (LF) processing methods rely on convolution layers to model spatial and angular information. Their limited receptive field prevents them from learning long-range dependencies in the LF structure. In this article, we propose a novel network based on deep efficient transformers (i.e., LF-DET) for LF spatial super-resolution. It develops a spatial-angular separable transformer encoder with two modeling strategies, termed sub-sampling spatial modeling and multi-scale angular modeling, for global context interaction. Specifically, the former uses a sub-sampling convolution layer to reduce the heavy computational cost of capturing spatial information within each sub-aperture image. In this way, our model can cascade more transformers to continuously enhance feature representation with limited resources. The latter processes multi-scale macro-pixel regions to extract and aggregate angular features focusing on different disparity ranges, so that the model adapts well to disparity variations. In addition, we capture strong similarities among surrounding pixels through dynamic positional encodings, compensating for the lack of local information interaction in transformers. Experimental results on both real-world and synthetic LF datasets confirm that LF-DET achieves a significant performance improvement over state-of-the-art methods. Furthermore, LF-DET shows high robustness to disparity variations through the proposed multi-scale angular modeling.
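The two modeling strategies named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names, the `lf[u][v][h][w]` nested-list layout, and the assumption that sub-sampling shrinks the whole token grid by a fixed stride are hypothetical choices made for illustration. The first function counts the attention-score pairs for full self-attention over one sub-aperture image, which grows quadratically with the number of tokens. The second gathers the angular samples that belong to a block of macro-pixels, so larger blocks give larger angular regions.

```python
def attention_cost(height, width, stride=1):
    """Return (tokens, score_pairs) for full self-attention over one
    sub-aperture image whose feature map is first sub-sampled by `stride`.

    Assumption: the whole grid is sub-sampled; LF-DET's actual layer may differ.
    """
    tokens = (height // stride) * (width // stride)
    return tokens, tokens * tokens


def macro_pixel_region(lf, h0, w0, size=1):
    """Collect angular samples from a size x size block of macro-pixels.

    `lf[u][v][h][w]` is a hypothetical 4-D light field stored as nested lists
    (U x V views, each H x W). The macro-pixel at (h, w) is the set of samples
    lf[u][v][h][w] taken over all views (u, v).
    """
    tokens = []
    for h in range(h0, h0 + size):
        for w in range(w0, w0 + size):
            for view_row in lf:
                for view in view_row:
                    tokens.append(view[h][w])
    return tokens


if __name__ == "__main__":
    # Spatial side: a 32 x 32 feature map with and without 2x sub-sampling.
    print(attention_cost(32, 32, stride=1))  # (1024, 1048576)
    print(attention_cost(32, 32, stride=2))  # (256, 65536)

    # Angular side: toy light field with 2 x 2 views of 4 x 4 pixels each.
    # Every sample is encoded as (u, v, h, w) so the grouping is easy to inspect.
    lf = [[[[(u, v, h, w) for w in range(4)] for h in range(4)]
           for v in range(2)] for u in range(2)]
    for size in (1, 2):
        print(size, len(macro_pixel_region(lf, 0, 0, size)))
```

Under these assumptions, a stride of 2 cuts the token count by 4 and the number of attention-score pairs by 16, which is the kind of saving that leaves room to cascade more transformer layers.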


Pages: 1421-1435

Page count: 15

50 results in total

[1] Exploiting Spatial and Angular Correlations with Deep Efficient Transformers for Light Field Image Super-Resolution
Cong, Ruixuan
Sheng, Hao
Yang, Da
Cui, Zhenglong
Chen, Rongshan
IEEE Transactions on Multimedia, 2024, 26 : 1421 - 1435
[2] Light Field Image Super-Resolution With Transformers
Liang, Zhengyu
Wang, Yingqian
Wang, Longguang
Yang, Jungang
Zhou, Shilin
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 563 - 567
[3] Light Field Spatial Super-Resolution Using Deep Efficient Spatial-Angular Separable Convolution
Yeung, Henry Wing Fung
Hou, Junhui
Chen, Xiaoming
Chen, Jie
Chen, Zhibo
Chung, Yuk Ying
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2319 - 2330
[4] LF-SAET: Cascaded Spatial-Angular-EPI Transformers for Light Field Image Super-Resolution
Zhang, Hao
Yu, Junle
Wu, Chenyu
Meng, Jiahan
Zhou, Wenhui
PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 525 - 538
[5] Joint Light Field Spatial and Angular Super-Resolution From a Single Image
Ivan, Andre
Williem
Park, In Kyu
IEEE ACCESS, 2020, 8 : 112562 - 112573
[6] Deep Spatial-Angular Regularization for Light Field Imaging, Denoising, and Super-Resolution
Guo, Mantang
Hou, Junhui
Jin, Jing
Chen, Jie
Chau, Lap-Pui
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6094 - 6110
[7] Hierarchical spatial-angular integration for lightweight light field image super-resolution
Li, Meng
Ma, Bo
Wang, Shunzhou
KNOWLEDGE-BASED SYSTEMS, 2025, 315
[8] Light Field Super-Resolution Based on Spatial and Angular Attention
Li, Donglin
Yang, Da
Wang, Sizhe
Sheng, Hao
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 314 - 325
[9] Spatial-angular-epipolar transformer for light field spatial and angular super-resolution
Wang, Sizhe
Sheng, Hao
Chen, Rongshan
Yang, Da
Cui, Zhenglong
Cong, Ruixuan
Xiong, Zhang
DISPLAYS, 2024, 85
[10] Light field image super-resolution based on raw data with transformers
Guo, Xiao
Sang, Xinzhu
Yan, Binbin
Chen, Duo
Wang, Peng
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2022, 39 (12) : 2131 - 2141
