Focal Aggregation Transformer for Light Field Image Super-Resolution

被引:0
|
作者
Wang, Shunzhou [1 ,3 ]
Lu, Yao [2 ,3 ]
Xia, Wang [3 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China
[2] Shenzhen MSU BIT Univ, Guangdong Lab Machine Percept & Intelligent Comp, Dept Engn, Shenzhen 518172, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci, Beijing 100081, Peoples R China
关键词
Light field; Image super-resolution; Inter-intra view feature aggregation; Hierarchical feature aggregation; Transformer; NETWORK;
D O I
10.1007/978-981-97-8685-5_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer has achieved significant progress in light field image super-resolution (LFSR) due to its long-range dependency learning ability for inter-intra view feature aggregation. However, locality information of each sub-aperture view is ignored in intra-view and inter-view aggregation with Transformer, hampering the high-quality light field image reconstruction. To this end, we propose a global to local aggregation approach termed Focal Aggregation for LFSR. In particular, Focal Aggregation includes two strategies: inter-view global to local aggregation (InterG2L) and intra-view global to local aggregation (IntraG2L). InterG2L is proposed to obtain complementary information from different views. IntraG2L is developed to extract efficient representations of a single sub-aperture view. InterG2L and IntraG2L are organized in a cascade way so that the global information of the input can be gathered for each sub-aperture image in a coarse to fine aggregation approach. Meanwhile, we also develop a global to local hierarchical feature aggregation approach named HierG2L, which enhances the last hierarchical feature used for light field reconstruction according to the input. Based on the above three global to local aggregation strategies, we construct a focal aggregation transformer (FAT) for LFSR. Experiments are performed on commonly-used LFSR benchmarks. Results demonstrate that FAT achieves superior results compared with other leading methods on synthesized and real data.
引用
收藏
页码:524 / 538
页数:15
相关论文
共 50 条
  • [1] MULTI-GRANULARITY AGGREGATION TRANSFORMER FOR LIGHT FIELD IMAGE SUPER-RESOLUTION
    Wang, Zijian
    Lu, Yao
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 261 - 265
  • [2] Dual Aggregation Transformer for Image Super-Resolution
    Chen, Zheng
    Zhang, Yulun
    Gu, Jinjin
    Kong, Linghe
    Yang, Xiaokang
    Yu, Fisher
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12278 - 12287
  • [3] Detail-Preserving Transformer for Light Field Image Super-resolution
    Wang, Shunzhou
    Zhou, Tianfei
    Lu, Yao
    Di, Huijun
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2522 - 2530
  • [4] LOCAL-GLOBAL FEATURE AGGREGATION FOR LIGHT FIELD IMAGE SUPER-RESOLUTION
    Wang, Yan
    Lu, Yao
    Wang, Shunzhou
    Zhang, Wenyao
    Wang, Zijian
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2160 - 2164
  • [5] Light Field Image Super-Resolution With Transformers
    Liang, Zhengyu
    Wang, Yingqian
    Wang, Longguang
    Yang, Jungang
    Zhou, Shilin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 563 - 567
  • [6] Reivew of Light Field Image Super-Resolution
    Yu, Li
    Ma, Yunpeng
    Hong, Song
    Chen, Ke
    ELECTRONICS, 2022, 11 (12)
  • [7] Local-global aggregation transformer for enhanced image super-resolution
    Wu, Yuxiang
    Wang, Xiaoyan
    Gao, Yuzhao
    Liu, Xiaoyan
    Dou, Yan
    DIGITAL SIGNAL PROCESSING, 2025, 161
  • [8] Transformer for Single Image Super-Resolution
    Lu, Zhisheng
    Li, Juncheng
    Liu, Hong
    Huang, Chaoyan
    Zhang, Linlin
    Zeng, Tieyong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 456 - 465
  • [9] Residual Networks for Light Field Image Super-Resolution
    Zhang, Shuo
    Lin, Youfang
    Sheng, Hao
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11038 - 11047
  • [10] Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-Resolution
    Hu, Zeke Zexi
    Chen, Xiaoming
    Chung, Vera Yuk Ying
    Shen, Yiran
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1334 - 1348