A Lightweight CNN-Transformer Implemented via Structural Re-Parameterization and Hybrid Attention for Remote Sensing Image Super-Resolution

被引:0
|
作者
Wang, Jie [1 ]
Li, Hongwei [1 ]
Li, Yifan [2 ]
Qin, Zilong [3 ]
机构
[1] Zhengzhou Univ, Sch Geosci & Technol, Zhengzhou 450052, Peoples R China
[2] Univ Cologne, Inst Geophys & Meteorol, D-50923 Cologne, Germany
[3] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
super-resolution; remote sensing; CNN-Transformer; lightweight; hybrid attention;
D O I
10.3390/ijgi14010008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Remote sensing imagery contains rich information about geographical targets, and performing super-resolution (SR) reconstruction on such images requires greater feature representation capabilities. Convolutional neural network (CNN)-based methods excel at extracting intricate local features but fall short in terms of capturing global representations. While transformer methods are capable of learning long-distance dependencies, they often overlook local feature details, which can diminish the discriminability between the background and the foreground. Moreover, the distinctive architectures of transformers, their extensive parameter counts, and their reliance on large-scale training datasets impose constraints on transformer applications in remote sensing image feature extraction tasks. To address these challenges, this study introduces a novel hybrid CNN-Transformer network model named RepCHAT for remote sensing single image reconstruction, which incorporates a structural re-parameterization technique and a hybrid attention mechanism. This method leverages the strengths of transformers in terms of learning long-distance dependencies (global features) and CNNs with respect to extracting local features. The proposed approach achieves SR reconstruction for remote sensing images with fewer parameters and less computational overhead than those of traditional transformers and high-performance CNN models. We develop a multiscale feature extraction module that integrates both spatial- and frequency-domain features and employs structural re-parameterization theory to increase the inference efficiency of the model. Furthermore, we incorporate depthwise-separable convolution into the transformer block to bolster the local feature learning capabilities of the transformer. The method we propose achieves the optimal performance for remote sensing single-image super-resolution reconstruction and outperforms the competing methods by 0.28-1.05 dB (x4 scale) in terms of signal-to-noise ratio (PSNR). Experimental results indicate that the RepCHAT model proposed in this study maintains a high performance with significantly reduced complexity, making it suitable for deployment on edge devices.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Design of lightweight re-parameterized remote sensing image super-resolution network
    Yi J.
    Chen J.
    Cao F.
    Li J.
    Xie W.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, (02): : 268 - 285
  • [22] Lightweight image super-resolution via multi-branch aware CNN and efficient transformer
    Xiang Gao
    Sining Wu
    Ying Zhou
    Xinrong Wu
    Fan Wang
    Xiaopeng Hu
    Neural Computing and Applications, 2024, 36 : 5285 - 5303
  • [23] SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution
    Bao, Qiqi
    Liu, Yunmeng
    Gang, Bowen
    Yang, Wenming
    Liao, Qingmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8554 - 8565
  • [24] Lightweight image super-resolution via multi-branch aware CNN and efficient transformer
    Gao, Xiang
    Wu, Sining
    Zhou, Ying
    Wu, Xinrong
    Wang, Fan
    Hu, Xiaopeng
    NEURAL COMPUTING & APPLICATIONS, 2023, 36 (10): : 5285 - 5303
  • [25] Multi-scale implicit transformer with re-parameterization for arbitrary-scale super-resolution
    Zhu, Jinchen
    Zhang, Mingjian
    Zheng, Ling
    Weng, Shizhuang
    PATTERN RECOGNITION, 2025, 162
  • [26] RingMo-Lite: A Remote Sensing Lightweight Network With CNN-Transformer Hybrid Framework
    Wang, Yuelei
    Zhang, Ting
    Zhao, Liangjin
    Hu, Lin
    Wang, Zhechao
    Niu, Ziqing
    Cheng, Peirui
    Chen, Kaiqiang
    Zeng, Xuan
    Wang, Zhirui
    Wang, Hongqi
    Sun, Xian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 20
  • [27] Lightweight Remote-Sensing Image Super-Resolution via Re-Parameterized Feature Distillation Network
    Zhang, Tianlin
    Bian, Chunjiang
    Zhang, Xiaoming
    Chen, Hongzhen
    Chen, Shi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [28] Efficient Swin Transformer for Remote Sensing Image Super-Resolution
    Kang, Xudong
    Duan, Puhong
    Li, Jier
    Li, Shutao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6367 - 6379
  • [29] Dual Attention Fusion Enhancement Network for Lightweight Remote-Sensing Image Super-Resolution
    Chen, Wangyou
    Qu, Shenming
    Luo, Laigan
    Lu, Yongyong
    REMOTE SENSING, 2025, 17 (06)
  • [30] Remote sensing image super-resolution via cross-scale hierarchical transformer
    Xiao, Yi
    Yuan, Qiangqiang
    He, Jiang
    Zhang, Liangpei
    GEO-SPATIAL INFORMATION SCIENCE, 2024, 27 (06) : 1914 - 1930