A Lightweight CNN-Transformer Implemented via Structural Re-Parameterization and Hybrid Attention for Remote Sensing Image Super-Resolution

被引:0
|
作者
Wang, Jie [1 ]
Li, Hongwei [1 ]
Li, Yifan [2 ]
Qin, Zilong [3 ]
机构
[1] Zhengzhou Univ, Sch Geosci & Technol, Zhengzhou 450052, Peoples R China
[2] Univ Cologne, Inst Geophys & Meteorol, D-50923 Cologne, Germany
[3] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
super-resolution; remote sensing; CNN-Transformer; lightweight; hybrid attention;
D O I
10.3390/ijgi14010008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Remote sensing imagery contains rich information about geographical targets, and performing super-resolution (SR) reconstruction on such images requires greater feature representation capabilities. Convolutional neural network (CNN)-based methods excel at extracting intricate local features but fall short in terms of capturing global representations. While transformer methods are capable of learning long-distance dependencies, they often overlook local feature details, which can diminish the discriminability between the background and the foreground. Moreover, the distinctive architectures of transformers, their extensive parameter counts, and their reliance on large-scale training datasets impose constraints on transformer applications in remote sensing image feature extraction tasks. To address these challenges, this study introduces a novel hybrid CNN-Transformer network model named RepCHAT for remote sensing single image reconstruction, which incorporates a structural re-parameterization technique and a hybrid attention mechanism. This method leverages the strengths of transformers in terms of learning long-distance dependencies (global features) and CNNs with respect to extracting local features. The proposed approach achieves SR reconstruction for remote sensing images with fewer parameters and less computational overhead than those of traditional transformers and high-performance CNN models. We develop a multiscale feature extraction module that integrates both spatial- and frequency-domain features and employs structural re-parameterization theory to increase the inference efficiency of the model. Furthermore, we incorporate depthwise-separable convolution into the transformer block to bolster the local feature learning capabilities of the transformer. The method we propose achieves the optimal performance for remote sensing single-image super-resolution reconstruction and outperforms the competing methods by 0.28-1.05 dB (x4 scale) in terms of signal-to-noise ratio (PSNR). Experimental results indicate that the RepCHAT model proposed in this study maintains a high performance with significantly reduced complexity, making it suitable for deployment on edge devices.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] A lightweight distillation CNN-transformer architecture for remote sensing image super-resolution
    Wang, Yu
    Shao, Zhenfeng
    Lu, Tao
    Liu, Lifeng
    Huang, Xiao
    Wang, Jiaming
    Jiang, Kui
    Zeng, Kangli
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 3560 - 3579
  • [2] Lightweight Super-Resolution Reconstruction Vision Transformers of Remote Sensing Image Based on Structural Re-Parameterization
    Bian, Jiaming
    Liu, Ye
    Chen, Jun
    Mase, Atsushi
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [3] An Efficient Hybrid CNN-Transformer Approach for Remote Sensing Super-Resolution
    Zhang, Wenjian
    Tan, Zheng
    Lv, Qunbo
    Li, Jiaao
    Zhu, Baoyu
    Liu, Yangyang
    REMOTE SENSING, 2024, 16 (05)
  • [4] An Amalgamated CNN-Transformer Network for Lightweight Image Super-Resolution
    Fang, Jinsheng
    Lin, Hanjiang
    Zeng, Kun
    Journal of Network Intelligence, 2024, 9 (03): : 1376 - 1387
  • [5] A CNN-transformer hybrid network with selective fusion and dual attention for image super-resolution
    Zhang, Chun
    Wang, Jin
    Shi, Yunhui
    Yin, Baocai
    Ling, Nam
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [6] A Hybrid Network of CNN and Transformer for Lightweight Image Super-Resolution
    Fang, Jinsheng
    Lin, Hanjiang
    Chen, Xinyu
    Zeng, Kun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1102 - 1111
  • [7] Image Super-Resolution Reconstruction Method Based on Lightweight Symmetric CNN-Transformer
    Wang, Tingwei
    Zhao, Jianwei
    Zhou, Zhenghua
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (07): : 626 - 637
  • [8] EHAT:Enhanced Hybrid Attention Transformer for Remote Sensing Image Super-Resolution
    Wang, Jian
    Xie, Zexin
    Du, Yanlin
    Song, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 225 - 237
  • [9] Lightweight Image Super-Resolution Based on Re-Parameterization and Self-Calibrated Convolution
    Zhang, Sufan
    Chen, Xi
    Huang, Xingwei
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] HSACT: A hierarchical semantic-aware CNN-Transformer for remote sensing image spectral super-resolution
    Zhou, Chengle
    He, Zhi
    Zou, Liwei
    Li, Yunfei
    Plaza, Antonio
    NEUROCOMPUTING, 2025, 636