A Unified Generative Adversarial Network With Convolution and Transformer for Remote Sensing Image Fusion

被引:0
|
作者
Wu, Yuanyuan [1 ,2 ]
Huang, Mengxing [1 ,3 ]
机构
[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China
[2] Guangdong Ocean Univ, Sch Elect & Informat Engn, Zhanjiang 524088, Peoples R China
[3] Hainan Univ, State Key Lab Marine Resource Utilizat South China, Haikou 570228, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Spatial resolution; Image resolution; Transformers; Generative adversarial networks; Biological system modeling; Pansharpening; Data models; Bidirectional local-global feature encoder; convolution and Transformer; multihead cross-attention fusion; multiresolution convolutional Transformer discriminators; remote sensing image (RSI) unified fusion model; SATELLITE IMAGES; LANDSAT; QUALITY; REFLECTANCE; FRAMEWORK; MODEL; MS;
D O I
10.1109/TGRS.2024.3441719
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Images derived from an individual sensor fail to simultaneously satisfy the demands of high spatial, spectral, and temporal resolutions. Multisource remote sensing image (RSI) fusion provides efficient access to high-spatial-resolution multispectral (HRMS) images [spatial-spectral fusion (SSF)] and high temporal- and spatial-resolution images [spatiotemporal fusion (STF)]. While existing deep learning (DL)-based models can mainly implement either SSF or STF, there is an urgent need for models that can simultaneously implement both SSF and STF. A unified generative adversarial network with convolution and Transformer (CTUGAN) for SSF and STF is proposed. CTUGAN contains a adaptive convolutional Transformer generator (ACTG) and multiresolution convolutional Transformer discriminator (MCTD), both with the convolution and Transformer. First, a bidirectional local-global feature encoder is devised in the ACTG to extract local-global features via a high-to-low resolution and a low-to-high resolution. Then, a multihead cross-attention fusion decoder (MCAFD) is devised to aggregate and fuse complementary local-global features of various levels and resolutions hierarchically to restore valuable information. Moreover, MCTDs adversely learn multiresolution local-global features to identify the relative reality of products, and a generalized loss function is built to accomplish full supervision. Finally, numerous experiments on the SSF data (Gaofen-2 (GF-2) and QuikBird) and STF data [Coleambally Irrigation Area (CIA) and lower Gwydir catchment (LGC)] demonstrate that the proposed CTUGAN model outperforms both subjective and objective evaluations.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Research on classification method of hyperspectral remote sensing image based on Generative Adversarial Network
    Zhang, Jian
    Bao, Wenxing
    National Remote Sensing Bulletin, 2022, 26 (02) : 416 - 430
  • [32] A remote sensing satellite image compression method based on conditional generative adversarial network
    Cheng, Kan
    Zou, Yafang
    Zhao, Yuting
    Jin, Hao
    Li, Chengchao
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXIX, 2023, 12733
  • [33] Multimodal Fusion Generative Adversarial Network for Image Synthesis
    Zhao, Liang
    Hu, Qinghao
    Li, Xiaoyuan
    Zhao, Jingyuan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1865 - 1869
  • [34] Unified Binary Generative Adversarial Network for Image Retrieval and Compression
    Jingkuan Song
    Tao He
    Lianli Gao
    Xing Xu
    Alan Hanjalic
    Heng Tao Shen
    International Journal of Computer Vision, 2020, 128 : 2243 - 2264
  • [35] Unified Binary Generative Adversarial Network for Image Retrieval and Compression
    Song, Jingkuan
    He, Tao
    Gao, Lianli
    Xu, Xing
    Hanjalic, Alan
    Shen, Heng Tao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (8-9) : 2243 - 2264
  • [36] Hyperspectral Image Classification Based on Transformer and Generative Adversarial Network
    Wang, Yajie
    Shi, Zhonghui
    Han, Shengyu
    Wei, Zhihao
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 212 - 225
  • [37] An Attention Encoder-Decoder Network Based on Generative Adversarial Network for Remote Sensing Image Dehazing
    Zhao, Liquan
    Zhang, Yupeng
    Cui, Ying
    IEEE SENSORS JOURNAL, 2022, 22 (11) : 10890 - 10900
  • [38] Attentive generative adversarial network for removing thin cloud from a single remote sensing image
    Chen, Hui
    Chen, Rong
    Li, Nannan
    IET IMAGE PROCESSING, 2021, 15 (04) : 856 - 867
  • [39] Remote sensing image dehazing using generative adversarial network with texture and color space enhancement
    Shen, Helin
    Zhong, Tie
    Jia, Yanfei
    Wu, Chunming
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [40] An improved unsupervised representation learning generative adversarial network for remote sensing image scene classification
    Wei, Yufan
    Luo, Xiaobo
    Hu, Lixin
    Peng, Yidong
    Feng, Jiangfan
    REMOTE SENSING LETTERS, 2020, 11 (06) : 598 - 607