Multiscale spatial-spectral transformer network for hyperspectral and multispectral image fusion

被引:36
|
作者
Jia, Sen
Min, Zhichao
Fu, Xiyou [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Hyperspectral image (HSI); Multispectral image (MSI); Transformer; Pre-training; Spectral multi-head self-attention; Image fusion; TENSOR FACTORIZATION; SUPERRESOLUTION; NET;
D O I
10.1016/j.inffus.2023.03.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fusing hyperspectral images (HSIs) and multispectral images (MSIs) is an economic and feasible way to obtain images with both high spectral resolution and spatial resolution. Due to the limited receptive field of convolution kernels, fusion methods based on convolutional neural networks (CNNs) fail to take advantage of the global relationship in a feature map. In this paper, to exploit the powerful capability of Transformer to extract global information from the whole feature map for fusion, we propose a novel Multiscale Spatial- spectral Transformer Network (MSST-Net). The proposed network is a two-branch network that integrates the self-attention mechanism of the Transformer to extract spectral features from HSI and spatial features from MSI, respectively. Before feature extraction, cross-modality concatenations are performed to achieve cross -modality information interaction between the two branches. Then, we propose a spectral Transformer (SpeT) to extract spectral features and introduce multiscale band/patch embeddings to obtain multiscale features through SpeTs and spatial Transformers (SpaTs). To further improve the network's performance and generalization, we proposed a self-supervised pre-training strategy, in which a masked bands autoencoder (MBAE) and a masked patches autoencoder (MPAE) are specially designed for self-supervised pre-training of the SpeTs and SpaTs. Extensive experiments on simulated and real datasets illustrate that the proposed network can achieve better performance when compared to other state-of-the-art fusion methods. The code of MSST-Net will be available at http://www.jiasen.tech/papers/ for the sake of reproducibility.
引用
收藏
页码:117 / 129
页数:13
相关论文
共 50 条
  • [1] Hybrid Multiscale Spatial-Spectral Transformer for Hyperspectral Image Classification
    He, Yan
    Tu, Bing
    Liu, Bo
    Chen, Yunyun
    Li, Jun
    Plaza, Antonio
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [2] Multispectral and hyperspectral image fusion with spatial-spectral sparse representation
    Dian, Renwei
    Li, Shutao
    Fang, Leyuan
    Wei, Qi
    [J]. INFORMATION FUSION, 2019, 49 : 262 - 270
  • [3] Semantic and spatial-spectral feature fusion transformer network for the classification of hyperspectral image
    Xie, Erxin
    Chen, Na
    Peng, Jiangtao
    Sun, Weiwei
    Du, Qian
    You, Xinge
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (04) : 1308 - 1322
  • [4] Multiscale-Sparse Spatial-Spectral Transformer for Hyperspectral Image Denoising
    Xiao, Zilong
    Qin, Hanlin
    Yang, Shuowen
    Yan, Xiang
    Zhou, Huixin
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [5] GTFN: GCN and Transformer Fusion Network With Spatial-Spectral Features for Hyperspectral Image Classification
    Yang, Aitao
    Li, Min
    Ding, Yao
    Hong, Danfeng
    Lv, Yilong
    He, Yujie
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] GTFN: GCN and Transformer Fusion Network With Spatial-Spectral Features for Hyperspectral Image Classification
    Yang, Aitao
    Li, Min
    Ding, Yao
    Hong, Danfeng
    Lv, Yilong
    He, Yujie
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [7] Spatial-Spectral Transformer for Hyperspectral Image Denoising
    Li, Miaoyu
    Fu, Ying
    Zhang, Yulun
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 1368 - 1376
  • [8] Spatial-Spectral Transformer for Hyperspectral Image Classification
    He, Xin
    Chen, Yushi
    Lin, Zhouhan
    [J]. REMOTE SENSING, 2021, 13 (03) : 1 - 22
  • [9] Two-branch global spatial-spectral fusion transformer network for hyperspectral image classification
    Xie, Erxin
    Chen, Na
    Zhang, Genwei
    Peng, Jiangtao
    Sun, Weiwei
    [J]. PHOTOGRAMMETRIC RECORD, 2024, 39 (186): : 392 - 411
  • [10] HYPERSPECTRAL AND MULTISPECTRAL IMAGE FUSION WITH DUAL-SOURCE SPATIAL-SPECTRAL DICTIONARY
    Tian, Jin
    Zhang, Yifan
    Lu, Yang
    Mei, Shaohui
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 7034 - 7037