A U-Shaped Convolution-Aided Transformer with Double Attention for Hyperspectral Image Classification

被引:1
|
作者
Qin, Ruiru [1 ]
Wang, Chuanzhi [1 ]
Wu, Yongmei [1 ]
Du, Huafei [1 ]
Lv, Mingyun [1 ]
机构
[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
关键词
convolutional neural networks; transformers; hyperspectral image classification; spectral attention; spatial attention; RESIDUAL NETWORK; NEURAL-NETWORKS; CHANNEL;
D O I
10.3390/rs16020288
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Convolutional neural networks (CNNs) and transformers have achieved great success in hyperspectral image (HSI) classification. However, CNNs are inefficient in establishing long-range dependencies, and transformers may overlook some local information. To overcome these limitations, we propose a U-shaped convolution-aided transformer (UCaT) that incorporates convolutions into a novel transformer architecture to aid classification. The group convolution is employed as parallel local descriptors to extract detailed features, and then the multi-head self-attention recalibrates these features in consistent groups, emphasizing informative features while maintaining the inherent spectral-spatial data structure. Specifically, three components are constructed using particular strategies. First, the spectral groupwise self-attention (spectral-GSA) component is developed for spectral attention, which selectively emphasizes diagnostic spectral features among neighboring bands and reduces the spectral dimension. Then, the spatial dual-scale convolution-aided self-attention (spatial-DCSA) encoder and spatial convolution-aided cross-attention (spatial-CCA) decoder form a U-shaped architecture for per-pixel classifications over HSI patches, where the encoder utilizes a dual-scale strategy to explore information in different scales and the decoder adopts the cross-attention for information fusion. Experimental results on three datasets demonstrate that the proposed UCaT outperforms the competitors. Additionally, a visual explanation of the UCaT is given, showing its ability to build global interactions and capture pixel-level dependencies.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Attention U-shaped network for hyperspectral image classification
    Wang, Ruirui
    Liu, Bing
    Yu, Anzhu
    Wang, Wenjie
    Jiao, Xuejun
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (03)
  • [2] Double Attention Transformer for Hyperspectral Image Classification
    Tang, Ping
    Zhang, Meng
    Liu, Zhihui
    Song, Rong
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [3] Convolution Transformer Mixer for Hyperspectral Image Classification
    Zhang, Junjie
    Meng, Zhe
    Zhao, Feng
    Liu, Hanqiang
    Chang, Zhenhui
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [4] A hybrid convolution transformer for hyperspectral image classification
    Arshad, Tahir
    Zhang, Junping
    Ullah, Inam
    [J]. EUROPEAN JOURNAL OF REMOTE SENSING, 2024,
  • [5] DBCTNet: Double Branch Convolution-Transformer Network for Hyperspectral Image Classification
    Xu, Rui
    Dong, Xue-Mei
    Li, Weijie
    Peng, Jiangtao
    Sun, Weiwei
    Xu, Yi
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [6] Feature Fusion Convolution-Aided Transformer for Automatic Modulation Recognition
    Hu, Mutian
    Ma, Jitong
    Yang, Zhengyan
    Wang, Jie
    Lu, Jingjing
    Wu, Zhanjun
    [J]. IEEE COMMUNICATIONS LETTERS, 2023, 27 (10) : 2643 - 2647
  • [7] Hierarchical Attention Transformer for Hyperspectral Image Classification
    Arshad, Tahir
    Zhang, Junping
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [8] Uformer: A General U-Shaped Transformer for Image Restoration
    Wang, Zhendong
    Cun, Xiaodong
    Bao, Jianmin
    Zhou, Wengang
    Liu, Jianzhuang
    Li, Houqiang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17662 - 17672
  • [9] Double-Branch Network with Pyramidal Convolution and Iterative Attention for Hyperspectral Image Classification
    Shi, Hao
    Cao, Guo
    Ge, Zixian
    Zhang, Youqiang
    Fu, Peng
    [J]. REMOTE SENSING, 2021, 13 (07)
  • [10] Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification
    Zhao, Feng
    Li, Shijie
    Zhang, Junjie
    Liu, Hanqiang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20