Hierarchical Unified Spectral-Spatial Aggregated Transformer for Hyperspectral Image Classification

被引:2
|
作者
Zhou, Weilian [1 ]
Kamata, Sei-Ichiro [1 ]
Luo, Zhengbo [1 ]
Chen, Xiaoyue [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Japan
关键词
D O I
10.1109/ICPR56361.2022.9956396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision Transformer (ViT) has recently been introduced into the computer vision (CV) field with its self-attention mechanism and gotten remarkable performance. However, simply applying ViT for hyperspectral image (HSI) classification is not applicable due to 1) ViT is a spatial-only self-attention model, but rich spectral information exists in HSI; 2) ViT needs sufficient training samples, but HSI suffers from limited samples; 3) ViT does not well learn local features; 4) multi-scale features for ViT are not considered. Furthermore, the methods which combine convolutional neural network (CNN) and ViT generally suffer from a large computational burden. Hence, this paper tends to design a suitable pure ViT based model for HSI classification as the following points: 1) spectral-only vision transformer with all tokens' aggregation; 2) spatial-only local-global transformer; 3) cross-scale local-global feature fusion, and 4) a cooperative loss function to unify the spectral and spatial features. As a result, the proposed idea achieves competitive classification performance on three public datasets than other state-of-the-art methods.
引用
收藏
页码:3041 / 3047
页数:7
相关论文
共 50 条
  • [21] Spectral-Spatial Response for Hyperspectral Image Classification
    Wei, Yantao
    Zhou, Yicong
    Li, Hong
    REMOTE SENSING, 2017, 9 (03):
  • [22] S2IT: Spectral-Spatial Interactive Transformer for Hyperspectral Image Classification
    Wang, Minhui
    Sun, Yaxiu
    Xiang, Jianhong
    Zhong, Yu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [23] Masked Auto-Encoding Spectral-Spatial Transformer for Hyperspectral Image Classification
    Ibanez, Damian
    Fernandez-Beltran, Ruben
    Pla, Filiberto
    Yokoya, Naoto
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [24] MASSFormer: Memory-Augmented Spectral-Spatial Transformer for Hyperspectral Image Classification
    Sun, Le
    Zhang, Hang
    Zheng, Yuhui
    Wu, Zebin
    Ye, Zhonglin
    Zhao, Haixing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [25] Spectral-Spatial Constraint Hyperspectral Image Classification
    Ji, Rongrong
    Gao, Yue
    Hong, Richang
    Liu, Qiong
    Tao, Dacheng
    Li, Xuelong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (03): : 1811 - 1824
  • [26] Spectral-Spatial Mamba for Hyperspectral Image Classification
    Huang, Lingbo
    Chen, Yushi
    He, Xin
    REMOTE SENSING, 2024, 16 (13)
  • [27] HYPERSPECTRAL IMAGE CLASSIFICATION USING HIERARCHICAL SPATIAL-SPECTRAL TRANSFORMER
    Song, Chao
    Mei, Shaohui
    Ma, Mingyang
    Xu, Fulin
    Zhang, Yifan
    Du, Qian
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3584 - 3587
  • [28] Pyramid Hierarchical Spatial-Spectral Transformer for Hyperspectral Image Classification
    Ahmad, Muhammad
    Butt, Muhammad Hassaan Farooq
    Mazzara, Manuel
    Distefano, Salvatore
    Khan, Adil Mehmood
    Altuwaijri, Hamad Ahmed
    IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 17 : 17681 - 17689
  • [29] Spectral-Spatial Transformer Network for Hyperspectral Image Classification: A Factorized Architecture Search Framework
    Zhong, Zilong
    Li, Ying
    Ma, Lingfei
    Li, Jonathan
    Zheng, Wei-Shi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [30] MSTSENet: Multiscale Spectral-Spatial Transformer with Squeeze and Excitation network for hyperspectral image classification
    Ahmad, Irfan
    Farooque, Ghulam
    Liu, Qichao
    Hadi, Fazal
    Xiao, Liang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 134