Hyperspectral Image Classification Based on Multi-stage Vision Transformer with Stacked Samples

被引:11
|
作者
Chen, Xiaoyue [1 ]
Kamata, Sei-Ichiro [1 ]
Zhou, Weilian [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka, Japan
关键词
Hyperspectral image classification; image processing; deep learning; Vision Transformer; data augmentation;
D O I
10.1109/TENCON54134.2021.9707289
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hyperspectral image classification (HSIC) is a task assigning the correct label to each pixel. It is a hot topic in the remote sensing field, which has been processed in several deep learning methods. Recently, there are some works that apply Vision Transformer (ViT) methods to the HSIC task, but the performance is not as good as some CNN-structured methods, considering that Vision Transformer uses attention to capture global information but ignores local characteristics. In this paper, a multi-stage Vision Transformer model referring to the feature extraction structure of CNN is proposed, and the result shows the realizability and reliability. Besides, experiments show that the modified ViT structure needs more samples for training. An innovative data augmentation method is used to generate extended samples with virtual yet reliable labels. The generated samples are combined with the original ones as the stacked samples, which are used for the following feature extraction process. Experiments explain the optimization of the multi-stage Vision Transformer structure with stacked samples in the accuracy term compared with other methods.
引用
收藏
页码:441 / 446
页数:6
相关论文
共 50 条
  • [1] Vision Transformer-Based Ensemble Learning for Hyperspectral Image Classification
    Liu, Jun
    Guo, Haoran
    He, Yile
    Li, Huali
    [J]. REMOTE SENSING, 2023, 15 (21)
  • [2] Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification
    Zhou, Jingyi
    Sheng, Jiamu
    Ye, Peng
    Fan, Jiayuan
    He, Tong
    Wang, Bin
    Chen, Tao
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
  • [3] Vision Transformer With Contrastive Learning for Hyperspectral Image Classification
    Zhou, Heng
    Zhang, Xin
    Zhang, Chunlei
    Ma, Qiaoyu
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [4] CSiT: A Multiscale Vision Transformer for Hyperspectral Image Classification
    He, Wenxuan
    Huang, Weiliang
    Liao, Shuhong
    Xu, Zhen
    Yan, Jingwen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 9266 - 9277
  • [5] HYBRID VISION TRANSFORMER MODEL FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Yang, Jiaqi
    Du, Bo
    Wu, Chen
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1388 - 1391
  • [6] CSiT: A Multiscale Vision Transformer for Hyperspectral Image Classification
    He, Wenxuan
    Huang, Weiliang
    Liao, Shuhong
    Xu, Zhen
    Yan, Jingwen
    [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022, 15 : 9266 - 9277
  • [7] Multi-granularity vision transformer via semantic token for hyperspectral image classification
    Li, Bin
    Ouyang, Er
    Hu, Wenjing
    Zhang, Guoyun
    Zhao, Lin
    Wu, Jianhui
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (17) : 6538 - 6560
  • [8] A Multi-Stage Transformer Network for Image Dehazing Based on Contrastive Learning
    Gao F.
    Ji S.
    Guo J.
    Hou J.
    Ouyang C.
    Yang B.
    [J]. Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2023, 57 (01): : 195 - 210
  • [9] Multi-Stage Convolutional Broad Learning with Block Diagonal Constraint for Hyperspectral Image Classification
    Kong, Yi
    Wang, Xuesong
    Cheng, Yuhu
    Chen, C. L. Philip
    [J]. REMOTE SENSING, 2021, 13 (17)
  • [10] PolSAR image runways detection based on multi-stage classification
    [J]. Han, P. (hanpingcauc@163.com), 1600, Chinese Institute of Electronics (36):