Hyperspectral Image Classification Based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms

被引:0
|
作者
Sun, Qian [1 ]
Zhao, Guangrui [1 ]
Xia, Xinyuan [2 ]
Xie, Yu [2 ]
Fang, Chenrong [3 ]
Sun, Le [4 ,5 ]
Wu, Zebin [2 ]
Pan, Chengsheng [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300000, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[5] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
hyperspectral image classification; convolutional neural network; Transformer; attention mechanism; multi-scale features; SPATIAL CLASSIFICATION; NETWORKS;
D O I
10.3390/rs16122185
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Convolutional neural network (CNN)-based and Transformer-based methods for hyperspectral image (HSI) classification have rapidly advanced due to their unique characterization capabilities. However, the fixed kernel sizes in convolutional layers limit the comprehensive utilization of multi-scale features in HSI land cover analysis, while the Transformer's multi-head self-attention (MHSA) mechanism faces challenges in effectively encoding feature information across various dimensions. To tackle this issue, this article introduces an HSI classification method, based on multi-scale convolutional features and multi-attention mechanisms (i.e., MSCF-MAM). Firstly, the model employs a multi-scale convolutional module to capture features across different scales in HSIs. Secondly, to enhance the integration of local and global channel features and establish long-range dependencies, a feature enhancement module based on pyramid squeeze attention (PSA) is employed. Lastly, the model leverages a classical Transformer Encoder (TE) and linear layers to encode and classify the transformed spatial-spectral features. The proposed method is evaluated on three publicly available datasets-Salina Valley (SV), WHU-Hi-HanChuan (HC), and WHU-Hi-HongHu (HH). Extensive experimental results have demonstrated that the MSCF-MAM method outperforms several representative methods in terms of classification performance.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Hyperspectral image classification based on multi-scale hybrid convolutional network
    Yang, Yun
    Zhou, Yao
    Chen, Jia-ning
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (03) : 368 - 377
  • [2] Hyperspectral Image Classification Based on Multi-Scale Residual Network with Attention Mechanism
    Qing, Yuhao
    Liu, Wenyi
    [J]. REMOTE SENSING, 2021, 13 (03) : 1 - 18
  • [3] A New Multi-Scale Convolutional Model Based on Multiple Attention for Image Classification
    Yang, Yadong
    Xu, Chengji
    Dong, Feng
    Wang, Xiaofeng
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [4] Multi-Scale Dilated Convolutional Neural Network for Hyperspectral Image Classification
    Shanshan Zheng
    Wen Liu
    Rui Shan
    Jingyi Zhao
    Guoqian Jiang
    Zhi Zhang
    [J]. Journal of Harbin Institute of Technology(New series), 2021, 28 (04) : 25 - 32
  • [5] MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion
    Liang, Zhiwei
    Zhao, Kui
    Liang, Gang
    Li, Siyu
    Wu, Yifei
    Zhou, Yiping
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [6] Deep Multi-scale Convolutional Neural Network for Hyperspectral Image Classification
    Zhang Feng-zhe
    Yang Xia
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [7] Hierarchical Multi-Scale Convolutional Neural Networks for Hyperspectral Image Classification
    Li, Simin
    Zhu, Xueyu
    Bao, Jie
    [J]. SENSORS, 2019, 19 (07)
  • [8] Res2Net-based multi-scale and multi-attention model for traffic scene image classification
    Gao, Guanghui
    Guo, Yining
    Zhou, Lumei
    Li, Li
    Shi, Gang
    [J]. PLOS ONE, 2024, 19 (05):
  • [9] Multi-scale Multi-attention Network for Moire Document Image Binarization
    Guo, Yanqing
    Ji, Caijuan
    Zheng, Xin
    Wang, Qianyu
    Luo, Xiangyang
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 90
  • [10] Residual networks with multi-attention mechanism for hyperspectral image classification
    Shao Y.
    Lan J.
    Liang Y.
    Hu J.
    [J]. Arabian Journal of Geosciences, 2021, 14 (4)