Spectral-spatial Attention Residual Networks for Hyperspectral Image Classification

被引：0

作者：

Wang Feifei ^{[1
,3
]}

Zhao Huijie ^{[1
,2
,3
]}

Li Na ^{[1
,2
,3
]}

Li Siyuan ^{[4
]}

Cai Yu ^{[5
]}

机构：

[1] Beihang Univ, Sch Instrumentat & Optoelect Engn, Key Lab Precis Optomech Technol, Minist Educ, Beijing 100191, Peoples R China

[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China

[3] Beihang Univ, Key Lab Minist Ind & Informat Technol, Aerosp Opt Microwave Integrated Precis Intelligen, Beijing 100191, Peoples R China

[4] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol, Xian 710119, Peoples R China

[5] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China

来源：

ACTA PHOTONICA SINICA | 2023年 / 52卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Spectral-spatial feature; Residual network; Hyperspectral image classification; Spectral attention mechanism; Spatial attention mechanism;

D O I：

10.3788/gzxb20235212.1210002

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

Hyperspectral image classification is a research hotspot in the field of hyperspectral image processing and application. Classification models predict the class of each pixel by analyzing the spectral and spatial information of each pixel and compare it to the actual features. In the hyperspectral classification task, the spatial context information of the data can be used to improve the classification accuracy, so this paper uses the powerful learning ability of 3D-CNN to extract effective spectral and spatial features into hyperspectral images, and then fuses the extracted spectra and spatial features to enhance the flow between different levels of the network, thereby improving the classification efficiency. Although CNN operations can mine deeper feature information as the network deepens, CNN is ineffective in modeling long- distance dependencies, so consider combining CNN with attention mechanisms. This combination can focus on the local position of the given information, assign corresponding weights to it, emphasize the key features in the feature map, adjust the global information of the attention statistics image through weight re- annotation, retain the features that are more conducive to the classification task, and improve the representation ability of extracted features. But the common attention mechanism is to calculate the average globally, that is, the pixel values of the entire image block, inevitably introducing information from different categories of pixels around it, which is not needed in classification tasks. Another spectral attention mechanism based on the center pixel provides weight values that ignore the effects of surrounding pixels in the same category. Therefore, a simple spectral attention mechanism in the central region is proposed, in which the central region is selected with the central pixel as the reference and the surrounding 3x3 range as the central region, on the one hand, the range contains certain spectral information of the same category, and on the other hand, the interference of different categories of pixels is reduced as much as possible. The spectral attention mechanism in the central region can minimize the influence of interfering pixels on spectral features while extracting as many effective spectral features as possible. Based on the spectral attention mechanism of the central region, this paper proposes a spectral spatial attention residual network for hyperspectral classification, which mainly includes spectral feature learning, spatial feature learning and classifier. The network first selects appropriately sized image blocks from hyperspectral images and then classifies them. Starting from balancing computing resources and overall accuracy, experimental comparison shows that the size of the image patch is uniformly 13x13. The spectral feature learning part includes 1 frequency spectral attention module and 1 spectral residual network module. The spectral attention module adopts the central spectral attention mechanism, which can effectively suppress redundant bands and increase the weight of important bands. The spectral features after the attention mechanism will be extracted by the spectral residual network module, and more spectral features can be extracted. Convolution kernels of 1x1xn do not affect the spatial structure when extracting spectral features while maintaining spatial correlation. The spatial feature learning component includes 1 spatial attention module and 2 spatial residual network modules. The spatial attention module can obtain the important spatial information of the pixels to be classified, and use the spatial residual network to extract its spatial information. Add a hop connection between each module in the network to connect the presentation layer of the hierarchical features into a continuous residual block to mitigate the loss of accuracy. Finally, these rich spectral and spatial features are sent to the classifier to obtain the final classification result. The proposed algorithm is compared with the latest algorithm on four public datasets. Indicators and visualization results verify the superiority of the proposed algorithm.

引用

页数：19

共 25 条

[1] [方帅 Fang Shuai], 2021, [中国图象图形学报, Journal of Image and Graphics], V26, P1940
[2] Hyperspectral Image Classification Using a Hybrid 3D-2D Convolutional Neural Networks
Ghaderizadeh, Saeed
Abbasi-Moghadam, Dariush
Sharifi, Alireza
Zhao, Na
Tariq, Aqil
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 7570 - 7588
[3] Identity Mappings in Deep Residual Networks
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 630 - 645
[4] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[5] Reducing the dimensionality of data with neural networks
Hinton, G. E.
Salakhutdinov, R. R.
[J]. SCIENCE, 2006, 313 (5786) : 504 - 507
[6] Deep Convolutional Neural Networks for Hyperspectral Image Classification
Hu, Wei
Huang, Yangyu
Wei, Li
Zhang, Fan
Li, Hengchao
[J]. JOURNAL OF SENSORS, 2015, 2015
[7] Densely Connected Convolutional Networks
Huang, Gao
Liu, Zhuang
van der Maaten, Laurens
Weinberger, Kilian Q.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
[8] Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network
Li, Ying
Zhang, Haokui
Shen, Qiang
[J]. REMOTE SENSING, 2017, 9 (01)
[9] Sparse-Adaptive Hypergraph Discriminant Analysis for Hyperspectral Image Classification
Luo, Fulin
Zhang, Liangpei
Zhou, Xiaocheng
Guo, Tan
Cheng, Yanxiang
Yin, Tailang
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (06) : 1082 - 1086
[10] Luo YA, 2018, 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), P464, DOI 10.1109/ICALIP.2018.8455251

← 1 2 3 →