With the continuous evolution of remote sensing technology, the range of available data sources has expanded, and effectively exploiting complementary information from multiple sources for improved land surface observation has become an intriguing and challenging problem. However, the complexity of urban areas and their surrounding structures makes it extremely difficult to capture correlations between features. This article proposes a novel multiscale attention feature fusion network, composed of hierarchical convolutional neural networks and a transformer, to enhance the joint classification accuracy of hyperspectral image (HSI) and light detection and ranging (LiDAR) data. First, a multiscale fusion Swin transformer module is employed to mitigate information loss during feature propagation; it explores deep spatial-spectral features of the HSI while extracting height information from the LiDAR data. This structure inherits the advantages of the Swin transformer: it achieves nonlocal receptive field fusion by progressively expanding the window's receptive field layer by layer while preserving the spatial structure of the image, and it exhibits excellent robustness against spatial misalignment. Second, for the dual hyperspectral and LiDAR branches, a dual-source feature interactor is designed, which establishes a dynamic attention mechanism between the two branches to capture correlated information across the modalities and fuse it into a unified feature representation. The efficacy of the proposed approach is validated on three standard datasets (Houston2013, Trento, and MUUFL). The classification results indicate that, by fully utilizing spatial context information and effectively integrating feature information, the proposed framework significantly outperforms state-of-the-art classification methods.
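As a rough illustration of the hierarchical windowed-attention idea behind the multiscale fusion Swin transformer module, the PyTorch sketch below applies self-attention within local windows and then merges 2x2 patches, so each window at the next stage covers a larger image region. This is a minimal sketch of the general Swin-style mechanism, not the paper's actual implementation; every module name, dimension, and the single-stage structure are illustrative assumptions.

```python
import torch
import torch.nn as nn

def window_partition(x, ws):
    # Split a (B, H, W, C) feature map into non-overlapping ws x ws windows,
    # flattened to token sequences of shape (B * num_windows, ws * ws, C).
    B, H, W, C = x.shape
    x = x.view(B, H // ws, ws, W // ws, ws, C)
    return x.permute(0, 1, 3, 2, 4, 5).reshape(-1, ws * ws, C)

class WindowAttentionStage(nn.Module):
    # One hierarchical stage (hypothetical simplification): self-attention
    # inside local windows, a residual connection, then 2x2 patch merging so
    # the next stage's windows see a progressively larger receptive field.
    def __init__(self, dim, window_size=4, num_heads=4):
        super().__init__()
        self.ws = window_size
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.merge = nn.Linear(4 * dim, 2 * dim)  # halve H and W, double C

    def forward(self, x):                          # x: (B, H, W, C)
        B, H, W, C = x.shape
        win = window_partition(self.norm(x), self.ws)
        out, _ = self.attn(win, win, win)          # attention within each window
        out = out.view(B, H // self.ws, W // self.ws, self.ws, self.ws, C)
        out = out.permute(0, 1, 3, 2, 4, 5).reshape(B, H, W, C) + x
        # Patch merging: concatenate each 2x2 neighborhood, project to 2*C.
        out = out.view(B, H // 2, 2, W // 2, 2, C).permute(0, 1, 3, 2, 4, 5)
        return self.merge(out.reshape(B, H // 2, W // 2, 4 * C))

# Toy usage: an 8x8 map of 32-dim tokens (e.g., embedded HSI patches).
x = torch.randn(2, 8, 8, 32)
y = WindowAttentionStage(dim=32)(x)
print(y.shape)  # torch.Size([2, 4, 4, 64])
```

Stacking such stages yields the hierarchy the abstract describes: attention stays local (and cheap) at each level, while patch merging lets later levels aggregate increasingly nonlocal context without discarding spatial layout.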
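Similarly, the dual-source feature interactor can be pictured as bidirectional cross-attention between the HSI and LiDAR token streams, followed by a projection that fuses the two enhanced streams into one representation. The sketch below is one plausible reading of that description under stated assumptions; all names, dimensions, and layer choices are hypothetical rather than the authors' design.

```python
import torch
import torch.nn as nn

class DualSourceFeatureInteractor(nn.Module):
    # Hypothetical interactor: each modality queries the other via
    # cross-attention, residual connections and LayerNorm stabilize the
    # exchange, and a linear layer fuses the two streams into one.
    def __init__(self, dim=64, num_heads=4):
        super().__init__()
        self.hsi_from_lidar = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.lidar_from_hsi = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm_hsi = nn.LayerNorm(dim)
        self.norm_lidar = nn.LayerNorm(dim)
        self.fuse = nn.Sequential(nn.Linear(2 * dim, dim), nn.GELU())

    def forward(self, hsi_tokens, lidar_tokens):   # both: (B, N, dim)
        # HSI queries attend to LiDAR keys/values, and vice versa.
        hsi_enh, _ = self.hsi_from_lidar(hsi_tokens, lidar_tokens, lidar_tokens)
        lidar_enh, _ = self.lidar_from_hsi(lidar_tokens, hsi_tokens, hsi_tokens)
        hsi_out = self.norm_hsi(hsi_tokens + hsi_enh)
        lidar_out = self.norm_lidar(lidar_tokens + lidar_enh)
        # Concatenate and project into a unified feature representation.
        return self.fuse(torch.cat([hsi_out, lidar_out], dim=-1))

# Toy usage: a 7x7 patch per modality, flattened to 49 tokens of width 64.
hsi = torch.randn(8, 49, 64)    # spectral branch features
lidar = torch.randn(8, 49, 64)  # elevation branch features
print(DualSourceFeatureInteractor()(hsi, lidar).shape)  # torch.Size([8, 49, 64])
```

The attention weights here are recomputed for every input pair, which is one natural way to realize the "dynamic" attention the abstract mentions: the strength of cross-modal exchange adapts to how correlated the spectral and elevation features actually are at each location.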