Joint Classification of Hyperspectral and LiDAR Data Using Hierarchical Multimodal Feature Aggregation-Based Multihead Axial Attention Transformer

被引:0
|
作者
Zhu, Fei [1 ]
Shi, Cuiping [2 ]
Shi, Kaijie [1 ]
Wang, Liguo [3 ]
机构
[1] Qiqihar Univ, Dept Commun Engn, Qiqihar 161000, Peoples R China
[2] Huzhou Univ, Coll Informat Engn, Huzhou 313000, Peoples R China
[3] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian 116000, Peoples R China
基金
中国国家自然科学基金;
关键词
Axial attention; convolutional neural networks (CNNs); feature aggregation; hyperspectral; light detection and ranging (LiDAR); multimodal; transformer; REMOTE-SENSING DATA; EXTINCTION PROFILES; FUSION;
D O I
10.1109/TGRS.2025.3533475
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The rapid development of sensor and multimodal technology has provided more possibilities for multisource remote sensing image classification. However, some existing joint classification methods are limited to single-level feature fusion and fail to fully explore the deep correlation between cross-level features, thus limiting the effective interaction and complementarity of information between different modal data. To alleviate this issue, this article proposes a hierarchical multimodal feature aggregation-based multihead axial attention transformer (HMAT) for joint classification of hyperspectral and light detection and ranging (LiDAR) data. First, a hierarchical multimodal feature aggregation module (HMFA) is proposed to more effectively fuse spatial-spectral features of hyperspectral images (HSIs) and elevation features of LiDAR data and generate more discriminative low-dimensional feature representations. Second, a pyramid-inverted pyramid convolution module (PIP) is designed. Through the complementary feature extraction structure, PIP can more fully capture the multiscale local features in the fused feature map of hyperspectral and LiDAR data. Finally, a multihead axial attention (MHAA) component is constructed to capture information at different scales in the fused feature maps, thereby accurately modeling global dependencies. The proposed HMAT has been extensively tested on three publicly available datasets. The experimental results demonstrate that the classification performance of the proposed method outperforms that of several state-of-the-art methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Joint Classification of Hyperspectral and LiDAR Data Using a Hierarchical CNN and Transformer
    Zhao, Guangrui
    Ye, Qiaolin
    Sun, Le
    Wu, Zebin
    Pan, Chengsheng
    Jeon, Byeungwoo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [2] Local Semantic Feature Aggregation-Based Transformer for Hyperspectral Image Classification
    Tu, Bing
    Liao, Xiaolong
    Li, Qianming
    Peng, Yishu
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] Multiscale Attention Feature Fusion Based on Improved Transformer for Hyperspectral Image and LiDAR Data Classification
    Wang, Aili
    Lei, Guilong
    Dai, Shiyu
    Wu, Haibin
    Iwahori, Yuji
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 4124 - 4140
  • [4] Deep Hierarchical Vision Transformer for Hyperspectral and LiDAR Data Classification
    Xue, Zhixiang
    Tan, Xiong
    Yu, Xuchu
    Liu, Bing
    Yu, Anzhu
    Zhang, Pengqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3095 - 3110
  • [5] A novel graph-attention based multimodal fusion network for joint classification of hyperspectral image and LiDAR data
    Cai, Jianghui
    Zhang, Min
    Yang, Haifeng
    He, Yanting
    Yang, Yuqing
    Shi, Chenhui
    Zhao, Xujun
    Xun, Yaling
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [6] Hyperspectral and LiDAR Data Classification Using Joint CNNs and Morphological Feature Learning
    Roy, Swalpa Kumar
    Deria, Ankur
    Hong, Danfeng
    Ahmad, Muhammad
    Plaza, Antonio
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] Interactive Enhanced Network Based on Multihead Self-Attention and Graph Convolution for Classification of Hyperspectral and LiDAR Data
    Gao, Hongmin
    Feng, Hao
    Zhang, Yiyan
    Fei, Shuyu
    Shen, Runhua
    Xu, Shufang
    Zhang, Bing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [8] Multi-Feature Cross Attention-Induced Transformer Network for Hyperspectral and LiDAR Data Classification
    Li, Zirui
    Liu, Runbang
    Sun, Le
    Zheng, Yuhui
    REMOTE SENSING, 2024, 16 (15)
  • [9] Attention Fusion of Transformer-Based and Scale-Based Method for Hyperspectral and LiDAR Joint Classification
    Zhang, Maqun
    Gao, Feng
    Zhang, Tiange
    Gan, Yanhai
    Dong, Junyu
    Yu, Hui
    REMOTE SENSING, 2023, 15 (03)
  • [10] Joint Classification of Hyperspectral and LiDAR Data Using Binary-Tree Transformer Network
    Song, Huacui
    Yang, Yuanwei
    Gao, Xianjun
    Zhang, Maqun
    Li, Shaohua
    Liu, Bo
    Wang, Yanjun
    Kou, Yuan
    REMOTE SENSING, 2023, 15 (11)