Semantic interleaving global channel attention for multilabel remote sensing image classification

被引:1
|
作者
Liu, Yongkun [1 ]
Ni, Kesong [3 ]
Zhang, Yuhan [4 ]
Zhou, Lijian [2 ]
Zhao, Kun [2 ,5 ]
机构
[1] Sun Yat Sen Univ, Sch Software Engn, Zhuhai, Peoples R China
[2] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China
[3] Dalian Univ Technol, Sch Software, Dalian, Peoples R China
[4] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[5] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao 266520, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; multilabel classification; gnn; channel attention; label relation; LEARNING APPROACH; NEURAL-NETWORK; LAND-COVER; RETRIEVAL; FRAMEWORK;
D O I
10.1080/01431161.2023.2297175
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Multilabel remote sensing image classification (MLRSIC) has received increasing research interest. Taking the co-occurrence relationship of multiple labels as additional information helps to improve the overall performance. However, current methods only focus on using it to constrain the final feature which is output from a convolutional neural network (CNN). On the one hand, these methods need to exploit the potential of label correlation in feature representation fully. On the other hand, they increase the label noise sensitivity of the system, resulting in poor robustness. In this paper, a novel method called 'Semantic Interleaving Global chaNnel Attention' (SIGNA) is proposed for MLRSIC. First, the label co-occurrence graph is obtained according to the statistical information of the training set and fed into a graph neural network (GNN) to generate optimal semantic feature representations of each label. Next, the semantic features are interleaved with visual features which are extracted by CNNs to guide the overall features of the input image transform from the original feature space to the semantic feature space with embedded label relations. Then, global attention triggered by semantic interleaving is used to emphasize visual features in important channels. Finally, to make SIGNA easier to use and more optimized, multihead SIGNA-based feature adaptive weighting networks are proposed as plug-in blocks to plug into any layers of a CNN. For remote sensing images, better classification performance can be achieved by inserting the plug-in blocks into the shallow layers of CNNs. We conducted extensive experimental comparisons on three data sets: UCM, AID and DFC15. Experimental results demonstrate that the proposed SIGNA achieves superior classification performance compared to state-of-the-art (SOTA) methods. Notes that the codes of this paper will be open to the community for reproducibility research.
引用
收藏
页码:393 / 419
页数:27
相关论文
共 50 条
  • [1] Multilabel Remote Sensing Image Classification with Capsule Networks
    Topcu, Mucahit
    Dede, Abdulkadir
    Eken, Suleyman
    Sayar, Ahmet
    [J]. 2ND INTERNATIONAL CONGRESS ON HUMAN-COMPUTER INTERACTION, OPTIMIZATION AND ROBOTIC APPLICATIONS (HORA 2020), 2020, : 316 - 318
  • [2] Exploring Transformer and Multilabel Classification for Remote Sensing Image Captioning
    Kandala, Hitesh
    Saha, Sudipan
    Banerjee, Biplab
    Zhu, Xiao Xiang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [3] Multilabel Remote Sensing Image Annotation With Multiscale Attention and Label Correlation
    Huang, Rui
    Zheng, Fengcai
    Huang, Wei
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 6951 - 6961
  • [4] Global Context-Based Multilevel Feature Fusion Networks for Multilabel Remote Sensing Image Scene Classification
    Wang, Xin
    Duan, Lin
    Ning, Chen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 11179 - 11196
  • [5] Channel-Attention-Based DenseNet Network for Remote Sensing Image Scene Classification
    Tong, Wei
    Chen, Weitao
    Han, Wei
    Li, Xianju
    Wang, Lizhe
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 4121 - 4132
  • [6] Remote Sensing Image Scene Classification Based on Global Self-Attention Module
    Li, Qingwen
    Yan, Dongmei
    Wu, Wanrong
    [J]. REMOTE SENSING, 2021, 13 (22)
  • [7] Toward Multilabel Image Retrieval for Remote Sensing
    Imbriaco, Raffaele
    Sebastian, Clint
    Bondarev, Egor
    de With, Peter H. N.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] Recurrent Attention and Semantic Gate for Remote Sensing Image Captioning
    Li, Yunpeng
    Zhang, Xiangrong
    Gu, Jing
    Li, Chen
    Wang, Xin
    Tang, Xu
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [9] Multitask Fine-Grained Feature Mining for Multilabel Remote Sensing Image Classification
    Guo, Jie
    Sun, Hao
    Han, Jinheng
    Song, Bin
    Chi, Yuhao
    Song, Bingxi
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [10] Transformer based on channel-spatial attention for accurate classification of scenes in remote sensing image
    Guo, Jingxia
    Jia, Nan
    Bai, Jinniu
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)