Semantic interleaving global channel attention for multilabel remote sensing image classification

Cited by: 1
Authors
Liu, Yongkun [1]
Ni, Kesong [3]
Zhang, Yuhan [4]
Zhou, Lijian [2]
Zhao, Kun [2, 5]
Affiliations
[1] Sun Yat Sen Univ, Sch Software Engn, Zhuhai, Peoples R China
[2] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China
[3] Dalian Univ Technol, Sch Software, Dalian, Peoples R China
[4] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[5] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao 266520, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Remote sensing; multilabel classification; GNN; channel attention; label relation; LEARNING APPROACH; NEURAL-NETWORK; LAND-COVER; RETRIEVAL; FRAMEWORK
DOI
10.1080/01431161.2023.2297175
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract
Multilabel remote sensing image classification (MLRSIC) has received increasing research interest. Using the co-occurrence relationships among labels as additional information helps improve overall performance. However, current methods use this information only to constrain the final feature output by a convolutional neural network (CNN). On the one hand, these methods do not fully exploit the potential of label correlation in feature representation. On the other hand, they increase the system's sensitivity to label noise, resulting in poor robustness. In this paper, a novel method called 'Semantic Interleaving Global chaNnel Attention' (SIGNA) is proposed for MLRSIC. First, a label co-occurrence graph is built from the statistics of the training set and fed into a graph neural network (GNN) to generate optimal semantic feature representations of each label. Next, the semantic features are interleaved with visual features extracted by CNNs, guiding the transformation of the input image's overall features from the original feature space to a semantic feature space with embedded label relations. Then, global attention triggered by semantic interleaving is used to emphasize visual features in important channels. Finally, to make SIGNA easier to use and better optimized, multihead SIGNA-based feature adaptive weighting networks are proposed as plug-in blocks that can be inserted into any layer of a CNN. For remote sensing images, better classification performance can be achieved by inserting the plug-in blocks into the shallow layers of CNNs. We conducted extensive experimental comparisons on three data sets: UCM, AID and DFC15. Experimental results demonstrate that the proposed SIGNA achieves superior classification performance compared to state-of-the-art (SOTA) methods. Note that the code for this paper will be made available to the community for reproducibility research.
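The first and third steps of the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`cooccurrence_graph`, `channel_gate`, `reweight`), the conditional-probability edge weights, and the sigmoid gate over an elementwise product are all assumptions chosen for illustration, and the GNN that would produce the per-channel semantic scores is omitted.

```python
import math


def cooccurrence_graph(labels):
    """Build a label co-occurrence graph from training-set statistics.

    labels: list of multi-hot rows, one per training image.
    Returns A where A[i][j] estimates P(label j present | label i present).
    The conditional-probability weighting is an assumption of this sketch.
    """
    n = len(labels[0])
    counts = [[0] * n for _ in range(n)]
    for row in labels:
        present = [k for k, v in enumerate(row) if v]
        for i in present:
            for j in present:
                counts[i][j] += 1
    graph = [[0.0] * n for _ in range(n)]
    for i in range(n):
        occurrences = counts[i][i]  # number of images containing label i
        for j in range(n):
            if occurrences and i != j:
                graph[i][j] = counts[i][j] / occurrences
    return graph


def channel_gate(pooled, semantic):
    """Combine visual and semantic information into per-channel weights.

    pooled: global-average-pooled visual feature per channel (length C).
    semantic: hypothetical per-channel semantic score (length C), standing in
        for a projection of the GNN label embeddings.
    Returns one sigmoid weight in (0, 1) per channel.
    """
    return [1.0 / (1.0 + math.exp(-(p * s))) for p, s in zip(pooled, semantic)]


def reweight(feature_map, weights):
    """Scale each channel (a flat list of activations) by its weight."""
    return [[w * x for x in channel] for channel, w in zip(feature_map, weights)]
```

For example, `cooccurrence_graph([[1, 1, 0], [1, 0, 0], [0, 1, 1], [1, 1, 0]])` gives an edge weight of 2/3 from label 0 to label 1, because label 1 appears in two of the three images that contain label 0.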
Pages: 393-419
Page count: 27
Related papers
50 in total
  • [31] Transformer-Driven Semantic Relation Inference for Multilabel Classification of High-Resolution Remote Sensing Images
    Tan, Xiaowei
    Xiao, Zhifeng
    Zhu, Jianjun
    Wan, Qiao
    Wang, Kai
    Li, Deren
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 1884 - 1901
  • [32] Exploring Hybrid Contrastive Learning and Scene-to-Label Information for Multilabel Remote Sensing Image Classification
    Song, Tiecheng
    Bai, Shufen
    Yang, Feng
    Gao, Chenqiang
    Chen, Haonan
    Li, Jun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [33] MCAFNet: A Multiscale Channel Attention Fusion Network for Semantic Segmentation of Remote Sensing Images
    Yuan, Min
    Ren, Dingbang
    Feng, Qisheng
    Wang, Zhaobin
    Dong, Yongkang
    Lu, Fuxiang
    Wu, Xiaolin
    REMOTE SENSING, 2023, 15 (02)
  • [34] Semantic Segmentation of Remote Sensing Data Based on Channel Attention and Feature Information Entropy
    Duan, Sining
    Zhao, Jingyi
    Huang, Xinyi
    Zhao, Shuhe
    SENSORS, 2024, 24 (04)
  • [35] Semantic segmentation of remote sensing images based on dual-channel attention mechanism
    Jiang, Jionghui
    Feng, Xi'an
    Huang, Hui
    IET IMAGE PROCESSING, 2024, 18 (09) : 2346 - 2356
  • [36] Multi-Label Remote Sensing Image Classification with Latent Semantic Dependencies
    Ji, Junchao
    Jing, Weipeng
    Chen, Guangsheng
    Lin, Jingbo
    Song, Houbing
    REMOTE SENSING, 2020, 12 (07)
  • [37] Unsupervised Cross-View Semantic Transfer for Remote Sensing Image Classification
    Sun, Hao
    Liu, Shuai
    Zhou, Shilin
    Zou, Huanxin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (01) : 13 - 17
  • [38] Contextual Spatial-Channel Attention Network for Remote Sensing Scene Classification
    Hou, Yan-e
    Yang, Kang
    Dang, Lanxue
    Liu, Yang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [39] Spatial global context information network for semantic segmentation of remote sensing image
    Wu Z.-K.
    Zhao S.
    Li H.-W.
    Jiang Y.-R.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (04) : 795 - 802
  • [40] Classification from Sky: A Robust Remote Sensing Time Series Image Classification Using Spatial Encoder and Multi-Fast Channel Attention
    Sarpong, Kwabena
    Jackson, Jehoiada Kofi
    Effah, Derrick
    Addo, Daniel
    Yussif, Sophyani Banaamwini
    Awrangjeb, Mohammad
    Patamia, Rutherford Agbeshi
    Danso, Juliana Mantebea
    Qin, Zhiguang
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 10405 - 10422