Semantic interleaving global channel attention for multilabel remote sensing image classification

被引：1

作者：

Liu, Yongkun ^{[1
]}

Ni, Kesong ^{[3
]}

Zhang, Yuhan ^{[4
]}

Zhou, Lijian ^{[2
]}

Zhao, Kun ^{[2
,5
]}

机构：

[1] Sun Yat Sen Univ, Sch Software Engn, Zhuhai, Peoples R China

[2] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China

[3] Dalian Univ Technol, Sch Software, Dalian, Peoples R China

[4] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China

[5] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao 266520, Peoples R China

来源：

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2024年 / 45卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Remote sensing; multilabel classification; gnn; channel attention; label relation; LEARNING APPROACH; NEURAL-NETWORK; LAND-COVER; RETRIEVAL; FRAMEWORK;

D O I：

10.1080/01431161.2023.2297175

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Multilabel remote sensing image classification (MLRSIC) has received increasing research interest. Taking the co-occurrence relationship of multiple labels as additional information helps to improve the overall performance. However, current methods only focus on using it to constrain the final feature which is output from a convolutional neural network (CNN). On the one hand, these methods need to exploit the potential of label correlation in feature representation fully. On the other hand, they increase the label noise sensitivity of the system, resulting in poor robustness. In this paper, a novel method called 'Semantic Interleaving Global chaNnel Attention' (SIGNA) is proposed for MLRSIC. First, the label co-occurrence graph is obtained according to the statistical information of the training set and fed into a graph neural network (GNN) to generate optimal semantic feature representations of each label. Next, the semantic features are interleaved with visual features which are extracted by CNNs to guide the overall features of the input image transform from the original feature space to the semantic feature space with embedded label relations. Then, global attention triggered by semantic interleaving is used to emphasize visual features in important channels. Finally, to make SIGNA easier to use and more optimized, multihead SIGNA-based feature adaptive weighting networks are proposed as plug-in blocks to plug into any layers of a CNN. For remote sensing images, better classification performance can be achieved by inserting the plug-in blocks into the shallow layers of CNNs. We conducted extensive experimental comparisons on three data sets: UCM, AID and DFC15. Experimental results demonstrate that the proposed SIGNA achieves superior classification performance compared to state-of-the-art (SOTA) methods. Notes that the codes of this paper will be open to the community for reproducibility research.

引用

页码：393 / 419

页数：27

共 50 条

[31] Transformer-Driven Semantic Relation Inference for Multilabel Classification of High-Resolution Remote Sensing Images
Tan, Xiaowei
Xiao, Zhifeng
Zhu, Jianjun
Wan, Qiao
Wang, Kai
Li, Deren
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 1884 - 1901
[32] Exploring Hybrid Contrastive Learning and Scene-to-Label Information for Multilabel Remote Sensing Image Classification
Song, Tiecheng
Bai, Shufen
Yang, Feng
Gao, Chenqiang
Chen, Haonan
Li, Jun
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[33] MCAFNet: A Multiscale Channel Attention Fusion Network for Semantic Segmentation of Remote Sensing Images
Yuan, Min
Ren, Dingbang
Feng, Qisheng
Wang, Zhaobin
Dong, Yongkang
Lu, Fuxiang
Wu, Xiaolin
REMOTE SENSING, 2023, 15 (02)
[34] Semantic Segmentation of Remote Sensing Data Based on Channel Attention and Feature Information Entropy
Duan, Sining
Zhao, Jingyi
Huang, Xinyi
Zhao, Shuhe
SENSORS, 2024, 24 (04)
[35] Semantic segmentation of remote sensing images based on dual-channel attention mechanism
Jiang, Jionghui
Feng, Xi'an
Huang, Hui
IET IMAGE PROCESSING, 2024, 18 (09) : 2346 - 2356
[36] Multi-Label Remote Sensing Image Classification with Latent Semantic Dependencies
Ji, Junchao
Jing, Weipeng
Chen, Guangsheng
Lin, Jingbo
Song, Houbing
REMOTE SENSING, 2020, 12 (07)
[37] Unsupervised Cross-View Semantic Transfer for Remote Sensing Image Classification
Sun, Hao
Liu, Shuai
Zhou, Shilin
Zou, Huanxin
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (01) : 13 - 17
[38] Contextual Spatial-Channel Attention Network for Remote Sensing Scene Classification
Hou, Yan-e
Yang, Kang
Dang, Lanxue
Liu, Yang
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[39] Spatial global context information network for semantic segmentation of remote sensing image
Wu Z.-K.
Zhao S.
Li H.-W.
Jiang Y.-R.
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (04): : 795 - 802
[40] Classification from Sky: A Robust Remote Sensing Time Series Image Classification Using Spatial Encoder and Multi -Fast Channel Attention
Sarpong, Kwabena
Jackson, Jehoiada Kofi
Effah, Derrick
Addo, Daniel
Yussif, Sophyani Banaamwini
Awrangjeb, Mohammad
Patamia, Rutherford Agbeshi
Danso, Juliana Mantebea
Qin, Zhiguang
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 10405 - 10422

← 1 2 3 4 5 →