Cross-Attention-Driven Adaptive Graph Relational Network for Multilabel Remote Sensing Scene Classification

被引:0
|
作者
Bi, Haixia [1 ]
Chang, Honghao [1 ]
Wang, Xiaotian [2 ]
Hong, Danfeng [3 ,4 ]
机构
[1] Xi'an Jiaotong University, School of Information and Communications Engineering, Xi'an,710049, China
[2] Northwestern Polytechnical University, Unmanned System Research Institute, Xi'an,710072, China
[3] Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing,100094, China
[4] University of Chinese Academy of Sciences, School of Electronic, Electrical and Communication Engineering, Beijing,100049, China
基金
中国国家自然科学基金;
关键词
Feature extraction - Generative adversarial networks - Graph embeddings - Graph neural networks - Graphic methods - Labeled data - Network theory (graphs) - Remote sensing;
D O I
10.1109/TGRS.2024.3476089
中图分类号
学科分类号
摘要
Multilabel remote sensing scene classification (MLRSSC) has garnered growing attention in recent years, owing to its more comprehensive description of land covers compared to its single-label counterpart. However, challenges arise inevitably. First, the relations among multiple scene labels are sophisticated. How to excavate the interclass dependencies is, therefore, a key challenge for the MLRSSC task. Second, extracting discriminative semantic features is essential, yet challenging for scene prediction of remote sensing images. Another issue is that the multilabel dataset usually shows twofold sample imbalances, that is, class imbalance and positive-negative imbalance, which have not been explored in MLRSSC tasks so far. To overcome the above hurdles, we put forward a cross-attention-driven adaptive graph relational network for the MLRSSC task. Different from the chain-like long short-term memory (LSTM) or static label co-occurrence matrices, we propose to use image-specific relational graphs to dynamically model the interclass dependencies. We innovatively devise a cross-attention-driven representation learning approach, which uses learnable label embeddings to query the class-wise semantic features, explicitly establishing the feature-label connections. Moreover, we design a balanced focal loss (BFL) function, where the loss contributions of positive and negative samples are rebalanced based on the respective imbalance degrees of diverse classes. Extensive experiments were performed on UCM, AID, and DFC15 multilabel datasets. Experimental results demonstrated that our proposed method achieves state-of-the-art performance in the studied task. © 1980-2012 IEEE.
引用
收藏
相关论文
共 50 条
  • [31] Remote Sensing Scene Classification via Multi-Branch Local Attention Network
    Chen, Si-Bao
    Wei, Qing-Song
    Wang, Wen-Zhong
    Tang, Jin
    Luo, Bin
    Wang, Zu-Yuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 99 - 109
  • [32] Wavelet Attention ResNeXt Network for High-resolution Remote Sensing Scene Classification
    Song, Wanying
    Cong, Yifan
    Zhang, Yingying
    Zhang, Shiru
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 330 - 333
  • [33] Gradient-Guided Multiscale Focal Attention Network for Remote Sensing Scene Classification
    Zhao, Yue
    Gong, Maoguo
    Qin, A. K.
    Zhang, Mingyang
    Hu, Zhuping
    Gao, Tianqi
    Pu, Yan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [34] CAW: A Remote-Sensing Scene Classification Network Aided by Local Window Attention
    Wang, Wei
    Wen, Xiaowei
    Wang, Xin
    Tang, Chen
    Deng, Jiwei
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [35] Scene Classification of Remote Sensing Images Based on Saliency Dual Attention Residual Network
    Guo, Dongen
    Xia, Ying
    Luo, Xiaobo
    IEEE ACCESS, 2020, 8 : 6344 - 6357
  • [36] Channel-Attention-Based DenseNet Network for Remote Sensing Image Scene Classification
    Tong, Wei
    Chen, Weitao
    Han, Wei
    Li, Xianju
    Wang, Lizhe
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 4121 - 4132
  • [37] Self-Attention Network With Joint Loss for Remote Sensing Image Scene Classification
    Wu, Honglin
    Zhao, Shuzhen
    Li, Liang
    Lu, Chaoquan
    Chen, Wen
    IEEE ACCESS, 2020, 8 : 210347 - 210359
  • [38] Adaptive scene-aware deep attention network for remote sensing image compression
    Zhai, Guowei
    Liu, Gang
    He, Xiaohai
    Wang, Zhengyong
    Ren, Chao
    Chen, Zhengxin
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (05)
  • [39] Semantic interleaving global channel attention for multilabel remote sensing image classification
    Liu, Yongkun
    Ni, Kesong
    Zhang, Yuhan
    Zhou, Lijian
    Zhao, Kun
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (02) : 393 - 419
  • [40] Graph Relation Network: Modeling Relations Between Scenes for Multilabel Remote-Sensing Image Classification and Retrieval
    Kang, Jian
    Fernandez-Beltran, Ruben
    Hong, Danfeng
    Chanussot, Jocelyn
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05): : 4355 - 4369