Pixel Representation Augmented through Cross-Attention for High-Resolution Remote Sensing Imagery Segmentation

被引:0
|
作者
Luo, Yiyun [1 ,2 ]
Wang, Jinnian [1 ,2 ]
Yang, Xiankun [1 ,2 ]
Yu, Zhenyu [1 ,2 ]
Tan, Zixuan [1 ,2 ]
机构
[1] Guangzhou Univ, Sch Geog & Remote Sensing, Guangzhou 510006, Peoples R China
[2] Guangzhou Univ, Ctr Remote Sensing Big Data Intelligence Applicat, Guangzhou 510006, Peoples R China
基金
国家重点研发计划;
关键词
land cover classification; transformer; cross-attention; object embedding queries; LAND-COVER CLASSIFICATION; SEMANTIC SEGMENTATION; NETWORK;
D O I
10.3390/rs14215415
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Natural imagery segmentation has been transferred to land cover classification in remote sensing imagery with excellent performance. However, two key issues have been overlooked in the transfer process: (1) some objects were easily overwhelmed by the complex backgrounds; (2) interclass information for indistinguishable classes was not fully utilized. The attention mechanism in the transformer is capable of modeling long-range dependencies on each sample for per-pixel context extraction. Notably, per-pixel context from the attention mechanism can aggregate category information. Therefore, we proposed a semantic segmentation method based on pixel representation augmentation. In our method, a simplified feature pyramid was designed to decode the hierarchical pixel features from the backbone, and then decode the category representations into learnable category object embedding queries by cross-attention in the transformer decoder. Finally, pixel representation is augmented by an additional cross-attention in the transformer encoder under the supervision of auxiliary segmentation heads. The results of extensive experiments on the aerial image dataset Potsdam and satellite image dataset Gaofen Image Dataset with 15 categories (GID-15) demonstrate that the cross-attention is effective, and our method achieved the mean intersection over union (mIoU) of 86.2% and 62.5% on the Potsdam test set and GID-15 validation set, respectively. Additionally, we achieved an inference speed of 76 frames per second (FPS) on the Potsdam test dataset, higher than all the state-of-the-art models we tested on the same device.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] AANet: Adaptive Attention Networks for Semantic Segmentation of High-Resolution Remote Sensing Imagery
    Chen, Yan
    Zhang, Qianchuan
    Wang, Xiaofeng
    Dong, Quan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 14640 - 14655
  • [2] Multiscale Progressive Segmentation Network for High-Resolution Remote Sensing Imagery
    Hang, Renlong
    Yang, Ping
    Zhou, Feng
    Liu, Qingshan
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] Lightweight multiscale framework for segmentation of high-resolution remote sensing imagery
    Bello, Inuwa M.
    Zhang, Ke
    Wang, Jingyu
    Li, Haoyu
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (03)
  • [4] Anomaly Segmentation for High-Resolution Remote Sensing Images Based on Pixel Descriptors
    Li, Jingtao
    Wang, Xinyu
    Zhao, Hengwei
    Wang, Shaoyu
    Zhong, Yanfei
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4426 - 4434
  • [5] A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation
    Zuo, Renxiang
    Zhang, Guangyun
    Zhang, Rongting
    Jia, Xiuping
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] Unsupervised Domain Adaptation for Semantic Segmentation of High-Resolution Remote Sensing Imagery Driven by Category-Certainty Attention
    Chen, Jie
    Zhu, Jingru
    Guo, Ya
    Sun, Geng
    Zhang, Yi
    Deng, Min
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] Multi-level threshold segmentation of high-resolution panchromatic remote sensing imagery
    Yang, Yun
    Li, Yu
    Zhao, Quan-Hua
    [J]. Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2020, 28 (10): : 2370 - 2383
  • [8] Optimum segmentation of simple objects in high-resolution remote sensing imagery in coastal areas
    Jianyu Chen
    Delu Pan
    Zhihua Mao
    [J]. Science in China Series D: Earth Sciences, 2006, 49 : 1195 - 1203
  • [9] Analysis of high-resolution remote sensing imagery with textures derived from single pixel objects
    de Kok, R.
    Tasdemir, K.
    [J]. EARTH RESOURCES AND ENVIRONMENTAL REMOTE SENSING/GIS APPLICATIONS II, 2011, 8181
  • [10] Optimum segmentation of simple objects in high-resolution remote sensing imagery in coastal areas
    CHEN Jianyu1
    2. Shanghai Institute of Technical Physics
    [J]. Science China Earth Sciences, 2006, (11) : 1195 - 1203