Building Extraction from Remote Sensing Images with Sparse Token Transformers

被引:97
|
作者
Chen, Keyan [1 ,2 ,3 ]
Zou, Zhengxia [4 ]
Shi, Zhenwei [1 ,2 ,3 ]
机构
[1] Beihang Univ, Sch Astronaut, Image Proc Ctr, Beijing 100191, Peoples R China
[2] Beihang Univ, Beijing Key Lab Digital Media, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Astronaut, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[4] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
remote sensing images; building extraction; transformers; sparse token sampler; EFFICIENT NETWORK; CLASSIFICATION; NET;
D O I
10.3390/rs13214441
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Deep learning methods have achieved considerable progress in remote sensing image building extraction. Most building extraction methods are based on Convolutional Neural Networks (CNN). Recently, vision transformers have provided a better perspective for modeling long-range context in images, but usually suffer from high computational complexity and memory usage. In this paper, we explored the potential of using transformers for efficient building extraction. We design an efficient dual-pathway transformer structure that learns the long-term dependency of tokens in both their spatial and channel dimensions and achieves state-of-the-art accuracy on benchmark building extraction datasets. Since single buildings in remote sensing images usually only occupy a very small part of the image pixels, we represent buildings as a set of "sparse " feature vectors in their feature space by introducing a new module called "sparse token sampler ". With such a design, the computational complexity in transformers can be greatly reduced over an order of magnitude. We refer to our method as Sparse Token Transformers (STT). Experiments conducted on the Wuhan University Aerial Building Dataset (WHU) and the Inria Aerial Image Labeling Dataset (INRIA) suggest the effectiveness and efficiency of our method. Compared with some widely used segmentation methods and some state-of-the-art building extraction methods, STT has achieved the best performance with low time cost.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] A review of road extraction from remote sensing images
    Weixing Wang
    Nan Yang
    Yi Zhang
    Fengping Wang
    Ting Cao
    Patrik Eklund
    [J]. Journal of Traffic and Transportation Engineering(English Edition), 2016, 3 (03) : 271 - 282
  • [42] Object and topology extraction from remote sensing images
    Maire, C
    Datcu, M
    [J]. 2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 1949 - 1952
  • [43] A review of road extraction from remote sensing images
    Wang, Weixing
    Yang, Nan
    Zhang, Yi
    Wang, Fengping
    Cao, Ting
    Eklund, Patrik
    [J]. JOURNAL OF TRAFFIC AND TRANSPORTATION ENGINEERING-ENGLISH EDITION, 2016, 3 (03) : 271 - 282
  • [44] Interactive objects extraction from remote sensing images
    Bucha, Victor
    Ablameyko, Sergey
    [J]. GEOGRAPHIC UNCERTAINTY IN ENVIRONMENTAL SECURITY, 2007, : 225 - +
  • [45] Rethinking Transformers for Semantic Segmentation of Remote Sensing Images
    Liu, Yuheng
    Zhang, Yifan
    Wang, Ye
    Mei, Shaohui
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [46] A review of research on remote sensing images shadow detection and application to building extraction
    Dong, Xueyan
    Cao, Jiannong
    Zhao, Weiheng
    [J]. EUROPEAN JOURNAL OF REMOTE SENSING, 2024, 57 (01)
  • [47] BuildMon: Building Extraction and Change Monitoring in Time Series Remote Sensing Images
    Wang, Yuxuan
    Chen, Shuailin
    Zhang, Ruixiang
    Xu, Fang
    Liang, Shuo
    Wang, Yujing
    Yang, Wen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 10813 - 10826
  • [48] Building-A-Nets: Robust Building Extraction From High-Resolution Remote Sensing Images With Adversarial Networks
    Li, Xiang
    Yao, Xiaojing
    Fang, Yi
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (10) : 3680 - 3687
  • [49] Learning Sparse Geometric Features for Building Segmentation from Low-Resolution Remote-Sensing Images
    Liu, Zeping
    Tang, Hong
    [J]. REMOTE SENSING, 2023, 15 (07)
  • [50] Distilling Segmenters From CNNs and Transformers for Remote Sensing Images' Semantic Segmentation
    Dong, Zhe
    Gao, Guoming
    Liu, Tianzhu
    Gu, Yanfeng
    Zhang, Xiangrong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61