Building Extraction from Remote Sensing Images with Sparse Token Transformers

被引:97
|
作者
Chen, Keyan [1 ,2 ,3 ]
Zou, Zhengxia [4 ]
Shi, Zhenwei [1 ,2 ,3 ]
机构
[1] Beihang Univ, Sch Astronaut, Image Proc Ctr, Beijing 100191, Peoples R China
[2] Beihang Univ, Beijing Key Lab Digital Media, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Astronaut, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[4] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
remote sensing images; building extraction; transformers; sparse token sampler; EFFICIENT NETWORK; CLASSIFICATION; NET;
D O I
10.3390/rs13214441
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Deep learning methods have achieved considerable progress in remote sensing image building extraction. Most building extraction methods are based on Convolutional Neural Networks (CNN). Recently, vision transformers have provided a better perspective for modeling long-range context in images, but usually suffer from high computational complexity and memory usage. In this paper, we explored the potential of using transformers for efficient building extraction. We design an efficient dual-pathway transformer structure that learns the long-term dependency of tokens in both their spatial and channel dimensions and achieves state-of-the-art accuracy on benchmark building extraction datasets. Since single buildings in remote sensing images usually only occupy a very small part of the image pixels, we represent buildings as a set of "sparse " feature vectors in their feature space by introducing a new module called "sparse token sampler ". With such a design, the computational complexity in transformers can be greatly reduced over an order of magnitude. We refer to our method as Sparse Token Transformers (STT). Experiments conducted on the Wuhan University Aerial Building Dataset (WHU) and the Inria Aerial Image Labeling Dataset (INRIA) suggest the effectiveness and efficiency of our method. Compared with some widely used segmentation methods and some state-of-the-art building extraction methods, STT has achieved the best performance with low time cost.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] On The Exploration of Vision Transformers in Remote Sensing Building Extraction
    Angelis, G. F.
    Domi, A.
    Zamichos, A.
    Tsourma, M.
    Drosou, A.
    Tzovaras, D.
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 208 - +
  • [2] A Lightweight Network for Building Extraction From Remote Sensing Images
    Huang, Huaigang
    Chen, Yiping
    Wang, Ruisheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] Semantic Alignment Network for Building Extraction from Remote Sensing Images
    Fu, Wei
    Xie, Kai
    Du, Xingbei
    Fang, Leyuan
    [J]. International Geoscience and Remote Sensing Symposium (IGARSS), 2024, : 8127 - 8130
  • [4] Building Extraction From Remote Sensing Images With DoG as Prior Constraint
    Quan, Yujun
    Yu, Anzhu
    Cao, Xuefeng
    Qiu, Chunping
    Zhang, Xiaoyi
    Liu, Bing
    He, Peipei
    [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022, 15 : 6559 - 6570
  • [5] Prototype Contrastive Learning for Building Extraction From Remote Sensing Images
    Chen, Zhenshuai
    Xiang, Wei
    Lin, Zhiyuan
    Yu, Chuang
    Liu, Yunpeng
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [6] Topological Building Extraction With Bidirectional Prediction From Remote Sensing Images
    Zhang, Mingming
    Du, Ye
    Hu, Zhenghui
    Wang, Wei
    Liu, Qingjie
    Wang, Yunhong
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 10324 - 10337
  • [7] Building Extraction From Remote Sensing Images With DoG as Prior Constraint
    Quan, Yujun
    Yu, Anzhu
    Cao, Xuefeng
    Qiu, Chunping
    Zhang, Xiaoyi
    Liu, Bing
    He, Peipei
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 6559 - 6570
  • [8] Homogeneous Aggregation Convolution for Building Extraction From Remote Sensing Images
    Zhang, Rouyu
    Lin, Baokai
    Zhang, Qian
    Zhang, Guixu
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [9] Feature Residual Analysis Network for Building Extraction from Remote Sensing Images
    Miao, Yuqi
    Jiang, Shanshan
    Xu, Yiming
    Wang, Dongjie
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [10] BUILDING EXTRACTION FROM REMOTE SENSING IMAGES WITH DEEP LEARNING IN A SUPERVISED MANNER
    Chen, Kaiqiang
    Fu, Kun
    Gao, Xin
    Yan, Menglong
    Sun, Xian
    Zhang, Huan
    [J]. 2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 1672 - 1675