Automatic Building Extraction on High-Resolution Remote Sensing Imagery Using Deep Convolutional Encoder-Decoder With Spatial Pyramid Pooling

被引:88
|
作者
Liu, Yaohui [1 ,2 ]
Gross, Lutz [2 ]
Li, Zhiqiang [3 ]
Li, Xiaoli [3 ]
Fan, Xiwei [1 ]
Qi, Wenhua [1 ]
机构
[1] China Earthquake Adm, Inst Geol, Beijing 100029, Peoples R China
[2] Univ Queensland, Sch Earth & Environm Sci, Brisbane, Qld 4072, Australia
[3] China Earthquake Networks Ctr, Beijing 100045, Peoples R China
关键词
Deep learning; high-resolution remote sensing imagery; building extraction; fully convolutional networks; encoder-decoder; SCALE; CLASSIFICATION;
D O I
10.1109/ACCESS.2019.2940527
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic extraction of buildings from remote sensing imagery plays a significant role in many applications, such as urban planning and monitoring changes to land cover. Various building segmentation methods have been proposed for visible remote sensing images, especially state-of-the-art methods based on convolutional neural networks (CNNs). However, high-accuracy building segmentation from high-resolution remote sensing imagery is still a challenging task due to the potentially complex texture of buildings in general and image background. Repeated pooling and striding operations used in CNNs reduce feature resolution causing a loss of detailed information. To address this issue, we propose a light-weight deep learning model integrating spatial pyramid pooling with an encoder-decoder structure. The proposed model takes advantage of a spatial pyramid pooling module to capture and aggregate multi-scale contextual information and of the ability of encoder-decoder networks to restore losses of information. The proposed model is evaluated on two publicly available datasets; the Massachusetts roads and buildings dataset and the INRIA Aerial Image Labeling Dataset. The experimental results on these datasets show qualitative and quantitative improvement against established image segmentation models, including SegNet, FCN, U-Net, Tiramisu, and FRRN. For instance, compared to the standard U-Net, the overall accuracy gain is 1.0% (0.913 vs. 0.904) and 3.6% (0.909 vs. 0.877) with a maximal increase of 3.6% in model-training time on these two datasets. These results demonstrate that the proposed model has the potential to deliver automatic building segmentation from high-resolution remote sensing images at an accuracy that makes it a useful tool for practical application scenarios.
引用
收藏
页码:128774 / 128786
页数:13
相关论文
共 50 条
  • [31] BRRNet: A Fully Convolutional Neural Network for Automatic Building Extraction From High-Resolution Remote Sensing Images
    Shao, Zhenfeng
    Tang, Penghao
    Wang, Zhongyuan
    Saleem, Nayyer
    Yam, Sarath
    Sommai, Chatpong
    [J]. REMOTE SENSING, 2020, 12 (06)
  • [32] Automatic Polyp Segmentation in Colonoscopy Images Using a Modified Deep Convolutional Encoder-Decoder Architecture
    Eu, Chin Yii
    Tang, Tong Boon
    Lin, Cheng-Hung
    Lee, Lok Hua
    Lu, Cheng-Kai
    [J]. SENSORS, 2021, 21 (16)
  • [33] Building Extraction in Very High Resolution Remote Sensing Imagery Using Deep Learning and Guided Filters
    Xu, Yongyang
    Wu, Liang
    Xie, Zhong
    Chen, Zhanlong
    [J]. REMOTE SENSING, 2018, 10 (01)
  • [34] DE-Net: Deep Encoding Network for Building Extraction from High-Resolution Remote Sensing Imagery
    Liu, Hao
    Luo, Jiancheng
    Huang, Bo
    Hu, Xiaodong
    Sun, Yingwei
    Yang, Yingpin
    Xu, Nan
    Zhou, Nan
    [J]. REMOTE SENSING, 2019, 11 (20)
  • [35] Road extraction from high resolution remote sensing image via a deep residual and pyramid pooling network
    Han, Yibo
    Han, Pu
    Jia, Manlei
    [J]. IET IMAGE PROCESSING, 2021, 15 (13) : 3080 - 3093
  • [36] Deep Learning for Building Extraction from High-Resolution Remote Sensing Images
    Norelyaqine, Abderrahim
    Saadane, Abderrahim
    [J]. ADVANCED TECHNOLOGIES FOR HUMANITY, 2022, 110 : 116 - 128
  • [37] SISR of Hyperspectral Remote Sensing Imagery Using 3D Encoder-Decoder RUNet Architecture
    Aburaed, Nour
    Alkhatib, Mohammed Q.
    Marshall, Stephen
    Zabalza, Jaime
    Al Ahmad, Hussain
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1516 - 1519
  • [38] Ground-Based Remote Sensing Cloud Detection Using Dual Pyramid Network and Encoder-Decoder Constraint
    Zhang, Zhong
    Yang, Shuzhen
    Liu, Shuang
    Cao, Xiaozhong
    Durrani, Tariq S.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [39] Automatic building footprint extraction from very high-resolution imagery using deep learning techniques
    Rastogi, Kriti
    Bodani, Pankaj
    Sharma, Shashikant A.
    [J]. GEOCARTO INTERNATIONAL, 2022, 37 (05) : 1501 - 1513
  • [40] A Convolutional Neural Network for Coastal Aquaculture Extraction from High-Resolution Remote Sensing Imagery
    Deng, Jinpu
    Bai, Yongqing
    Chen, Zhengchao
    Shen, Ting
    Li, Cong
    Yang, Xuan
    [J]. SUSTAINABILITY, 2023, 15 (06)