Semantic Segmentation of Urban Buildings from VHR Remote Sensing Imagery Using a Deep Convolutional Neural Network

被引:174
|
作者
Yi, Yaning [1 ,2 ]
Zhang, Zhijie [3 ]
Zhang, Wanchang [1 ]
Zhang, Chuanrong [3 ]
Li, Weidong [3 ]
Zhao, Tian [4 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Connecticut, Dept Geog, Storrs, CT 06269 USA
[4] Univ Wisconsin, Dept Comp Sci, Milwaukee, WI 53211 USA
关键词
semantic segmentation; urban building extraction; deep convolutional neural network; VHR remote sensing imagery; U-Net; AERIAL IMAGES; CLASSIFICATION; EXTRACTION; LIDAR; AREAS; SVM;
D O I
10.3390/rs11151774
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Urban building segmentation is a prevalent research domain for very high resolution (VHR) remote sensing; however, various appearances and complicated background of VHR remote sensing imagery make accurate semantic segmentation of urban buildings a challenge in relevant applications. Following the basic architecture of U-Net, an end-to-end deep convolutional neural network (denoted as DeepResUnet) was proposed, which can effectively perform urban building segmentation at pixel scale from VHR imagery and generate accurate segmentation results. The method contains two sub-networks: One is a cascade down-sampling network for extracting feature maps of buildings from the VHR image, and the other is an up-sampling network for reconstructing those extracted feature maps back to the same size of the input VHR image. The deep residual learning approach was adopted to facilitate training in order to alleviate the degradation problem that often occurred in the model training process. The proposed DeepResUnet was tested with aerial images with a spatial resolution of 0.075 m and was compared in performance under the exact same conditions with six other state-of-the-art networks-FCN-8s, SegNet, DeconvNet, U-Net, ResUNet and DeepUNet. Results of extensive experiments indicated that the proposed DeepResUnet outperformed the other six existing networks in semantic segmentation of urban buildings in terms of visual and quantitative evaluation, especially in labeling irregular-shape and small-size buildings with higher accuracy and entirety. Compared with the U-Net, the F1 score, Kappa coefficient and overall accuracy of DeepResUnet were improved by 3.52%, 4.67% and 1.72%, respectively. Moreover, the proposed DeepResUnet required much fewer parameters than the U-Net, highlighting its significant improvement among U-Net applications. Nevertheless, the inference time of DeepResUnet is slightly longer than that of the U-Net, which is subject to further improvement.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Combining Deep Semantic Segmentation Network and Graph Convolutional Neural Network for Semantic Segmentation of Remote Sensing Imagery
    Ouyang, Song
    Li, Yansheng
    [J]. REMOTE SENSING, 2021, 13 (01) : 1 - 22
  • [2] Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images
    Muhammad Alam
    Jian-Feng Wang
    Cong Guangpei
    LV Yunrong
    Yuanfang Chen
    [J]. Mobile Networks and Applications, 2021, 26 : 200 - 215
  • [3] Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images
    Alam, Muhammad
    Wang, Jian-Feng
    Guangpei, Cong
    Yunrong, L., V
    Chen, Yuanfang
    [J]. MOBILE NETWORKS & APPLICATIONS, 2021, 26 (01): : 200 - 215
  • [4] Semantic Segmentation of Building Roof in Dense Urban Environment with Deep Convolutional Neural Network: A Case Study Using GF2 VHR Imagery in China
    Qin, Yuchu
    Wu, Yunchao
    Li, Bin
    Gao, Shuai
    Liu, Miao
    Zhan, Yulin
    [J]. SENSORS, 2019, 19 (05)
  • [5] Semantic Segmentation of Remote Sensing Images Using Transfer Learning and Deep Convolutional Neural Network With Dense Connection
    Cui, Binge
    Chen, Xin
    Lu, Yan
    [J]. IEEE ACCESS, 2020, 8 (08): : 116744 - 116755
  • [6] Context-Aware Convolutional Neural Network for Object Detection in VHR Remote Sensing Imagery
    Gong, Yiping
    Xiao, Zhifeng
    Tan, Xiaowei
    Sui, Haigang
    Xu, Chuan
    Duan, Haiwang
    Li, Deren
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (01): : 34 - 44
  • [7] Semantic Segmentation of Small Objects and Modeling of Uncertainty in Urban Remote Sensing Images Using Deep Convolutional Neural Networks
    Kampffmeyer, Michael
    Salberg, Arnt-Borre
    Jenssen, Robert
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 680 - 688
  • [8] SEMANTIC SEGMENTATION OF URBAN BUILDINGS FROM VHR REMOTELY SENSED IMAGERY USING ATTENTION-BASED CNN
    Zhang, Zhijie
    Zhang, Chuanrong
    Li, Weidong
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1833 - 1836
  • [9] A New Multi-Channel Deep Convolutional Neural Network for Semantic Segmentation of Remote Sensing Image
    Liu, Wenjie
    Zhang, Yongjun
    Fan, Haisheng
    Zou, Yongjie
    Cui, Zhongwei
    [J]. IEEE ACCESS, 2020, 8 : 131814 - 131825
  • [10] A deep residual learning serial segmentation network for extracting buildings from remote sensing imagery
    Liu, Jiayun
    Wang, Shengsheng
    Hou, Xiaowei
    Song, Wenzhuo
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (14) : 5573 - 5587