Research on Building Extraction based on Neural Network with Feature Enhancement and ELU Activation Function

被引:0
|
作者
Tang Y. [1 ,2 ,3 ,4 ]
Liu Z. [1 ]
Yang Y. [1 ]
Gu H. [1 ]
Yang S. [2 ,3 ,4 ]
机构
[1] Institute of Photogrammetry and Remote Sensing, Chinese Academy of Surveying and Mapping, Beijing
[2] Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou
[3] National-Local Joint Engineering Research Center of Technologies and Applications for National Geographic State Monitoring, Lanzhou
[4] Gansu Provincial Engineering Laboratory for National Geographic State Monitoring, Lanzhou
基金
中国国家自然科学基金;
关键词
Building extraction; Convolutional neural network; Deep learning; End-to-end; Exponential Linear Units (ELU); Feature enhancement; Feature Enhancement Network (FE-Net); High-resolution remote sensing image;
D O I
10.12082/dqxxkx.2021.200130
中图分类号
学科分类号
摘要
In recent years, with the rapid development of the city, a large number of people turn to work and live in the city, resulting in an increasing number of urban buildings. Land resources and urban ecological environment (such as green space) are threatened to some extent. Thus, it is urgent to plan urban land resources and space reasonably, prevent illegal construction, improve urban living environment, and make the city sustainable, orderly, healthy, and green. With the high-resolution remote sensing image data becoming more and more abundant, accurate building extraction using high-resolution remote sensing images plays an important role in urban planning, urban management, and change detection of urban buildings. Based on the U-Net network model, using the Massachusetts building dataset, this paper explored the network model structure and proposed a network model called FE-Net with "encoder-feature enhancement-decoder" structure and ELU activation function. First, the best basic network model called U-Net6 was found by comparing the building extraction results using U-Net5, U-Net6, and U-Net7 with different number of network layers. Based on the U-Net6, the network model of "U-Net6+ReLU+feature enhancement" was established by adding the structure of feature enhancement. In order to optimize the activation function, the ReLU activation function was replaced by the ELU activation function, and then the network model called FE-Net (U-Net6+ELU+feature enhancement) was created. The FE-Net network model was compared with the building extraction results from the other two network models (U-Net6+ReLU and U-Net6+ReLU+feature enhancement). Results show that the FE-Net network model had the best building extraction performance. Its relaxed F1-measure reached 97.23%, which was 0.36% and 0.12% higher than the other two network models. Meanwhile, FE-Net also had the highest extraction accuracy compared with other studies using the same dataset of Massachusetts. The FE-Net network model can extract multi-scale buildings better, which can not only extract small-scale buildings accurately, but also roughly and completely extract buildings with irregular shape with relatively less missing and wrong detections. Thus, the FE-Net network model can be used to achieve end-to-end building extraction with a high accuracy. © 2021, Science Press. All right reserved.
引用
收藏
页码:692 / 709
页数:17
相关论文
共 30 条
  • [1] Han J W, Zhang D W, Cheng G, Et al., Object detection in optical remote sensing images based on weakly supervised learning and high-level feature learning, IEEE Transactions on Geoscience and Remote Sensing, 53, 6, pp. 3325-3337, (2015)
  • [2] Huang J K, Feng X F, Xu X L, Et al., Research on building extraction based on knowledge rule construction and morphological restoration, Geography and Geo-Information Science, 27, 4, pp. 28-31, (2011)
  • [3] Lin Y Z, Zhang B M, Xu J F, Et al., High resolution remote sensing image building extraction based on multi feature and multi-scale, Bulletin of Surveying and Mapping, 12, pp. 53-57, (2017)
  • [4] Lin X G, Zhang J X., Object oriented morphological building index and its application in building extraction from high-resolution remote sensing image, Acta Geodaetica et Cartographica Sinica, 46, 6, pp. 724-733, (2017)
  • [5] Tian H, Yang J, Wang Y M, Et al., Building extraction method based on prior shape constraint level set model, Acta Automatica Sinica, 36, 11, pp. 1502-1511, (2010)
  • [6] Huang X., Research on multi-scale texture, shape feature extraction and object-oriented classification of high-resolution remote sensing image, (2009)
  • [7] Levitt S, Aghdasi F., Texture measures for building recognition in aerial photographs, Communications and Singnal Processing, pp. 75-80, (1997)
  • [8] Lin C., Detection of buildings using perceptual grouping and shadows, IEEE Computer Vision & Pattern Recognition, pp. 62-69, (1994)
  • [9] Kim T, Muller J P., Development of a graph-based approach for building detection, Image and Vision Computing, 17, 1, pp. 3-14, (1999)
  • [10] Katartzis A, Sahli H, Nyssen E, Et al., Detection of buildings from a single airborne image using a markov random field model, Geoscience and Remote Sensing Symposium, 6, pp. 2832-2834, (2001)