Geographic knowledge graph-guided remote sensing image semantic segmentation

被引:0
|
作者
Li Y. [1 ]
Wu K. [1 ]
Ouyang S. [1 ]
Yang K. [2 ]
Li H. [3 ]
Zhang Y. [1 ]
机构
[1] School of Remote Sensing and Information Engineering, Wuhan University, Wuhan
[2] Basic Geographic Information Center of Guizhou Province, Guiyang
[3] Guizhou First Surveying and Mapping Institute, Guiyang
基金
中国国家自然科学基金;
关键词
deep semantic segmentation network; entity-level connectivity constraint; geographic knowledge embedding optimization; geographic knowledge graph; spatial symbiosis knowledge constraint;
D O I
10.11834/jrs.20231110
中图分类号
学科分类号
摘要
Although the Deep Semantic Segmentation Network (DSSN) has notably enhanced remote-sensing image semantic segmentation, it still falls short of human experts' visual interpretation. Unlike DSSN's data-driven, pixel-level optimization, human experts rely on visual features, semantic insight, and prior knowledge for remote-sensing image interpretation. DSSN's pixel-level approach is constrained by spatial scale, lacking comprehensive target inference and struggling to bridge structured data and unstructured knowledge. In response to the two issues above, this paper proposes a geographic knowledge graph-guided deep semantic segmentation network for remote-sensing imagery. We use the ground-object semantic information and geoscience prior knowledge extracted from the geographic knowledge graph to construct loss constraints, thereby autonomously guiding the training process of DSSN. The essence of our approach lies in the intricately crafted design of loss constraints. These loss constraints include the entity-level connectivity constraint and the inter-entity symbiosis constraint. The former calculates the loss in the unit of connected domain entities instead of pixels to achieve overall constraints on the entity. The latter embeds the spatial symbiosis knowledge quantified by the symbiosis conditional probability into the data-driven DSSN to constrain the spatial distribution of segmented entities. The entity-level connectivity constraint guides DSSN to autonomously learn entity-level feature representations during training. Accordingly, the segmentation results become more holistic and suppresses blurry boundaries and random noise. The inter-entity symbiosis constraint adjusts the spatial distribution of entities according to the spatial semantic information and the prior geoscience knowledge. This adjustment realizes the automatic optimization of the spatial distribution of segmented entities. Extensive experiments show that under the guidance of the entity-level connectivity constraint and the inter-entity symbiosis constraint, DSSN can complete the learning of entity-level features. It can also automatically optimize the spatial distribution of ground objects based on spatial symbiosis knowledge, thereby effectively improving the performance of remote-sensing image semantic segmentation. Our novel geographic knowledge graph-guided approach to deep semantic segmentation in remote-sensing imagery has successfully addressed the challenges posed by DSSN's pixel-level optimization. By incorporating entity-level connectivity and inter-entity symbiosis constraints, we have enabled DSSN to autonomously learn comprehensive feature representations and optimize spatial distribution. The resulting improvements in semantic segmentation performance showcase the potential of merging domain-specific knowledge with data-driven techniques, bridging the gap between automated methods and human interpretation in remote-sensing image analysis. © 2024 Science Press. All rights reserved.
引用
收藏
页码:455 / 469
页数:14
相关论文
共 43 条
  • [1] Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Susstrunk S., SLIC Superpixels compared to state-of-the-art superpixel methods, IEEE Transactions on Pattern Analysis and Machine Intelligence, 34, 11, pp. 2274-2282, (2012)
  • [2] Alirezaie M, Langkvist M, Sioutis M, Loutfi A., Semantic referee: a neural-symbolic framework for enhancing geospatial semantic segmentation, Semantic Web, 10, 5, pp. 863-880, (2019)
  • [3] Arvor D, Belgiu M, Falomir Z, Mougenot I, Durieux L., Ontologies to interpret remote sensing images: why do we need them?, GIScience and Remote Sensing, 56, 6, pp. 911-939, (2019)
  • [4] Badrinarayanan V, Kendall A, Cipolla R., SegNet: a deep convolutional encoder-decoder architecture for image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 12, pp. 2481-2495, (2017)
  • [5] Camps-Valls G, Tuia D, Bruzzone L, Benediktsson J A., Advances in hyperspectral image classification: earth monitoring with statistical learning methods, IEEE Signal Processing Magazine, 31, 1, pp. 45-54, (2014)
  • [6] Chen L C, Zhu Y K, Papandreou G, Schroff F, Adam H., Encoder-decoder with atrous separable convolution for semantic image segmentation, Proceedings of the 15th European Conference on Computer Vision, pp. 833-851, (2018)
  • [7] Dash T, Chitlangia S, Ahuja A, Srinivasan A., A review of some techniques for inclusion of domain-knowledge into deep neural networks, Scientific Reports, 12, 1, (2022)
  • [8] Demir I, Koperski K, Lindenbaum D, Pang G, Huang J, Basu S, Hughes F, Tuia D, Raska R., DeepGlobe 2018: a challenge to parse the earth through satellite images, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 172-17209, (2018)
  • [9] Fan J, Yu W Z, Wu W, Shen Y., Knowledge-guided fitting method for sparse time series remote sensing data, Journal of Remote Sensing, 21, 5, pp. 749-756, (2017)
  • [10] Goodenough D G, Goldberg M, Plunkett G, Zelek J., An expert system for remote- sensing, IEEE Transactions on Geoscience and Remote Sensing, GE-25, 3, pp. 349-359, (1987)