Performance benchmark on semantic web repositories for spatially explicit knowledge graph applications

被引:0
|
作者
Li, Wenwen [1 ]
Wang, Sizhe [1 ,2 ]
Wu, Sheng [3 ]
Gu, Zhining [1 ]
Tian, Yuanyuan [1 ]
机构
[1] School of Geographical Sciences and Urban Planning, Arizona State University, Tempe,AZ,85287-5302, United States
[2] School of Computing and Augmented Intelligence, Arizona State University, Tempe,AZ,85287-8809, United States
[3] School of Computer and Information Science, Southwest University, Chongqing,400715, China
基金
美国国家科学基金会;
关键词
Benchmarking - Data handling - Digital storage - Knowledge graph - Ontology - Query processing - Resource Description Framework (RDF) - Storage efficiency;
D O I
暂无
中图分类号
学科分类号
摘要
Knowledge graph has become a cutting-edge technology for linking and integrating heterogeneous, cross-domain datasets to address critical scientific questions. As big data has become prevalent in today's scientific analysis, semantic data repositories that can store and manage large knowledge graph data have become critical in successfully deploying spatially explicit knowledge graph applications. This paper provides a comprehensive evaluation of the popular semantic data repositories and their computational performance in managing and providing semantic support for spatial queries. There are three types of semantic data repositories: (1) triple store solutions (RDF4j, Fuseki, GraphDB, Virtuoso), (2) property graph databases (Neo4j), and (3) an Ontology-Based Data Access (OBDA) approach (Ontop). Experiments were conducted to compare each repository's efficiency (e.g., query response time) in handling geometric, topological, and spatial-semantic related queries. The results show that Virtuoso achieves the overall best performance in both non-spatial and spatial-semantic queries. The OBDA solution, Ontop, has the second-best query performance in spatial and complex queries and the best storage efficiency, requiring the least data-to-RDF conversion efforts. Other triple store solutions suffer from various issues that cause performance bottlenecks in handling spatial queries, such as inefficient memory management and lack of proper query optimization. © 2022 Elsevier Ltd
引用
下载
收藏
相关论文
共 38 条