Research on Knowledge Graph Data Management: A Survey

被引:0
|
作者
Wang X. [1 ,2 ]
Zou L. [3 ]
Wang C.-K. [4 ]
Peng P. [5 ]
Feng Z.-Y. [1 ,2 ]
机构
[1] College of Intelligence and Computing, Tianjin University, Tianjin
[2] Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin
[3] Institute of Computer Science and Technology, Peking University, Beijing
[4] School of Software, Tsinghua University, Beijing
[5] College of Computer Science and Electronic Engineering, Hunan University, Changsha
来源
Ruan Jian Xue Bao/Journal of Software | 2019年 / 30卷 / 07期
基金
中国国家自然科学基金;
关键词
Data management; Data model; Knowledge graph; Query language; Query operation; Storage management;
D O I
10.13328/j.cnki.jos.005841
中图分类号
学科分类号
摘要
Knowledge graphs have become the cornerstone of artificial intelligence. The construction and publication of large-scale knowledge graphs in various domains have posed new challenges on the data management of knowledge graphs. In this paper, in accordance with the structural and operational elements of a data model, the current theories, methods, technologies, and systems of knowledge graph data management are surveyed. First, the paper introduces knowledge graph data models, including the RDF graph model and the property graph model, and also introduces 5 knowledge graph query languages, including SPARQL, Cypher, Gremlin, PGQL, and G-CORE. Second, the storage management schemes of knowledge graphs are presented, including relational-based and native approaches. Third, three kinds of query operations are discussed, which are graph pattern matching, navigational, and analytical queries. Fourth, the paper introduces mainstream knowledge graph database management systems, which are categorized into RDF triple stores and native graph databases. Meanwhile, the state-of-the-art distributed systems and frameworks that are used for processing knowledge graphs are also described, and benchmarks are presented for knowledge graphs. Finally, the future research directions of knowledge graph data management are put forward as well. © Copyright 2019, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
下载
收藏
页码:2139 / 2174
页数:35
相关论文
共 189 条
  • [1] Knublauch H., Kontokostas D., Shapes constraint language (SHACL), W3C Editor's Draft
  • [2] Angles R., Gutierrez C., Survey of graph database models, ACM Computing Surveys, 40, 1, pp. 1-39, (2008)
  • [3] Gallagher B., Matching structure and semantics: A survey on graph-based pattern matching, 6, 2, pp. 45-53, (2006)
  • [4] Gutierrez C., Hurtado C.A., Mendelzon A.O., Foundations of semantic Web databases, Journal of Computer and System Sciences, 77, 3, pp. 520-541, (2011)
  • [5] Ozsu M.T., a Survey of RDF Data Management Systems, Frontiers of Computer Science, 10, 3, pp. 418-432, (2016)
  • [6] Zou L., Zsu M.T., Graph-based RDF data management, Data Science and Engineering, 2, 1, pp. 56-70, (2017)
  • [7] Wylot M., Hauswirth M., Cudre-Mauroux P., RDF data storage and query processing schemes: A survey, ACM Computing Surveys (CSUR), 51, 4, (2018)
  • [8] Angles R., Gutierrez C., An Introduction to Graph Data Management, pp. 1-32, (2018)
  • [9] Angles R., Arenas M., Barcelo P., Foundations of modern query languages for graph databases, ACM Computing Surveys, 50, 5, (2016)
  • [10] McCune R.R., Weninger T., Madey G., Thinking like a vertex: A survey of vertex-centric frameworks for large-scale distributed graph processing, ACM Computing Surveys (CSUR), 48, 2, (2015)