PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips

被引:0
|
作者
Hubert, Nicolas [1 ,2 ]
Monnin, Pierre [3 ]
d'Aquin, Mathieu [2 ]
Monticolo, Davy [1 ]
Brun, Armelle [2 ]
机构
[1] Univ Lorraine, ERPI, Nancy, France
[2] Univ Lorraine, CNRS, LORIA, Nancy, France
[3] Univ Cote Azur, CNRS, I3S, INRIA, Sophia Antipolis, France
来源
关键词
Knowledge Graph; Schema; Semantic Web; Synthetic Data Generator; BENCHMARK;
D O I
10.1007/978-3-031-60635-9_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge graphs (KGs) have emerged as a prominent data representation and management paradigm. Being usually underpinned by a schema (e.g., an ontology), KGs capture not only factual information but also contextual knowledge. In some tasks, a few KGs established themselves as standard benchmarks. However, recent works outline that relying on a limited collection of datasets is not sufficient to assess the generalization capability of an approach. In some data-sensitive fields such as education or medicine, access to public datasets is even more limited. To remedy the aforementioned issues, we release PyGraft, a Python-based tool that generates highly customized, domain-agnostic schemas and KGs. The synthesized schemas encompass various RDFS and OWL constructs, while the synthesized KGs emulate the characteristics and scale of real-world KGs. Logical consistency of the generated resources is ultimately ensured by running a description logic (DL) reasoner. By providing a way of generating both a schema and KG in a single pipeline, PyGraft's aim is to empower the generation of a more diverse array of KGs for benchmarking novel approaches in areas such as graph-based machine learning (ML), or more generally KG processing. In graph-based ML in particular, this should foster a more holistic evaluation of model performance and generalization capability, thereby going beyond the limited collection of available benchmarks. PyGraft is available at: https://github.com/nicolas-hbt/pygraft.
引用
下载
收藏
页码:3 / 20
页数:18
相关论文
共 50 条
  • [21] A Semantic Annotation Pipeline towards the Generation of Knowledge Graphs in Tribology
    Kuegler, Patricia
    Marian, Max
    Dorsch, Rene
    Schleich, Benjamin
    Wartzack, Sandro
    LUBRICANTS, 2022, 10 (02)
  • [22] Knowledge grounded medical dialogue generation using augmented graphs
    Deeksha Varshney
    Aizan Zafar
    Niranshu Kumar Behera
    Asif Ekbal
    Scientific Reports, 13
  • [23] Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs
    Zhang, Houyu
    Liu, Zhenghao
    Xiong, Chenyan
    Liu, Zhiyuan
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2031 - 2043
  • [24] Knowledge grounded medical dialogue generation using augmented graphs
    Varshney, Deeksha
    Zafar, Aizan
    Behera, Niranshu Kumar
    Ekbal, Asif
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [25] Automated Template Generation for Question Answering over Knowledge Graphs
    Abujabal, Abdalghani
    Yahya, Mohamed
    Riedewald, Mirek
    Weikum, Gerhard
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 1191 - 1200
  • [26] Differentially Private Knowledge Distillation via Synthetic Text Generation
    University of Southern California, United States
    Proc. Annu. Meet. Assoc. Comput Linguist., (12957-12968):
  • [27] Domain Knowledge-Driven Generation of Synthetic Healthcare Data
    Hashemi, Atiye Sadat
    Soliman, Amira
    Lundstrom, Jens
    Etminani, Kobra
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 352 - 353
  • [28] Enterprise Knowledge Graphs: A Semantic Approach for Knowledge Management in the Next Generation of Enterprise Information Systems
    Galkin, Mikhail
    Auer, Soeren
    Vidal, Maria-Esther
    Scerri, Simon
    ICEIS: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 2, 2017, : 88 - 98
  • [29] Formal Languages Generation in Systems of Knowledge Representation Based on Stratified Graphs
    Danciulescu, Daniela
    INFORMATICA, 2015, 26 (03) : 407 - 417
  • [30] Using knowledge graphs for prediction and visual hypothesis generation in drug discovery
    Wild, David
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258