Specification of the Schema of Spreadsheets for the Materialization of Ontologies from Integrated Data Sources

被引:0
|
作者
Alejandro Gomez, Sergio [1 ,2 ]
Ruben Fillottrani, Pablo [1 ,2 ]
机构
[1] Univ Nacl Sur, Dept Ciencias & Ingn Computac, Lab I D Ingn Software & Sistemas Informac LISSI, San Andres 800, Bahia Blanca, Buenos Aires, Argentina
[2] Comis Invest Cient Prov Buenos Aires CIC PBA, La Plata, Argentina
来源
关键词
Ontology-based data access; Ontologies; Relational; databases; Spreadsheets;
D O I
10.1007/978-3-030-75836-3_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Ontology-Based Data Access (OBDA), a knowledge base known as an ontology models both the problem domain and the underlying data sources. We are concerned with providing with tools for performing OBDA with relational and non-relational data sources. We developed an OBDA tool that is able to access H2 databases, CSV files and Excel spreadsheets allowing the user to explicitly formulate mappings, and populating an ontology that can be saved for later querying. In this paper, we present a language for specifying the schema of the data in a spreadsheet data application, which then can be used to access the contents of a set of Excel books with the ultimate goal of materializing its data as an OWL/RDF ontology. We characterize the syntax and semantics of the language, present a prototypical implementation and report on the performance tests showing that our implementation can handle a workload of Excel tables of the order of ten thousand records. We also show a case study in which the ontology of an idealized university library can be defined using the our tool integrating both relational and spreadsheet data.
引用
收藏
页码:247 / 262
页数:16
相关论文
共 50 条
  • [1] Building Ontologies from XML Data Sources
    Ghawi, Raji
    Cullot, Nadine
    [J]. PROCEEDINGS OF THE 20TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, 2009, : 480 - 484
  • [2] Materialization of OWL Ontologies from Relational Databases: A Practical Approach
    Alejandro Gomez, Sergio
    Ruben Fillottrani, Pablo
    [J]. COMPUTER SCIENCE - CACIC 2019, 2020, 1184 : 285 - 301
  • [3] SourceTrac: Tracing Data Sources within Spreadsheets
    Asuncion, Hazeline U.
    [J]. PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2012, 2012, 7525 : 1 - 10
  • [4] Schema Integration on Massive Data Sources
    Li, Tianbao
    Guo, Haifeng
    Yang, Donghua
    Li, Mengmeng
    Zheng, Bo
    Wang, Hongzhi
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT II, 2024, 14488 : 186 - 206
  • [5] Specification of Data Schema Mappings using Weaving Models
    Anicic, Nenad
    Neskovic, Sinisa
    Vuckovic, Milica
    Cvetkovic, Radovan
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2012, 9 (02) : 539 - 559
  • [6] Schema Discovery in RDF Data Sources
    Kellou-Menouer, Kenza
    Kedad, Zoubida
    [J]. CONCEPTUAL MODELING, ER 2015, 2015, 9381 : 481 - 495
  • [7] Incorporating function ontologies into the integration of data sources
    Tsai, HJ
    Xu, J
    Lin, S
    Miller, LL
    [J]. COMPUTERS AND THEIR APPLICATIONS, 2003, : 184 - 187
  • [8] JSON']JSON: Data model, Query languages and Schema specification
    Bourhis, Pierre
    Reutter, Juan L.
    Suarez, Fernando
    Vrgoc, Domagoj
    [J]. PODS'17: PROCEEDINGS OF THE 36TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2017, : 123 - 135
  • [9] Distributed Clustering for Data Sources with Diverse Schema
    Visalakshi, N. Karthikeyani
    Thangavel, K.
    Alagambigai, P.
    [J]. THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 1058 - +
  • [10] Consistent answers from integrated data sources
    Bertossi, L
    Chomicki, J
    Cortés, A
    Gutiérrez, C
    [J]. FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2002, 2522 : 71 - 85