Analytic Processing in Data Lakes: A Semantic Query-Driven Discovery Approach

被引:0
|
作者
Diamantini, Claudia [1 ]
Potena, Domenico [1 ]
Storti, Emanuele [1 ]
机构
[1] Univ Politecn Marche, Dipartimento Ingn Informaz DII, Ancona, Italy
关键词
Data lake; Query-driven discovery; Knowledge graph; Multidimensional data; CHALLENGES;
D O I
10.1007/s10796-024-10471-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data integration and discovery are open issues in Data Lakes potentially storing hundreds of data sources. The present paper addresses these issues targeting multidimensional data sources, that is sources containing atomic or derived measures aggregated along a number of dimensions, typically derived from raw data for analytical and reporting purposes. Combining semantic models of metadata with existing data-driven techniques, the paper proposes an approach for the discovery of mappings between source metadata and concepts in a reference knowledge graph, enabling the definition of reasoning-based techniques to discover, integrate, and rank data sources relevant to a given analytical query. The efficiency and effectiveness of the approach is discussed by means of experiments on real-world scenarios.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Query-driven module discovery in microarray data
    Dhollander, Thomas
    Sheng, Qizheng
    Lemmens, Karen
    De Moor, Bart
    Marchal, Kathleen
    Moreau, Yves
    [J]. BIOINFORMATICS, 2007, 23 (19) : 2573 - 2580
  • [2] Query-Driven Graph Processing
    Bonifati, Angela
    [J]. COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 311 - 312
  • [3] A Knowledge-Based Approach to Support Analytic Query Answering in Semantic Data Lakes
    Diamantini, Claudia
    Potena, Domenico
    Storti, Emanuele
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2022, 2022, 13389 : 179 - 192
  • [4] RouPar: Routinely and Mixed Query-Driven Approach for Data Partitioning
    Bellatreche, Ladjel
    Kerkad, Amira
    Bress, Sebastian
    Geniet, Dominique
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2013 CONFERENCES, 2013, 8185 : 309 - 326
  • [5] Query-Driven Approach to Entity Resolution
    Altwaijry, Hotham
    Kalashnikov, Dmitri V.
    Mehrotra, Sharad
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (14): : 1846 - 1857
  • [6] Pharos: Query-Driven Schema Inference for the Semantic Web
    Haller, David
    Lenz, Richard
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 112 - 124
  • [7] Query-Driven Discovery of Anomalous Subgraphs in Attributed Graphs
    Wu, Nannan
    Chen, Feng
    Li, Jianxin
    Huai, Jinpeng
    Li, Bo
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3105 - 3111
  • [8] Query-driven support pattern discovery for classification learning
    Han, YQ
    Lam, W
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 399 - 402
  • [9] Query-Driven Approach to Face Clustering and Tagging
    Zhang, Liyan
    Wang, Xikui
    Kalashnikov, Dmitri V.
    Mehrotra, Sharad
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (10) : 4504 - 4513
  • [10] A Query-Driven Approach for Checking the Semantic Correctness of Ontology-Based Process Representations
    Fellmann, Michael
    Thomas, Oliver
    Busch, Bastian
    [J]. BUSINESS INFORMATION SYSTEMS, 2011, 87 : 62 - 73