A dataspace-based framework for OLAP analyses in a high-variety multistore

被引:6
|
作者
Forresi, Chiara [1 ]
Gallinucci, Enrico [1 ]
Golfarelli, Matteo [1 ]
Ben Hamadou, Hamdi [2 ]
机构
[1] Univ Bologna, Cesena, Italy
[2] Aalborg Univ, Aalborg, Denmark
来源
VLDB JOURNAL | 2021年 / 30卷 / 06期
关键词
Multistore; NoSQL; Dataspace; GPSJ; Schemaless; OLAP; DATABASES;
D O I
10.1007/s00778-021-00682-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The success of NoSQL DBMSs has pushed the adoption of polyglot storage systems that take advantage of the best characteristics of different technologies and data models. While operational applications take great benefit from this choice, analytical applications suffer the absence of schema consistency, not only between different DBMSs but within a single NoSQL system as well. In this context, the discipline of data science is steering analysts away from traditional data warehousing and toward a more flexible and lightweight approach to data analysis. The idea is to perform OLAP analyses in a pay-as-you-go manner across heterogeneous schemas and data models, where the integration is progressively carried out by the user as the available data is explored. In this paper, we propose an approach to support data analysis within a high-variety multistore, with heterogeneous schemas and overlapping records. Our approach supports relational, document, wide-column, and key-value data models by automatically handling both data model and schema heterogeneity through a dataspace layer on top of the underlying DBMSs. The expressiveness we enable corresponds to GPSJ queries, which are the most common class of queries in OLAP applications. We rely on nested relational algebra to define a cross-database execution plan. The system has been prototyped on Apache Spark.
引用
收藏
页码:1017 / 1040
页数:24
相关论文
共 7 条
  • [1] A dataspace-based framework for OLAP analyses in a high-variety multistore
    Chiara Forresi
    Enrico Gallinucci
    Matteo Golfarelli
    Hamdi Ben Hamadou
    The VLDB Journal, 2021, 30 : 1017 - 1040
  • [2] A Framework for Lean Flow in Turbulent High-Variety Low-Volume Manufacturing Environments
    Alfnes, Erlend
    Thomassen, Maria Kollberg
    Gran, Erik
    ADVANCES IN PRODUCTION MANAGEMENT SYSTEMS: INITIATIVES FOR A SUSTAINABLE WORLD, 2016, 488 : 935 - 942
  • [3] Card-based delivery date promising in high-variety manufacturing with order release control
    Thuerer, Matthias
    Land, Martin J.
    Stevenson, Mark
    Fredendall, Lawrence D.
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2016, 172 : 19 - 30
  • [4] Extending the Theoretical Framework of Mass Customization: Initial and Adaptive Solution Space Development for High-Variety Production Environments
    Steiner, Frank
    PROCEEDINGS OF THE 7TH WORLD CONFERENCE ON MASS CUSTOMIZATION, PERSONALIZATION, AND CO-CREATION (MCPC 2014) - TWENTY YEARS OF MASS CUSTOMIZATION - TOWARDS NEW FRONTIERS, 2014, : 201 - 215
  • [5] A knowledge-based dynamic job-scheduling in low-volume/high-variety manufacturing
    Zhang, YX
    Chen, H
    ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1999, 13 (03): : 241 - 249
  • [6] Deploying ambidexterity through better management practices: an investigation based on high-variety, low-volume manufacturing
    Katic, Mile
    Cetindamar, Dilek
    Agarwal, Renu
    JOURNAL OF MANUFACTURING TECHNOLOGY MANAGEMENT, 2021, 32 (04) : 952 - 975
  • [7] Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses
    Jourdren, Laurent
    Bernard, Maria
    Dillies, Marie-Agnes
    Le Crom, Stephane
    BIOINFORMATICS, 2012, 28 (11) : 1542 - 1543