Logical Schema for Data Warehouse on Column-Oriented NoSQL Databases

被引:17
|
作者
Boussahoua, Mohamed [1 ]
Boussaid, Omar [1 ]
Bentayeb, Fadila [1 ]
机构
[1] Univ Lumiere Lyon 2, ERIC, EA 3083, 5 Ave Pierre Mendes France, F-69676 Bron, France
关键词
Data warehouses; NoSQL databases; Columns family;
D O I
10.1007/978-3-319-64471-4_20
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The column-oriented NoSQL systems propose a flexible and highly denormalized data schema that facilitates data warehouse scalability. However, the implementation process of data warehouses with NoSQL databases is a challenging task as it involves a distributed data management policy on multi-nodes clusters. Indeed, in column-oriented NoSQL systems, the query performances can be improved by a careful data grouping. In this paper, we present a method that uses clustering techniques, in particular k-means, to model the better form of column families, from existing fact and dimensional tables. To validate our method, we adopt TPC-DS data benchmark. We have conducted several experiments to examine the benefits of clustering techniques for the creation of column families in a column-oriented NoSQL HBase database on Hadoop platform. Our experiments suggest that defining a good data grouping on HBase database during the implementation of a data warehouse increases significantly the performance of the decisional queries.
引用
收藏
页码:247 / 256
页数:10
相关论文
共 50 条
  • [21] The Column-oriented Data Store Performance Considerations
    Nowosielski, Artur
    Kowalski, Piotr A.
    Kulczyzki, Piotr
    PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 : 877 - 881
  • [22] Ingestion of a Data Lake into a NoSQL Data Warehouse: The Case of Relational Databases
    Abdelhedi, Fatma
    Jemmali, Rym
    Zurfluh, Gilles
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KMIS), VOL 3, 2021, : 64 - 72
  • [23] Automatic Transformation of Data Warehouse Schema To NoSQL Data Base: Comparative Study
    Yangui, Rania
    Nabli, Ahlem
    Gargouri, Faiez
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 264 - 273
  • [24] NOSOLAP: Moving from Data Warehouse Requirements to NoSQL Databases
    Prakash, Deepika
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE), 2019, : 452 - 458
  • [25] Design and Implementation of Hardware Cache Mechanism and NIC for Column-Oriented Databases
    Hamada, Akihiko
    Matsutani, Hiroki
    2016 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG16), 2016,
  • [26] Impact of Data Compression on the Performance of Column-oriented Data Stores
    Mladenova, Tsvetelina
    Kalmukov, Yordan
    Marinov, Milko
    Valova, Irena
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 416 - 421
  • [27] Storing and Indexing RDF Data in a Column-Oriented DBMS
    Wang, Xin
    Wang, Shuyi
    Du, Pufeng
    Feng, Zhiyong
    2010 2ND INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS PROCEEDINGS (DBTA), 2010,
  • [28] An OAIS-Based Hospital Information System on the Cloud: Analysis of a NoSQL Column-Oriented Approach
    Celesti, Antonio
    Fazio, Maria
    Romano, Agata
    Bramanti, Alessia
    Bramanti, Placido
    Villari, Massimo
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (03) : 912 - 918
  • [29] DataCommandr: Column-oriented Data Integration, Transformation and Analysis
    Savinov, Alexandr
    IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 339 - 347
  • [30] Query Processing over Data Warehouse using Relational Databases and NoSQL
    Carniel, Anderson Chaves
    Sa, Aried de Aguiar
    Porto Brisighello, Vinicius Henrique
    Ribeiro, Marcela Xavier
    Bueno, Renato
    Ciferri, Ricardo Rodrigues
    de Aguiar Ciferri, Cristina Dutra
    2012 XXXVIII CONFERENCIA LATINOAMERICANA EN INFORMATICA (CLEI), 2012,