Logical Schema for Data Warehouse on Column-Oriented NoSQL Databases

Cited by: 16
Authors
Boussahoua, Mohamed [1]
Boussaid, Omar [1]
Bentayeb, Fadila [1]
Affiliation
[1] Univ Lumiere Lyon 2, ERIC, EA 3083, 5 Ave Pierre Mendes France, F-69676 Bron, France
Keywords
Data warehouses; NoSQL databases; Columns family;
DOI
10.1007/978-3-319-64471-4_20
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Column-oriented NoSQL systems offer a flexible and highly denormalized data schema that facilitates data warehouse scalability. However, implementing a data warehouse on a NoSQL database is challenging, as it involves a distributed data management policy on multi-node clusters. In column-oriented NoSQL systems, query performance can be improved by careful data grouping. In this paper, we present a method that uses clustering techniques, in particular k-means, to derive a better form of column families from existing fact and dimension tables. To validate our method, we adopt the TPC-DS benchmark. We have conducted several experiments to examine the benefits of clustering techniques for the creation of column families in the column-oriented NoSQL database HBase on the Hadoop platform. Our experiments suggest that defining a good data grouping on HBase during the implementation of a data warehouse significantly increases the performance of decisional queries.
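The grouping idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which features the k-means step clusters on, so the sketch assumes each attribute is described by a binary vector recording which workload queries read it; that representation, the usage values, the function name `group_columns`, and the family names `cf0`, `cf1` are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def squared_distance(a: List[float], b: List[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def group_columns(usage: Dict[str, List[int]], k: int,
                  iters: int = 20) -> Dict[str, List[str]]:
    """Cluster attributes into k candidate column families with plain k-means.

    usage maps an attribute name to a 0/1 vector (assumed representation):
    entry j is 1 when hypothetical query j reads the attribute.
    """
    names = list(usage)
    points = [[float(v) for v in usage[n]] for n in names]

    # Deterministic initialisation: the first k distinct usage vectors.
    centroids: List[List[float]] = []
    for p in points:
        if p not in centroids:
            centroids.append(list(p))
        if len(centroids) == k:
            break
    if len(centroids) < k:
        raise ValueError("fewer than k distinct usage vectors")

    labels: List[int] = []
    for _ in range(iters):
        # Assignment step: nearest centroid for every attribute.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]

    return {
        f"cf{c}": [n for n, l in zip(names, labels) if l == c]
        for c in range(k)
    }


# Hypothetical usage of five store_sales attributes by four queries.
usage = {
    "ss_quantity":    [1, 1, 0, 0],
    "ss_sales_price": [1, 1, 0, 0],
    "ss_net_profit":  [0, 0, 1, 1],
    "ss_coupon_amt":  [0, 0, 1, 1],
    "ss_list_price":  [1, 0, 0, 0],
}
families = group_columns(usage, k=2)
```

Under these assumptions, attributes that tend to be read by the same queries land in the same candidate family, which is the kind of grouping the abstract credits with better query performance. Choosing `k` and the feature representation is left open here.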
Pages: 247-256
Page count: 10
Related papers
  • [1] Data Integrity Verification in Column-Oriented NoSQL Databases
    Weintraub, Grisha
    Gudes, Ehud
    [J]. DATA AND APPLICATIONS SECURITY AND PRIVACY XXXII, DBSEC 2018, 2018, 10980 : 165 - 181
  • [2] From Document Warehouse to Column-Oriented NoSQL Document Warehouse
    Ben Messaoud, Ines
    Ben Ali, Refka
    Feki, Jamel
    [J]. ICSOFT: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2017, : 85 - 94
  • [3] Implementation of Multidimensional Databases in Column-Oriented NoSQL Systems
    Chevalier, Max
    El Malki, Mohammed
    Kopliku, Arlind
    Teste, Olivier
    Tournier, Ronan
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2015, 2015, 9282 : 79 - 91
  • [4] Toward Automatic Generation of Column-Oriented NoSQL Databases in Big Data Context
    Esbai, Redouane
    Elotmani, Fouad
    Zahra Belkadi, Fatima
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2019, 15 (09) : 4 - 16
  • [5] Column-oriented databases
    Bößwetter D.
    Puppe F.
    Steinbauer D.
    [J]. Informatik-Spektrum, 2010, 33 (01) : 61 - 65
  • [6] An Efficient Schema Transformation Technique for Data Migration from Relational to Column-Oriented Databases
    Zaidi, Norwini
    Ishak, Iskandar
    Sidi, Fatimah
    Affendey, Lilly Suriani
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (03) : 1175 - 1188
  • [7] Incrementally Maintaining Materialized Temporal Views in Column-Oriented NoSQL Databases with Partial Deltas
    Hu, Yong
    Dessloch, Stefan
    [J]. NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2015), 2015, 539 : 88 - 96
  • [8] Startable: Multidimensional Modelling for Column-Oriented NoSQL
    Ferreira, Leandro Mendes
    Alves-Souza, Solange Nice
    da Silva, Luciana Maria
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 21 - 30
  • [9] VParC: A Compression Scheme for Numeric Data in Column-Oriented Databases
    Yan, Ke
    Zhu, Hong
    Lu, Kevin
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (01) : 1 - 11
  • [10] Document-oriented versus Column-oriented Data Storage for Social Graph Data Warehouse
    Challal, Zakia
    Bala, Wafaa
    Mokeddem, Hanifa
    Boukhalfa, Kamel
    Boussaid, Omar
    Benkhelifa, Elhadj
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 242 - 247