HaoLap: A Hadoop based OLAP system for big data

Cited by: 29
Authors
Song, Jie [1]
Guo, Chaopeng [1]
Wang, Zhi [1]
Zhang, Yichan [1]
Yu, Ge [2]
Pierson, Jean-Marc [3]
Affiliations
[1] Northeastern Univ, Software Coll, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Sch Informat & Engn, Shenyang 110819, Peoples R China
[3] Univ Toulouse 3, Lab IRIT, F-31062 Toulouse, France
Funding
National Research Foundation, Singapore; National Natural Science Foundation of China
Keywords
Cloud data warehouse; Multidimensional data model; MapReduce
DOI
10.1016/j.jss.2014.09.024
Chinese Library Classification number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
In recent years, facing an information explosion, industry and academia have adopted distributed file systems and the MapReduce programming model to address the new challenges that big data has brought. Based on these technologies, this paper presents HaoLap (Hadoop based oLap), an OLAP (OnLine Analytical Processing) system for big data. Drawing on the experience of Multidimensional OLAP (MOLAP), HaoLap adopts a specified multidimensional model to map the dimensions and the measures; a dimension coding and traversal algorithm to achieve the roll-up operation on dimension hierarchies; a partition and linearization algorithm to store dimensions and measures; a chunk selection algorithm to optimize OLAP performance; and MapReduce to execute OLAP. The paper illustrates the key techniques of HaoLap, including the system architecture, dimension definition, dimension coding and traversal, partitioning, data storage, OLAP, and the data loading algorithm. We evaluated HaoLap on a real application and compared it with Hive, HadoopDB, HBaseLattice, and Olap4Cloud. The experimental results show that HaoLap boosts the efficiency of data loading, holds a clear advantage in OLAP performance as data set size and query complexity vary, and fully supports dimension operations. © 2014 Elsevier Inc. All rights reserved.
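The abstract names linearization and chunk partitioning without giving details. As a rough illustration of the general MOLAP idea only, the sketch below maps a cell's dimension codes to a single integer key in row-major order and assigns the cell to a fixed-shape chunk. The function names (`linearize`, `delinearize`, `chunk_of`), the row-major ordering, and the fixed chunk shape are all assumptions made for this example; they are not taken from the paper and may differ from HaoLap's actual algorithms.

```python
from typing import Sequence, Tuple


def linearize(coords: Sequence[int], sizes: Sequence[int]) -> int:
    """Map multidimensional coordinates to one integer (row-major order).

    Hypothetical helper: the ordering is an assumption, not HaoLap's scheme.
    """
    if len(coords) != len(sizes):
        raise ValueError("coords and sizes must have the same length")
    index = 0
    for c, n in zip(coords, sizes):
        if not 0 <= c < n:
            raise ValueError(f"coordinate {c} out of range for size {n}")
        index = index * n + c
    return index


def delinearize(index: int, sizes: Sequence[int]) -> Tuple[int, ...]:
    """Invert linearize: recover the coordinates from the integer key."""
    coords = []
    for n in reversed(sizes):
        index, c = divmod(index, n)
        coords.append(c)
    return tuple(reversed(coords))


def chunk_of(coords: Sequence[int], chunk_shape: Sequence[int]) -> Tuple[int, ...]:
    """Return the chunk coordinates of a cell for a fixed chunk shape (assumed)."""
    return tuple(c // s for c, s in zip(coords, chunk_shape))


# A cube with 4 x 5 x 6 dimension members, split into 2 x 2 x 2 chunks.
sizes = (4, 5, 6)
key = linearize((1, 2, 3), sizes)
cell = delinearize(key, sizes)
chunk = chunk_of(cell, (2, 2, 2))
```

With a key of this kind, a query that constrains some dimensions only needs to read the chunks whose coordinate ranges overlap the constraint, which is the intuition behind selecting chunks before running the aggregation.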
Pages: 167-181
Number of pages: 15