PHC: A rapid parallel hierarchical cubing algorithm on high dimensional OLAP

被引:0
|
作者
Hu, Kongfa [1 ]
Chen, Ling [1 ]
Chen, Yixin [2 ]
机构
[1] Yangzhou Univ, Dept Comp Sci & Engn, Yangzhou 225009, Peoples R China
[2] Washington Univ, Dept Comp Sci & Engn, St Louis, MO 63130 USA
基金
中国国家自然科学基金;
关键词
data cube; parallel hierarchical cubing algorithm (PHC); high dimensional OLAP;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data cube has been playing an essential role in OLAP (online analytical processing). ne pre-computation of data cubes is critical for improving the response time of OLAP systems. However, as the size of data cube grows, the time it takes to perform this pre-computation becomes a significant performance bottleneck. In a high dimensional OLAP, it might not be practical to build all these cuboids and their indices. In this paper, we propose a parallel hierarchical cubing algorithm, based on an extension of the previous minimal cubing approach. The algorithm has two components: decomposition of the cube space based on multiple dimension attributes, and an efficient OLAP query engine based on a prefix bitmap encoding of the indices. This method partitions the high dimensional data cube into low dimensional cube segments. Such an approach permits a significant reduction of CPU and I/O overhead for many queries by restricting the number of cube segments to be processed for both the fact table and bitmap indices. The proposed data allocation and processing model support parallel I/O and parallel processing, as well as load balancing for disks and processors. Experimental results show that the proposed parallel hierarchical cubing method is significantly more efficient than other existing cubing methods.
引用
收藏
页码:72 / +
页数:2
相关论文
共 50 条
  • [11] High Performance OLAP and Data Mining on Parallel Computers
    Sanjay Goil
    Alok Choudhary
    [J]. Data Mining and Knowledge Discovery, 1997, 1 : 391 - 417
  • [12] An adaptive parallel hierarchical clustering algorithm
    Li, Zhaopeng
    Li, Kenli
    Xiao, Degui
    Yang, Lei
    [J]. HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, 4782 : 97 - 107
  • [13] An Efficient OLAP Query Algorithm Based on Dimension Hierarchical Encoding Storage and Shark
    Yao, Shengqiang
    He, Jieyue
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2014, PT II, 2014, 8795 : 180 - 187
  • [14] Parallel social spider clustering algorithm for high dimensional datasets
    Shukla, Urvashi Prakash
    Nanda, Satyasai Jagannath
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 56 : 75 - 90
  • [15] A Range Query Parallel Algorithm in High-dimensional Space
    Xu, Hongbo
    Yao, Nianmin
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2308 - 2313
  • [16] A parallel rendering algorithm based on hierarchical radiosity
    Caballer, M
    Guerrero, D
    Hernández, V
    Román, JE
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2002, 2003, 2565 : 523 - 536
  • [17] A parallel hierarchical radiosity algorithm for complex scenes
    Feng, CC
    Yang, SN
    [J]. 1997 IEEE SYMPOSIUM ON PARALLEL RENDERING (PRS '97), PROCEEDINGS, 1997, : 71 - +
  • [18] A Parallel data preprocessing algorithm for hierarchical clustering
    Li Zhao-Peng
    Li Zhao-jian
    [J]. 2013 FIFTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2013), 2013, : 70 - 73
  • [19] A Parallel K-means Algorithm for High Dimensional Text Data
    Shan, Xiaolei
    Shen, Yanming
    Wang, Yuxin
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [20] An efficient algorithm for the parallel solution of high-dimensional differential equations
    Klus, Stefan
    Sahai, Tuhin
    Liu, Cong
    Dellnitz, Michael
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2011, 235 (09) : 3053 - 3062