A parallel hierarchical aggregation algorithm in high dimensional data warehouse

被引:0
|
作者
Hu, Kongfa [1 ,2 ]
Liu, Jiajia [1 ]
Chen, Ling [1 ]
Da, Qingli [2 ]
机构
[1] Yangzhou Univ, Dept Comp Sci & Engn, Yangzhou 225009, Jiangsu, Peoples R China
[2] Southeast Univ, Sch Econ & Management, Nanjing 210096, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/FSKD.2007.106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
OLAP(on-line analytical processing) queries tend to be complex and ad hoc, often requiring computationally expensive operations such as multi-table joins and aggregation. In the high dimensional data warehouse(DW), we full materialized the data cube impossibly. In this paper, we propose a novel aggregation algorithm, PDHEPA(parallel pre-grouping aggregation based on the dimension hierarchical encoding), to vertically partition a high dimensional dataset into a set of disjoint low dimensional datasets called fragment mini-cubes. PDHEPA uses the small dimension hierarchical encoding and their prefix, so that it can drastically reduce the multi-table join operations. As a result, the method we proposed in this paper can greatly reduce the disk I/Os and highly improve the efficiency of OLAP queries. The analytical and experimental results show that the PDHEPA is more efficient than other existed ones.
引用
收藏
页码:36 / +
页数:2
相关论文
共 50 条
  • [1] A grouping aggregation algorithm based on the dimension hierarchical encoding in data warehouse
    Gong, Zhen-zhi
    Hu, Kong-fa
    Da, Qing-Li
    [J]. 6TH INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT APPLICATIONS, PROCEEDINGS, 2007, : 135 - +
  • [2] A high performance hierarchical cubing algorithm and efficient OLAP in high-dimensional data warehouse
    Hu, Kongfa
    Gong, Zhenzhi
    Da, Qingli
    Chen, Ling
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 357 - +
  • [3] A rapid dimension hierarchical aggregation algorithm on high dimensional OLAP
    Hu, Kong-Fa
    Chen, Ling
    Liu, Hai-Dong
    Liu, Jia-Jia
    Zhang, Chang-Hai
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1547 - +
  • [4] Parallel telemetric data warehouse balancing algorithm
    Gorawski, M
    Chechelski, R
    [J]. 5TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, PROCEEDINGS, 2005, : 387 - 392
  • [5] PHC: A rapid parallel hierarchical cubing algorithm on high dimensional OLAP
    Hu, Kongfa
    Chen, Ling
    Chen, Yixin
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PROCEEDINGS, 2007, 4494 : 72 - +
  • [6] A Parallel data preprocessing algorithm for hierarchical clustering
    Li Zhao-Peng
    Li Zhao-jian
    [J]. 2013 FIFTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2013), 2013, : 70 - 73
  • [7] A Parallel K-means Algorithm for High Dimensional Text Data
    Shan, Xiaolei
    Shen, Yanming
    Wang, Yuxin
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [8] AN OPTIMIZED LOAD ALGORITHM OF PARALLEL DATA WAREHOUSE BASED ON THE CLOUD COMPUTING PLATFORM
    Yang, Zhengqiu
    Li, Tong
    Xiu, Jiapeng
    Liu, Chen
    [J]. 2012 IEEE 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENT SYSTEMS (CCIS) VOLS 1-3, 2012, : 413 - 417
  • [9] Visual Analysis of High-Dimensional Event Sequence Data via Dynamic Hierarchical Aggregation
    Gotz, David
    Zhang, Jonathan
    Wang, Wenyuan
    Shrestha, Joshua
    Borland, David
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (01) : 440 - 450
  • [10] Parallel clustering algorithm based on sparse index sort of high dimensional data
    Wu, Sen
    Feng, Xiao-Dong
    Wu, Qing-Hai
    [J]. Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2011, 31 (SUPPL. 2): : 13 - 18