Holistic and Algebraic Data Cube Computation Using MapReduce

被引:0
|
作者
Yang, Haile [1 ]
Han, Chunyan [1 ]
机构
[1] Northeastern Univ, Software Coll, Shenyang, Liaoning, Peoples R China
关键词
Data Cube Computation; MapReduce; Distributed Computing; OLAP;
D O I
10.1109/IHMSC.2017.126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an important method of accelerating large data analysis, data cube computing has been a hot area. At present, the researches based on MapReduce mainly focus on algebraic data cube. In this paper, we use MapReduce as the basis of the algorithm. For the algebraic data cube, data cube lattice is divided into several regions, according to different types of regions, different calculation algorithms are adopted. In this paper, for holistic measures, the data cube computation of Count Distinct measure by using Bitmap is proposed, and then the applicability of the algorithm to incremental computation and the combination with algebraic metric are discussed. The final experimental results show that the methods are more efficient and show good practicability.
引用
收藏
页码:47 / 50
页数:4
相关论文
共 50 条
  • [31] USING SYMBOLIC COMPUTATION TO FIND ALGEBRAIC INVARIANTS
    KEREN, D
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (11) : 1143 - 1149
  • [32] Learning numerical and algebraic computation using ANAWEB
    de Oliveira Camargo-Brunetto, Maria Angelica
    Arabori, Rafael Yul
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCES AND ITS APPLICATIONS, PROCEEDINGS, 2008, : 313 - 320
  • [33] Full and Partial Data Cube Computation and Representation over Commodity PCs
    Moreira, Angelica Aparecida
    Lima, Joubert de Castro
    [J]. 2012 IEEE 13TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2012, : 672 - 679
  • [34] Cube data model for multilevel statistics computation of live execution traces
    Ezzati-Jivan, Naser
    Dagenais, Michel R.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (05): : 1069 - 1091
  • [35] Minimal Condensed Cube: Data Organization, Fast Computation, and Incremental Update
    Wang, Zhuo
    Xu, Ye
    [J]. ICICSE: 2008 INTERNATIONAL CONFERENCE ON INTERNET COMPUTING IN SCIENCE AND ENGINEERING, PROCEEDINGS, 2008, : 60 - 67
  • [36] Query-Driven Frequent Co-Occurring Term Computation over Relational Data Using MapReduce
    Li, Jianxin
    Liu, Chengfei
    Zhou, Rui
    Yu, Jeffrey Xu
    [J]. COMPUTER JOURNAL, 2015, 58 (03): : 497 - 513
  • [37] An Implementation Approach of Big Data Computation by Mapping Java']Java Classes to MapReduce
    Verma, Chitresh
    Pandey, Rajiv
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 282 - 287
  • [38] Multi-cube computation
    Yu, JX
    Lu, HJ
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2001, : 126 - 133
  • [39] Parallel Computation of k-Nearest Neighbor Joins Using MapReduce
    Kim, Wooyeol
    Kim, Younghoon
    Shim, Kyuseok
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 696 - 705
  • [40] Data and task parallelism in ILP using MapReduce
    Ashwin Srinivasan
    Tanveer A. Faruquie
    Sachindra Joshi
    [J]. Machine Learning, 2012, 86 : 141 - 168