An Optimized Distributed OLAP System for Big Data

被引:0
|
作者
Chen, Wenhao [1 ]
Wang, Haoxiang [1 ]
Zhang, Xingming [1 ]
Lin, Qidi [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
关键词
big data; decision making; OLAP;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To solve the problems of heterogeneous data types and large amount of calculation in making decision for big data, an optimized distributed OLAP system for big data is proposed in this paper. The system provides data acquisition for different data sources, and supports two types of OLAP engines, Impala and Kylin. First of all, the architecture of the system is proposed, consisting of four modules, data acquisition, data storage, OLAP analysis and data visualization, and the specific implementation of each module is descripted in great detail. Then the optimization of the system is put forward, which is automatic metadata configuration and the cache for OLAP query. Finally, the performance test of the system is conduct to demonstrate that the efficiency of the system is significantly better than the traditional solution.
引用
下载
收藏
页码:36 / 40
页数:5
相关论文
共 50 条
  • [1] HaoLap: A Hadoop based OLAP system for big data
    Song, Jie
    Guo, Chaopeng
    Wang, Zhi
    Zhang, Yichan
    Yu, Ge
    Pierson, Jean-Marc
    JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 102 : 167 - 181
  • [2] Genetic Optimized Data Deduplication for Distributed Big Data Storage Systems
    Kumar, Naresh
    Antwal, Shobha
    Samarthyam, Ganesh
    Jain, S. C.
    PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTING AND CONTROL (ISPCC 2K17), 2017, : 7 - 15
  • [3] What-If Query Processing Policy for Big Data in OLAP System
    Xu, Huan
    Luo, Hao
    He, Jieyue
    2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 110 - 116
  • [4] OLAP*: Effectively and Efficiently Supporting Parallel OLAP over Big Data
    Cuzzocrea, Alfredo
    Moussa, Rim
    Xu, Guandong
    MODEL AND DATA ENGINEERING, MEDI 2013, 2013, 8216 : 38 - 49
  • [5] Memory-optimized distributed utility mining for big data
    Kumar, Sunil
    Mohbey, Krishna Kumar
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6491 - 6503
  • [6] Towards an Optimized Big Data Processing System
    Ghit, Bogdan
    Iosup, Alexandru
    Epema, Dick
    PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 83 - 86
  • [7] Advances in data warehousing and OLAP in the big Data Era
    Bellatreche, Ladjel
    Cuzzocrea, Alfredo
    Song, Il-Yeol
    INFORMATION SYSTEMS, 2015, 53 : 39 - 40
  • [8] Benchmarking Big Data OLAP NoSQL Databases
    El Malki, Mohammed
    Kopliku, Arlind
    Sabir, Essaid
    Teste, Olivier
    UBIQUITOUS NETWORKING, UNET 2018, 2018, 11277 : 82 - 94
  • [9] Bandlimited OLAP Cubes for Interactive Big Data Visualization
    Reach, Caleb
    North, Chris
    2015 IEEE 5TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2015, : 107 - 114
  • [10] VOLAP: A Scalable Distributed System for Real-Time OLAP with High Velocity Data
    Dehne, Frank
    Robillard, David
    Rau-Chaplin, Andrew
    Burke, Neil
    2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 354 - 363