Alovera: A Fast Stream Processing System for Large-Scale Data

被引:0
|
作者
Zhang, Zhen'An [1 ]
Zhang, Dongjie [2 ]
Yu, Xiaopeng [1 ]
Wang, Jing [2 ]
He, Chunjiang [3 ]
Yuan, Pingpeng [2 ]
Jin, Hai [2 ]
机构
[1] HAEPC Elect Power Res Inst, Zhengzhou, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Wuhan 430074, Peoples R China
[3] China Elect Power Res Inst, Beijing 100085, Peoples R China
关键词
Large-scale data analysis; query execution; columnar store; stream processing;
D O I
10.1109/ChinaGrid.2013.9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Growing of data volume poses challenges to data processing system. In this paper, Alovera, a fast stream processing system for large-scale data is presented. By using columnar data layout and stream processing, it is capable of pipelining data processing efficiently. It can process part of data instead of waiting for all data to be ready for the next operation. Thus, it can reduce the query time dramatically. Experimental results indicate significant performance improvement in a variety of tasks. In the experiments, we also evaluate our methods with different systems including HadoopDB and Hive. The extensive experiments confirm efficiency and better performance of our system.
引用
收藏
页码:74 / 79
页数:6
相关论文
共 50 条
  • [1] Optimizing data stream processing for large-scale applications
    Cappellari, Paolo
    Roantree, Mark
    Chun, Soon Ae
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (09): : 1607 - 1641
  • [2] Dynamic and fast processing of queries on large-scale RDF data
    Yuan, Pingpeng
    Xie, Changfeng
    Jin, Hai
    Liu, Ling
    Yang, Guang
    Shi, Xuanhua
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 311 - 334
  • [3] Dynamic and fast processing of queries on large-scale RDF data
    Pingpeng Yuan
    Changfeng Xie
    Hai Jin
    Ling Liu
    Guang Yang
    Xuanhua Shi
    [J]. Knowledge and Information Systems, 2014, 41 : 311 - 334
  • [4] Adaptive correlated prefetch with large-scale hybrid memory system for stream processing
    Lee, Sung Min
    Yoon, Su-Kyung
    Kim, Jeong-Geun
    Kim, Shin-Dug
    [J]. JOURNAL OF SUPERCOMPUTING, 2018, 74 (09): : 4746 - 4770
  • [5] Adaptive correlated prefetch with large-scale hybrid memory system for stream processing
    Sung Min Lee
    Su-Kyung Yoon
    Jeong-Geun Kim
    Shin-Dug Kim
    [J]. The Journal of Supercomputing, 2018, 74 : 4746 - 4770
  • [6] A Hybrid Processing System for Large-Scale Traffic Sensor Data
    Zhao, Zhuofeng
    Ding, Weilong
    Wang, Jianwu
    Han, Yanbo
    [J]. IEEE ACCESS, 2015, 3 : 2341 - 2351
  • [7] Towards Large-Scale Graph Stream Processing Platform
    Suzumura, Toyotaro
    Nishii, Shunsuke
    Ganse, Masaru
    [J]. WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 1321 - 1326
  • [8] ShuffleBench: A Benchmark for Large-Scale Data Shuffling Operations with Distributed Stream Processing Frameworks
    Henning, Soeren
    Vogel, Adriano
    Leichtfried, Michael
    Ertl, Otmar
    Rabiser, Rick
    [J]. PROCEEDINGS OF THE 15TH ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, ICPE 2024, 2024, : 2 - 13
  • [9] Boosting Algorithms for Large-Scale Data and Data Batch Stream
    Yoon, Young Joo
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2010, 23 (01) : 197 - 206
  • [10] Anomaly detection in large-scale data stream networks
    Duc-Son Pham
    Venkatesh, Svetha
    Lazarescu, Mihai
    Budhaditya, Saha
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (01) : 145 - 189