Tolerating Correlated Failures in Massively Parallel Stream Processing Engines

被引:0
|
作者
Su, Li [1 ]
Zhou, Yongluan [1 ]
机构
[1] Univ Southern Denmark, Odense, Denmark
来源
2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE) | 2016年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fault-tolerance techniques for stream processing engines can be categorized into passive and active approaches. A typical passive approach periodically checkpoints a processing task's runtime states and can recover a failed task by restoring its runtime state using its latest checkpoint. On the other hand, an active approach usually employs backup nodes to run replicated tasks. Upon failure, the active replica can take over the processing of the failed task with minimal latency. However, both approaches have their own inadequacies in Massively Parallel Stream Processing Engines (MPSPE). The passive approach incurs a long recovery latency especially when a number of correlated nodes fail simultaneously, while the active approach requires extra replication resources. In this paper, we propose a new fault-tolerance framework, which is Passive and Partially Active (PPA). In a PPA scheme, the passive approach is applied to all tasks while only a selected set of tasks will be actively replicated. The number of actively replicated tasks depends on the available resources. If tasks without active replicas fail, tentative outputs will be generated before the completion of the recovery process. We also propose effective and efficient algorithms to optimize a partially active replication plan to maximize the quality of tentative outputs. We implemented PPA on top of Storm, an open-source MPSPE and conducted extensive experiments using both real and synthetic datasets to verify the effectiveness of our approach.
引用
收藏
页码:517 / 528
页数:12
相关论文
共 50 条
  • [31] GePaRDT, a framework for massively parallel processing of dataflow graphs
    Schoech, Alexander
    Bach, Carlo
    Ettemeyer, Andreas
    Linz-Dittrich, Sabine
    REAL-TIME IMAGE AND VIDEO PROCESSING 2012, 2012, 8437
  • [32] Efficient approaches for constructing a massively parallel processing system
    Guan, HW
    Cheung, TY
    JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (13) : 1185 - 1190
  • [33] HAWQ: A Massively Parallel Processing SQL Engine in Hadoop
    Chang, Lei
    Wang, Zhanwei
    Ma, Tao
    Jian, Lirong
    Ma, Lili
    Goldshuv, Alon
    Lonergan, Luke
    Cohen, Jeffrey
    Welton, Caleb
    Sherry, Gavin
    Bhandarkar, Milind
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1223 - 1234
  • [34] Massively Parallel Hierarchical Scene Processing with Applications in Rendering
    Vinkler, Marek
    Bittner, Jiri
    Havran, Vlastimil
    Hapala, Michal
    COMPUTER GRAPHICS FORUM, 2013, 32 (08) : 13 - 25
  • [35] ScalaGraph: A Scalable Accelerator for Massively Parallel Graph Processing
    Yao, Pengcheng
    Zheng, Long
    Huang, Yu
    Wang, Qinggang
    Gui, Chuangyi
    Zeng, Zhen
    Liao, Xiaofei
    Jin, Hai
    Xue, Jingling
    2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 199 - 212
  • [36] Massively parallel processing distributed database for business intelligence
    Faculty of Sciences, Lebanese University, Lebanon
    Inf. Technol. J., 2008, 1 (70-76):
  • [37] Big Data normalization for massively parallel processing databases
    Golov, Nikolay
    Ronnback, Lars
    COMPUTER STANDARDS & INTERFACES, 2017, 54 : 86 - 93
  • [38] A RISC CENTRAL PROCESSING UNIT FOR A MASSIVELY PARALLEL ARCHITECTURE
    CAPPELLO, F
    BECHENNEC, JL
    ETIEMBLE, D
    MICROPROCESSING AND MICROPROGRAMMING, 1990, 30 (1-5): : 33 - 39
  • [39] CBP: A New Parallelization Paradigm for Massively Distributed Stream Processing
    Guo, Qingsong
    Zhou, Yongluan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT II, 2017, 10178 : 304 - 320
  • [40] A Fire Flame Simulation Scheme with Massively Parallel Processing
    Im, Byeonguk
    Baek, Nakhoon
    BIG DATA APPLICATIONS AND SERVICES 2017, 2019, 770 : 9 - 15