Tolerating Correlated Failures in Massively Parallel Stream Processing Engines

被引:0
|
作者
Su, Li [1 ]
Zhou, Yongluan [1 ]
机构
[1] Univ Southern Denmark, Odense, Denmark
来源
2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE) | 2016年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fault-tolerance techniques for stream processing engines can be categorized into passive and active approaches. A typical passive approach periodically checkpoints a processing task's runtime states and can recover a failed task by restoring its runtime state using its latest checkpoint. On the other hand, an active approach usually employs backup nodes to run replicated tasks. Upon failure, the active replica can take over the processing of the failed task with minimal latency. However, both approaches have their own inadequacies in Massively Parallel Stream Processing Engines (MPSPE). The passive approach incurs a long recovery latency especially when a number of correlated nodes fail simultaneously, while the active approach requires extra replication resources. In this paper, we propose a new fault-tolerance framework, which is Passive and Partially Active (PPA). In a PPA scheme, the passive approach is applied to all tasks while only a selected set of tasks will be actively replicated. The number of actively replicated tasks depends on the available resources. If tasks without active replicas fail, tentative outputs will be generated before the completion of the recovery process. We also propose effective and efficient algorithms to optimize a partially active replication plan to maximize the quality of tentative outputs. We implemented PPA on top of Storm, an open-source MPSPE and conducted extensive experiments using both real and synthetic datasets to verify the effectiveness of our approach.
引用
收藏
页码:517 / 528
页数:12
相关论文
共 50 条
  • [21] Direct stereo radargrammetric processing using massively parallel processing
    Balz, Timo
    Zhang, Lu
    Liao, Mingsheng
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2013, 79 : 137 - 146
  • [22] MASSIVELY PARALLEL PROCESSING COMPUTER FOR SATELLITE IMAGE PROCESSING.
    Goel, U.C.
    Joshi, R.C.
    Students' Journal of the Institution of Electronics and Telecommunication Engineers, 1986, 27 (03): : 112 - 120
  • [23] Massively parallel and multiscale simulations of strongly correlated electronic systems
    Mark, Jarrell
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2009, 237
  • [24] Defining the execution semantics of stream processing engines
    Affetti L.
    Tommasini R.
    Margara A.
    Cugola G.
    Della Valle E.
    Journal of Big Data, 4 (1)
  • [25] Stream Processing Engines for Smart Healthcare Systems
    Khiati, Rhaed
    Hanif, Muhammed
    Lee, Choonhwa
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 467 - 471
  • [26] Neural Parallel Engine: A toolbox for massively parallel neural signal processing
    Tam, Wing-kin
    Yang, Zhi
    JOURNAL OF NEUROSCIENCE METHODS, 2018, 301 : 18 - 33
  • [27] Massively parallel heterogeneous VLSI architecture for MSIMD processing
    Nudd, G.R.
    Kerbyson, D.J.
    Atherton, T.J.
    Francis, N.D.
    Packwood, R.A.
    Vaudin, G.J.B.
    International Workshop on Algorithms and Parallel VLSI Architectures, 1991,
  • [28] A model of massively parallel call and service processing in telecommunications
    Sinkovic, V
    Lovrek, I
    JOURNAL OF SYSTEMS ARCHITECTURE, 1997, 43 (6-7) : 479 - 490
  • [29] Massively Parallel Multi-Versioned Transaction Processing
    Qian, Shujian
    Goel, Ashvin
    PROCEEDINGS OF THE 18TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2024, 2024, : 765 - 781
  • [30] Massively Parallel Processing of Signals in Dense Microphone Arrays
    Said, Amir
    Kalker, Ton
    Lee, Bowon
    Fozunbal, Majid
    2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 3080 - 3083