A comprehensive study on fault tolerance in stream processing systems

Cited by: 4
Authors
Wang, Xiaotong [1]
Zhang, Chunxi [1]
Fang, Junhua [2]
Zhang, Rong [1]
Qian, Weining [1]
Zhou, Aoying [1]
Affiliations
[1] East China Normal Univ, Sch Data Sci & Engn, Shanghai 200062, Peoples R China
[2] Soochow Univ, Adv Data Analyt Lab, Suzhou 215006, Peoples R China
Keywords
fault tolerance; performance evaluation; stream processing; MODEL;
DOI
10.1007/s11704-020-0248-x
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Stream processing has emerged as a useful technology for applications that require continuous, low-latency computation on infinite streaming data. Because stream processing systems (SPSs) are usually deployed across clusters of servers to cope with large-scale data, failures of processing nodes or communication networks are especially common, and they must be handled carefully to preserve service quality. A failed system may produce wrong results or become unavailable, degrading the user experience or even causing significant financial loss. Hence, a large number of fault tolerance approaches have been proposed for SPSs. These approaches often prioritize specific performance concerns, e.g., runtime overhead and recovery efficiency. Nevertheless, a systematic overview and classification of state-of-the-art fault tolerance approaches in SPSs is lacking, which hinders the development of SPSs. Therefore, we survey existing work and develop a taxonomy of fault tolerance in SPSs. Furthermore, we propose an evaluation framework tailored for fault tolerance, present experimental results on two representative open-source SPSs, and discuss possible shortcomings in current designs. Finally, we outline future research directions in this domain.
Pages: 18
Related papers
50 in total
  • [1] A comprehensive study on fault tolerance in stream processing systems
    Xiaotong WANG
    Chunxi ZHANG
    Junhua FANG
    Rong ZHANG
    Weining QIAN
    Aoying ZHOU
    [J]. Frontiers of Computer Science, 2022, 16 (02) - 97
  • [2] A comprehensive study on fault tolerance in stream processing systems
    Xiaotong Wang
    Chunxi Zhang
    Junhua Fang
    Rong Zhang
    Weining Qian
    Aoying Zhou
    [J]. Frontiers of Computer Science, 2022, 16
  • [3] Pathfinder: Fault Tolerance for Stream Processing Systems
    Knasmuller, Bernhard
    Hochreiner, Christoph
    Schulte, Stefan
    [J]. 2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 29 - 39
  • [4] Fault-Tolerance Implementation in Typical Distributed Stream Processing Systems
    Chen, Wuhong
    Tsai, Jichiang
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (04) : 1167 - 1186
  • [5] Approximate Fault Tolerance for Sensor Stream Processing
    Takao, Daiki
    Sugiura, Kento
    Ishikawa, Yoshiharu
    [J]. DATABASES THEORY AND APPLICATIONS, ADC 2020, 2020, 12008 : 55 - 67
  • [6] Approximate Fault Tolerance for Edge Stream Processing
    Takao, Daiki
    Sugiura, Kento
    Ishikawa, Yoshiharu
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS - DEXA 2021 WORKSHOPS, 2021, 1479 : 173 - 183
  • [7] Cost of Fault-Tolerance on Data Stream Processing
    Vianello, Valerio
    Patino-Martinez, Marta
    Azqueta-Alzuar, Ainhoa
    Jimenez-Peris, Ricardo
    [J]. EURO-PAR 2018: PARALLEL PROCESSING WORKSHOPS, 2019, 11339 : 17 - 27
  • [8] FATM: A failure-aware adaptive fault tolerance model for distributed stream processing systems
    Akber, Syed Muhammad Abrar
    Chen, Hanhua
    Jin, Hai
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (10):
  • [9] User-Constraint and Self-Adaptive Fault Tolerance for Event Stream Processing Systems
    Martin, Andre
    Smaneoto, Tiaraju
    Dietze, Tobias
    Brito, Andrey
    Fetzer, Christof
    [J]. 2015 45TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, 2015, : 462 - 473
  • [10] Fault-tolerance in the Borealis distributed stream processing system
    Balazinska, Magdalena
    Balakrishnan, Hari
    Madden, Samuel R.
    Stonebraker, Michael
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2008, 33 (01):