On the performance and convergence of distributed stream processing via approximate fault tolerance

被引:0
|
作者
Zhinan Cheng
Qun Huang
Patrick P. C. Lee
机构
[1] The Chinese University of Hong Kong,Department of Computer Science and Engineering
[2] Chinese Academy of Sciences,State Key Laboratory of Computer Architecture, Institute of Computing Technology
[3] University of Chinese Academy of Sciences,undefined
来源
The VLDB Journal | 2019年 / 28卷
关键词
Distributed stream processing; Approximate fault tolerance; Online learning;
D O I
暂无
中图分类号
学科分类号
摘要
Fault tolerance is critical for distributed stream processing systems, yet achieving error-free fault tolerance often incurs substantial performance overhead. We present AF-Stream, a distributed stream processing system that addresses the trade-off between performance and accuracy in fault tolerance. AF-Stream builds on a notion called approximate fault tolerance, whose idea is to mitigate backup overhead by adaptively issuing backups, while ensuring that the errors upon failures are bounded with theoretical guarantees. Specifically, AF-Stream allows users to specify bounds on both the state divergence and the loss of non-backup streaming items. It issues state and item backups only when the bounds are reached. Our AF-Stream design provides an extensible programming model for incorporating general streaming algorithms as well as exports only few threshold parameters for configuring approximation fault tolerance. Furthermore, we formally prove that AF-Stream preserves high algorithm-specific accuracy of streaming algorithms, and in particular the convergence guarantees of online learning. Experiments show that AF-Stream maintains high performance (compared to no fault tolerance) and high accuracy after multiple failures (compared to no failures) under various streaming algorithms.
引用
收藏
页码:821 / 846
页数:25
相关论文
共 50 条
  • [1] On the performance and convergence of distributed stream processing via approximate fault tolerance
    Cheng, Zhinan
    Huang, Qun
    Lee, Patrick P. C.
    [J]. VLDB JOURNAL, 2019, 28 (05): : 821 - 846
  • [2] Toward High-Performance Distributed Stream Processing via Approximate Fault Tolerance
    Huang, Qun
    Lee, Patrick P. C.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 10 (03): : 73 - 84
  • [3] Approximate Fault Tolerance for Edge Stream Processing
    Takao, Daiki
    Sugiura, Kento
    Ishikawa, Yoshiharu
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS - DEXA 2021 WORKSHOPS, 2021, 1479 : 173 - 183
  • [4] Approximate Fault Tolerance for Sensor Stream Processing
    Takao, Daiki
    Sugiura, Kento
    Ishikawa, Yoshiharu
    [J]. DATABASES THEORY AND APPLICATIONS, ADC 2020, 2020, 12008 : 55 - 67
  • [5] Fault-tolerance in the borealis distributed stream processing system
    Balazinska, Magdalena
    Balakrishnan, Hari
    Madden, Samuel R.
    Stonebraker, Michael
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2008, 33 (01):
  • [6] Fault-Tolerance Implementation in Typical Distributed Stream Processing Systems
    Chen, Wuhong
    Tsai, Jichiang
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (04) : 1167 - 1186
  • [7] Towards reliability and fault-tolerance of distributed stream processing system
    Gorawski, Marcin
    Marks, Pawel
    [J]. DEPCOS - RELCOMEX '07: INTERNATIONAL CONFERENCE ON DEPENDABILITY OF COMPUTER SYSTEMS, PROCEEDINGS, 2007, : 246 - +
  • [8] Integrating workload balancing and fault tolerance in distributed stream processing system
    Junhua Fang
    Pingfu Chao
    Rong Zhang
    Xiaofang Zhou
    [J]. World Wide Web, 2019, 22 : 2471 - 2496
  • [9] Integrating workload balancing and fault tolerance in distributed stream processing system
    Fang, Junhua
    Chao, Pingfu
    Zhang, Rong
    Zhou, Xiaofang
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (06): : 2471 - 2496
  • [10] Reducing Fault-tolerant Overhead for Distributed Stream Processing with Approximate Backup
    Zhuang, Yuan
    Wei, Xiaohui
    Li, Hongliang
    Hou, Mingkai
    Wang, Yundi
    [J]. 2020 29TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2020), 2020,