Research Proposal: Reliability Evaluation of the Apache Kafka Streaming System

被引:8
|
作者
Wu, Han [1 ]
机构
[1] Free Univ Berlin, Inst Informat, Berlin, Germany
关键词
Stream processing; Reliability; Apache Kafka;
D O I
10.1109/ISSREW.2019.00055
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Apache Kafka is a distributed messaging system with high throughput, high scalability and low latency. It has been widely adopted in enterprise and due to its widespread integration into enterprise-level infrastructures, the research on the reliability of Kafka consumers has become an increasingly important issue. The application scenarios vary from tracking user profiles on a website, server log monitoring, to online bank transfer and online reservation. The main purpose of this research is to evaluate the reliability of Kafka in different application scenarios. Kafka is highly configurable and provides many options to manage reliability strategies. In this research we test the impacts of an kinds of configuration parameters on the reliability of Kafka, including retry strategies and replications of partitions for fault tolerance. The tradeoffs between performance and reliability is another portion of our research, which help users of Kafka using it in an appropriate way.
引用
收藏
页码:112 / 113
页数:2
相关论文
共 50 条
  • [1] Performance Evaluation of Intrusion Detection Streaming Transactions Using Apache Kafka and Spark Streaming
    Tun, May Thet
    Nyaung, Dim En
    Phyu, Myat Pwint
    [J]. 2019 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGIES (ICAIT), 2019, : 25 - 30
  • [2] A Performance Evaluation of Apache Kafka in Support of Big Data Streaming Applications
    Le Noac'h, Paul
    Costan, Alexandru
    Bouge, Luc
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4803 - 4806
  • [3] A Survey on Networked Data Streaming With Apache Kafka
    Raptis, Theofanis P.
    Passarella, Andrea
    [J]. IEEE ACCESS, 2023, 11 : 85333 - 85350
  • [4] Learning to Reliably Deliver Streaming Data with Apache Kafka
    Wu, Han
    Shang, Zhihao
    Wolter, Katinka
    [J]. 2020 50TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2020), 2020, : 564 - 571
  • [5] Efficient topic partitioning of Apache Kafka for high-reliability real-time data streaming applications
    Raptis, Theofanis P.
    Cicconetti, Claudio
    Passarella, Andrea
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 173 - 188
  • [6] Building a Replicated Logging System with Apache Kafka
    Wang, Guozhang
    Koshy, Joel
    Subramanian, Sriram
    Paramasivam, Kartik
    Zadeh, Mammad
    Narkhede, Neha
    Rao, Jun
    Kreps, Jay
    Stein, Joe
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (12): : 1654 - 1655
  • [7] TRAK: A Testing Tool for Studying the Reliability of Data Delivery in Apache Kafka
    Wu, Han
    Shang, Zhihao
    Wolter, Katinka
    [J]. 2019 IEEE 30TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2019), 2019, : 394 - 397
  • [8] Design and Development of A Cloud-Based IDS using Apache Kafka and Spark Streaming
    Wirz, Leon
    Tanthanathewin, Rinrada
    Ketphet, Asipan
    Fugkeaw, Somchart
    [J]. 2022 19TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2022), 2022,
  • [9] Monitoring Framework for the Performance Evaluation of an IoT Platform with Elasticsearch and Apache Kafka
    Calderon, Gonzalo
    del Campo, Guillermo
    Saavedra, Edgar
    Santamaria, Asuncion
    [J]. INFORMATION SYSTEMS FRONTIERS, 2023,
  • [10] Automated script-based engine for Apache Kafka messaging system
    University of Craiova, Faculty of Automation, Computers and Electronics, Department of Computers and Information Technology, Craiova, Romania
    [J]. Proc. RoEduNet IEEE Int. Conf.,