A Survey on Networked Data Streaming With Apache Kafka

被引:6
|
作者
Raptis, Theofanis P. [1 ]
Passarella, Andrea [1 ]
机构
[1] CNR, Inst Informat & Telemat, I-56124 Pisa, Italy
关键词
Algorithms; cyber-physical; data; Internet of Things; networks; pub-sub; security; stream processing; BIG DATA; DATA-MANAGEMENT; ANALYTICS; MODELS; SYSTEM; ML;
D O I
10.1109/ACCESS.2023.3303810
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Apache Kafka has become a popular solution for managing networked data streaming in a variety of applications, from industrial to general purpose. This paper systematically surveys the research literature in this field by carefully classifying it into key macro areas, namely algorithms, networks, data, cyber-physical systems, and security. Through this meticulous classification, the paper aims to identify and analyze the optimization aspects relevant to each area, drawing upon practical applications as the basis for analysis. In this respect, the paper synthesizes and consolidates existing knowledge, saving researchers valuable time and effort in searching for relevant information across multiple sources. The tangible benefits of this survey paper include providing a consolidated knowledge base about research-intensive Apache Kafka topics, highlighting practical insights and novel approaches, pointing up cross-domain applications, identifying related research challenges, and serving as a trusted reference for the Apache Kafka community.
引用
收藏
页码:85333 / 85350
页数:18
相关论文
共 50 条
  • [1] Learning to Reliably Deliver Streaming Data with Apache Kafka
    Wu, Han
    Shang, Zhihao
    Wolter, Katinka
    [J]. 2020 50TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2020), 2020, : 564 - 571
  • [2] A Performance Evaluation of Apache Kafka in Support of Big Data Streaming Applications
    Le Noac'h, Paul
    Costan, Alexandru
    Bouge, Luc
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4803 - 4806
  • [3] Research Proposal: Reliability Evaluation of the Apache Kafka Streaming System
    Wu, Han
    [J]. 2019 IEEE 30TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2019), 2019, : 112 - 113
  • [4] Big Data Forensics on Apache Kafka
    Mager, Thomas
    [J]. INFORMATION SYSTEMS SECURITY, ICISS 2023, 2023, 14424 : 42 - 56
  • [5] Performance Evaluation of Intrusion Detection Streaming Transactions Using Apache Kafka and Spark Streaming
    Tun, May Thet
    Nyaung, Dim En
    Phyu, Myat Pwint
    [J]. 2019 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGIES (ICAIT), 2019, : 25 - 30
  • [6] Efficient topic partitioning of Apache Kafka for high-reliability real-time data streaming applications
    Raptis, Theofanis P.
    Cicconetti, Claudio
    Passarella, Andrea
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 173 - 188
  • [7] Design and Development of A Cloud-Based IDS using Apache Kafka and Spark Streaming
    Wirz, Leon
    Tanthanathewin, Rinrada
    Ketphet, Asipan
    Fugkeaw, Somchart
    [J]. 2022 19TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2022), 2022,
  • [8] Field of genes: using Apache Kafka as a bioinformatic data repository
    Lawlor, Brendan
    Lynch, Richard
    Mac Aogain, Micheal
    Walsh, Paul
    [J]. GIGASCIENCE, 2018, 7 (04):
  • [9] Dynamically Scaling Apache Storm for the Analysis of Streaming Data
    van der Veen, Jan Sipke
    van der Waaij, Bram
    Lazovik, Elena
    Wijbrandi, Wilco
    Meijer, Robert J.
    [J]. 2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 154 - 161
  • [10] TRAK: A Testing Tool for Studying the Reliability of Data Delivery in Apache Kafka
    Wu, Han
    Shang, Zhihao
    Wolter, Katinka
    [J]. 2019 IEEE 30TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2019), 2019, : 394 - 397