Big Data Forensics on Apache Kafka

Cited by: 0

Authors:

Mager, Thomas

Source:

INFORMATION SYSTEMS SECURITY, ICISS 2023 | 2023 / Vol. 14424

Keywords:

RECOVERY;

DOI:

10.1007/978-3-031-49099-6_3

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Subject classification codes:

081202 ; 0835 ;

Abstract:

There is a growing demand for information exchange in the age of the Internet of Things. One common scenario involves transferring data from distributed devices in the field to central servers or cloud environments. However, little research has been done on the possibilities for forensic investigation of supporting infrastructure such as Apache Kafka, which plays a crucial role in modern big data architectures. In this paper, we present our work on the forensic investigation of Apache Kafka. We use reverse-engineering methods to infer the data formats that Apache Kafka uses on the server side. The results help us implement a new module that can read Apache Kafka log files. An investigator can load the module into the open-source forensic platform "Autopsy". We highlight possibilities and limitations regarding encryption and data retention in Apache Kafka and suggest storing sensitive data in a decentralized manner. As a result of these measures, applications become more resilient to attacks and can provide increased security, ethical standards, and freedom for their users. This can be a unique selling point in future data-driven applications.

Pages: 42-56

Number of pages: 15
