Implementation and visualization of a netflow log data lake system for cyberattack detection using distributed deep learning

被引:0
|
作者
Wen-Chung Shih
Chao-Tung Yang
Cheng-Tian Jiang
Endah Kristiani
机构
[1] Asia University,Department of M
[2] Tunghai University,Commerce and Multimedia Applications
[3] Tunghai University,Department of Computer Science
[4] Krida Wacana Christian University,Research Center for Smart Sustainable Circular Economy
[5] iAmbition Technology Inc.,Department of Informatics
来源
关键词
Data lake; Distributed deep learning; NetFlow analysis; Cyberattack detection; Cloudera cluster; Big data; DNN;
D O I
暂无
中图分类号
学科分类号
摘要
Big data and artificial intelligence (AI) technology are complicated systems that will continue developing in recent years. This paper implemented a data lake architecture to handle massive data and perform data analysis in a real-time system. Using a data lake and AI model, a NetFlow storage monitoring system was deployed to perform a platform that can cover the storage, query, analysis, and visualization of massive volumes of data. The big data platform was built on Cloudera, which utilized big data tools like Kafka, Spark, HBase, Hive, and Impala. In addition, we used Spark to develop network threat recognition models using distributed deep learning. Also, we used the deep neural network (DNN) to train the model. Then, we evaluated the model performance, which reached 94% accuracy while decreasing by 48% of training time. The results of the studies demonstrate that deep learning model training time is significantly shortened. Additionally, this system employs several configurations to assess the elements influencing accuracy and performance. The model is evaluated using the confusion matrix to demonstrate that it can accurately detect attack behavior in log data. Furthermore, we have developed a real-time log data monitoring and analysis system to demonstrate the proposed architecture.
引用
收藏
页码:4983 / 5012
页数:29
相关论文
共 50 条
  • [31] Harnessing feature pruning with optimal deep learning-based distributed denial of service cyberattack detection on IoT environment
    Aboalela, Rania
    Allehaibi, Khalid H.
    Alsaadi, Naif
    Alorfi, Almuhannad S.
    Maghrabi, Louai A.
    Ashary, Ehab Bahaudien
    Alghamdi, Wajdi
    Ragab, Mahmoud
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 120 : 584 - 597
  • [32] Design and Implementation of a Big Data Evaluator Recommendation System Using Deep Learning Methodology
    Cha, Sukil
    Yi, Mun Y.
    Youm, Sekyoung
    APPLIED SCIENCES-BASEL, 2020, 10 (22): : 1 - 13
  • [33] Real-time implementation of IoT-enabled cyberattack detection system in advanced metering infrastructure using machine learning technique
    Naveeda, K.
    Fathima, S. M. H. Sithi Shameem
    ELECTRICAL ENGINEERING, 2025, 107 (01) : 909 - 928
  • [34] Drowsiness detection system using deep learning based data fusion approach
    G. Yogarajan
    R. Nitin Singh
    S. Avudai Nandhu
    R. Mohana Rudhran
    Multimedia Tools and Applications, 2024, 83 : 36081 - 36095
  • [35] A Deep Learning-Based Coyote Detection System Using Audio Data
    Jung, Heesun
    Kwon, Bokyung
    Kim, Youngbin
    Lee, Yejin
    Park, Jihyeon
    Pegg, Griffin
    Wang, Yaqin
    Smith, Anthony H.
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 170 - 175
  • [36] Anomaly Detection from System Tracing Data using Multimodal Deep Learning
    Nedelkoski, Sasho
    Cardoso, Jorge
    Kao, Odej
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 179 - 186
  • [37] Drowsiness detection system using deep learning based data fusion approach
    Yogarajan, G.
    Singh, R. Nitin
    Nandhu, S. Avudai
    Rudhran, R. Mohana
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36081 - 36095
  • [38] Anomaly Detection and Classification using Distributed Tracing and Deep Learning
    Nedelkoski, Sasho
    Cardoso, Jorge
    Kao, Odej
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 241 - 250
  • [39] Scalable malware detection system using big data and distributed machine learning approach
    Manish Kumar
    Soft Computing, 2022, 26 : 3987 - 4003
  • [40] Scalable malware detection system using big data and distributed machine learning approach
    Kumar, Manish
    SOFT COMPUTING, 2022, 26 (08) : 3987 - 4003