Implementation and visualization of a netflow log data lake system for cyberattack detection using distributed deep learning

被引:0
|
作者
Wen-Chung Shih
Chao-Tung Yang
Cheng-Tian Jiang
Endah Kristiani
机构
[1] Asia University,Department of M
[2] Tunghai University,Commerce and Multimedia Applications
[3] Tunghai University,Department of Computer Science
[4] Krida Wacana Christian University,Research Center for Smart Sustainable Circular Economy
[5] iAmbition Technology Inc.,Department of Informatics
来源
关键词
Data lake; Distributed deep learning; NetFlow analysis; Cyberattack detection; Cloudera cluster; Big data; DNN;
D O I
暂无
中图分类号
学科分类号
摘要
Big data and artificial intelligence (AI) technology are complicated systems that will continue developing in recent years. This paper implemented a data lake architecture to handle massive data and perform data analysis in a real-time system. Using a data lake and AI model, a NetFlow storage monitoring system was deployed to perform a platform that can cover the storage, query, analysis, and visualization of massive volumes of data. The big data platform was built on Cloudera, which utilized big data tools like Kafka, Spark, HBase, Hive, and Impala. In addition, we used Spark to develop network threat recognition models using distributed deep learning. Also, we used the deep neural network (DNN) to train the model. Then, we evaluated the model performance, which reached 94% accuracy while decreasing by 48% of training time. The results of the studies demonstrate that deep learning model training time is significantly shortened. Additionally, this system employs several configurations to assess the elements influencing accuracy and performance. The model is evaluated using the confusion matrix to demonstrate that it can accurately detect attack behavior in log data. Furthermore, we have developed a real-time log data monitoring and analysis system to demonstrate the proposed architecture.
引用
收藏
页码:4983 / 5012
页数:29
相关论文
共 50 条
  • [41] Automatic landslide detection and visualization by using deep ensemble learning method
    Hacıefendioğlu K.
    Varol N.
    Toğan V.
    Bahadır Ü.
    Kartal M.E.
    Neural Computing and Applications, 2024, 36 (18) : 10761 - 10776
  • [42] Big data analysis and distributed deep learning for next-generation intrusion detection system optimization
    Khloud Al Jallad
    Mohamad Aljnidi
    Mohammad Said Desouki
    Journal of Big Data, 6
  • [43] Big data analysis and distributed deep learning for next-generation intrusion detection system optimization
    Al Jallad, Khloud
    Aljnidi, Mohamad
    Desouki, Mohammad Said
    JOURNAL OF BIG DATA, 2019, 6 (01)
  • [44] Interactive system using LDA for exploratory visualization to extract data association in a data lake
    Yamada, Takaki
    Maekawa, Yuki
    Kato, Yuko
    Tomiyama, Tomoe
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 172 - 177
  • [45] Cyber Intrusion Prediction and Taxonomy System Using Deep Learning And Distributed Big Data Processing
    Al Najada, Hamzah
    Mahgoub, Imad
    Mohammed, Imran
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 631 - 638
  • [46] Distributed Raman Spectrum Data Augmentation System Using Federated Learning with Deep Generative Models
    Kim, Yaeran
    Lee, Woonghee
    SENSORS, 2022, 22 (24)
  • [47] Implementation of an Intelligent Video Detection System using Deep Learning in the Manufacturing Process of Tungsten Hexafluoride
    Son, Seung-Yong
    Kim, Young Mok
    Choi, Doo-Hyun
    KOREAN JOURNAL OF MATERIALS RESEARCH, 2021, 31 (12): : 719 - 726
  • [48] Design and implementation of visualization abnormal detection system for agricultural sensor data stream
    Shi, Xiaochen
    Cai, Saihua
    Li, Sicong
    Sun, Ruizhi
    International Agricultural Engineering Journal, 2020, 29 (01): : 418 - 427
  • [49] Small Target Detection for Search and Rescue Operations using Distributed Deep Learning and Synthetic Data Generation
    Yun, Kyongsik
    Luan Nguyen
    Tuan Nguyen
    Kim, Doyoung
    Eldin, Sarah
    Huyen, Alexander
    Lu, Thomas
    Chow, Edward
    PATTERN RECOGNITION AND TRACKING XXX, 2019, 10995
  • [50] Distributed deep learning system for cancerous region detection on Sunway TaihuLight
    GuoFeng Lv
    MingFan Li
    Hong An
    Han Lin
    Junshi Chen
    Wenting Han
    Qian Xiao
    Fei Wang
    Rongfen Lin
    CCF Transactions on High Performance Computing, 2020, 2 : 348 - 361