Big Data Network Flow Processing Using Apache Spark

Cited by: 0
Authors
Jerabek, Kamil [1]
Rysavy, Ondrej [1]
Affiliations
[1] Brno Univ Technol, Brno, Czech Republic
Keywords
Big Data; Network flows; Apache Spark; Cassandra; Apache Ignite
DOI
10.1145/3352700.3352709
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The growing volume of traffic flows captured during network monitoring makes their analysis more complicated. One of the goals of network traffic analysis is to identify malicious communication. In this paper, we present a new system for big data network flow classification and clustering. The proposed system is based on popular big data engines such as Apache Spark and Apache Ignite. The conducted experiments demonstrate the feasibility of the proposed approach and indicate its potential scalability.
Pages: 9