Network Traffic Data Collection for Machine Learning Analysis

Cited by: 0
Authors
Chao, James [1]
Rodriguez, Ramiro [1]
Affiliation
[1] Naval Information Warfare Center Pacific, San Diego, CA 53560 USA
Source
Keywords
network traffic classification; machine learning; data collection
DOI
10.1117/12.2664375
Chinese Library Classification number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract
Network traffic has increased substantially due to the introduction of advanced network-enabled applications and devices. Software-defined networks (SDNs) and machine learning (ML) have made it possible to optimize network operations and traffic monitoring, improving the handling of complex traffic and strengthening security through faster detection of malicious intent. This paper focuses on network traffic data collection systems; the collected data is evaluated with a survey of ML algorithms chosen according to the data type (tabular or image). The system adheres to architecture best practices, including a decoupled design that integrates with existing network monitoring infrastructures and cybersecurity standards, and supports online and offline data collection via packet capture (PCAP) standards. For packet-based network traffic analysis, we convert captured data into images and feed them into a convolutional neural network that classifies the data based on requirements. For statistics-based network traffic analysis, we apply feature engineering to tabular data and feed it into various ML systems that classify it based on requirements. Finally, we show that the same ML algorithm performs better on data gathered with our collection method than on publicly available datasets.
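The two preprocessing paths named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 28x28 image size, the zero-padding rule, and the chosen statistics are all assumptions made for illustration. The first function maps raw packet bytes to a grayscale pixel grid, as is commonly done before feeding traffic to a CNN; the second derives a few tabular features from one flow.

```python
from statistics import mean, pstdev


def packet_to_image(payload: bytes, side: int = 28) -> list[list[int]]:
    """Map raw packet bytes to a side x side grid of 0-255 pixel values.

    Assumption: payloads longer than side*side bytes are truncated,
    shorter ones are zero-padded. The source does not state these choices.
    """
    n = side * side
    padded = payload[:n].ljust(n, b"\x00")
    return [list(padded[r * side:(r + 1) * side]) for r in range(side)]


def flow_features(sizes: list[int], times: list[float]) -> dict[str, float]:
    """Compute simple per-flow statistics (hypothetical feature set).

    sizes: packet sizes in bytes; times: arrival timestamps in seconds.
    """
    gaps = [b - a for a, b in zip(times, times[1:])]
    return {
        "packet_count": float(len(sizes)),
        "total_bytes": float(sum(sizes)),
        "mean_size": float(mean(sizes)),
        "std_size": float(pstdev(sizes)),
        "mean_gap": float(mean(gaps)) if gaps else 0.0,
    }
```

In practice the payload bytes would come from a PCAP reader and the resulting grids or feature rows would be passed to the classifier; both of those steps are outside this sketch.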
Pages: 10
Related papers
  • [1] Machine Learning in Software Defined Networks: Data Collection and Traffic Classification
    Amaral, Pedro
    Dinis, Joao
    Pinto, Paulo
    Bernardo, Luis
    Tavares, Joao
    Mamede, Henrique S.
    2016 IEEE 24TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2016
  • [2] Traffic Refinery: Cost-Aware Data Representation for Machine Learning on Network Traffic
    Bronzino, Francesco
    Schmitt, Paul
    Ayoubi, Sara
    Kim, Hyojoon
    Teixeira, Renata
    Feamster, Nick
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2021, 5 (03)
  • [3] Encrypted Network Traffic Analysis and Classification Utilizing Machine Learning
    Alwhbi, Ibrahim A.
    Zou, Cliff C.
    Alharbi, Reem N.
    SENSORS, 2024, 24 (11)
  • [4] Data set and machine learning models for the classification of network traffic originators
    Canavese, Daniele
    Regano, Leonardo
    Basile, Cataldo
    Ciravegna, Gabriele
    Lioy, Antonio
    DATA IN BRIEF, 2022, 41
  • [5] A traffic data collection and analysis method based on wireless sensor network
    Wang, Huan
    Ouyang, Min
    Meng, Qingyuan
    Kong, Qian
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2020, 2020 (01)
  • [6] The Capacity of the Road Network: Data Collection and Statistical Analysis of Traffic Characteristics
    Shepelev, Vladimir
    Aliukov, Sergei
    Nikolskaya, Kseniya
    Shabiev, Salavat
    ENERGIES, 2020, 13 (07)
  • [7] A traffic data collection and analysis method based on wireless sensor network
    Huan Wang
    Min Ouyang
    Qingyuan Meng
    Qian Kong
    EURASIP Journal on Wireless Communications and Networking, 2020
  • [8] In Vitro Data Collection Using Image Analysis and Machine Learning
    Niedz, R. P.
    IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-ANIMAL, 2020, 56 (01) : S17 - S17
  • [9] Optimizing Data Collection for Machine Learning
    Mahmood, Rafid
    Lucas, James
    Alvarez, Jose M.
    Fidler, Sanja
    Law, Marc T.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [10] Classifier Selection for an Ensemble of Network Traffic Analysis Machine Learning Models
    Roponena, Evita
    Polaka, Inese
    2022 63RD INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE OF RIGA TECHNICAL UNIVERSITY (ITMS), 2022