Comprehensive Analysis of Network Traffic Data

被引:6
|
作者
Miao, Yuantian [1 ]
Ruan, Zichan [1 ]
Pan, Lei [1 ]
Zhang, Jun [1 ]
Xiang, Yang [1 ]
Wang, Yu [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia
基金
中国国家自然科学基金;
关键词
Network Traffic Classification; Machine Learning; Data Analytics; Performance; FEATURE-SELECTION; ALGORITHMS; PCA;
D O I
10.1109/CIT.2016.22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the large volume of network traffic flow, it is necessary to preprocess raw data before classification to gain the accurate results speedily. Feature selection is an essential approach in preprocessing phase. The Principal Component Analysis (PCA) is recognized as an effective and efficient method. In this paper, we classify network traffic by using the PCA technique together with six machine learning algorithms Naive Bayes, Decision Tree, 1-Nearest Neighbor (NN), Random Forest, Support Vector Machine (SVM) and H2O. We analyze the impact of PCA through classifying the data set by each algorithm with and without PCA. Experiments are set out by varying the size of input data sets, and the performances are measured from two metrics including overall accuracy and F-measure. The computational time is also considered in analysis phase. Our results show that Random Forest and NN are the top two algorithms among the six. Specifically, both of them behave well in classification under the most cases of input sets regardless of applying PCA. Lastly, PCA significantly boosts NN algorithms in terms of classification accuracy and shortens the classification time for Random Forest.
引用
收藏
页码:423 / 430
页数:8
相关论文
共 50 条
  • [1] Comprehensive analysis of network traffic data
    Miao, Yuantian
    Ruan, Zichan
    Pan, Lei
    Zhang, Jun
    Xiang, Yang
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (05):
  • [2] NetCube: a comprehensive network traffic analysis model based on multidimensional OLAP data cube
    Park, Daihee
    Yu, Jaehak
    Park, Jun-Sang
    Kim, Myung-Sup
    [J]. INTERNATIONAL JOURNAL OF NETWORK MANAGEMENT, 2013, 23 (02) : 101 - 118
  • [3] Complex-network-based traffic network analysis and dynamics: A comprehensive review
    Zhang, Mengyao
    Huang, Tao
    Guo, Zhaoxia
    He, Zhenggang
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 607
  • [4] Network Traffic Analysis of Cloud Data Centre
    Sankari, Subbiah
    Varalakshmi, Perumal
    Divya, Boopathi
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATIONS TECHNOLOGIES (ICCCT 15), 2015, : 408 - 413
  • [5] ANALYSIS OF A TDMA NETWORK WITH VOICE AND DATA TRAFFIC
    HONIG, ML
    [J]. AT&T BELL LABORATORIES TECHNICAL JOURNAL, 1984, 63 (08): : 1537 - 1563
  • [6] Analysis of the data of network traffic by dataware technique
    Tang, Hong
    Wu, Yongjun
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2003, 31 (11):
  • [7] Network Traffic Data Analysis Based on DGX
    Zou, Dan
    Liu, Jun
    Yan, Qing
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1199 - 1203
  • [8] Data Traffic Analysis of Utility Smart Metering Network
    Luan, Wenpeng
    Sharp, Duncan
    LaRoy, Stephen
    [J]. 2013 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PES), 2013,
  • [9] Infinitely Divisible Cascade analysis of network traffic data
    Veitch, D
    Abry, P
    Flandrin, P
    Chainais, P
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 245 - 248
  • [10] A Survey on Big Data for Network Traffic Monitoring and Analysis
    D'Alconzo, Alessandro
    Drago, Idilio
    Morichetta, Andrea
    Mellia, Marco
    Casas, Pedro
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (03): : 800 - 813