An SVM-based machine learning method for accurate internet traffic classification

被引:177
|
作者
Yuan, Ruixi [3 ]
Li, Zhu [3 ]
Guan, Xiaohong [1 ,2 ,3 ]
Xu, Li [4 ,5 ]
机构
[1] Xi An Jiao Tong Univ, MOE KLINNS Lab, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, SKLMS Lab, Xian 710049, Peoples R China
[3] Tsinghua Univ, Ctr Intelligent & Networked Syst, TNLIST Lab, Beijing 100084, Peoples R China
[4] Beijing Jiaotong Univ, Coll Econ & Management, Beijing 100044, Peoples R China
[5] Old Dominion Univ, Dept Informat Technol & Decis Sci, Norfolk, VA 23529 USA
关键词
Internet traffic; Network traffic classification; Machine learning; Feature selection; SVM; SUPPORT VECTOR MACHINES; FEATURE-SELECTION; SPECIAL-ISSUE; SYSTEM; CHINA;
D O I
10.1007/s10796-008-9131-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate and timely traffic classification is critical in network security monitoring and traffic engineering. Traditional methods based on port numbers and protocols have proven to be ineffective in terms of dynamic port allocation and packet encapsulation. The signature matching methods, on the other hand, require a known signature set and processing of packet payload, can only handle the signatures of a limited number of IP packets in real-time. A machine learning method based on SVM (supporting vector machine) is proposed in this paper for accurate Internet traffic classification. The method classifies the Internet traffic into broad application categories according to the network flow parameters obtained from the packet headers. An optimized feature set is obtained via multiple classifier selection methods. Experimental results using traffic from campus backbone show that an accuracy of 99.42% is achieved with the regular biased training and testing samples. An accuracy of 97.17% is achieved when un-biased training and testing samples are used with the same feature set. Furthermore, as all the feature parameters are computable from the packet headers, the proposed method is also applicable to encrypted network traffic.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 50 条
  • [41] SVM-based Colour Classification of Dyeing Products
    Zhang, Jian-Xin
    Chang, Wei
    Wu, Lang
    TEXTILE BIOENGINEERING AND INFORMATICS SYMPOSIUM PROCEEDINGS, VOLS 1-3, 2011, : 1310 - 1314
  • [42] A Survey of Techniques for Internet Traffic Classification using Machine Learning
    Nguyen, Thuy T. T.
    Armitage, Grenville
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2008, 10 (04): : 56 - 76
  • [43] Associative classification using SVM-based discretization
    Park, Cheong Hee
    Lee, Moonhwi
    CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 171 - 175
  • [44] Scalable SVM-based Classification in Dynamic Graphs
    Yao, Yibo
    Holder, Lawrence
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 650 - 659
  • [45] Hardware Trojan Detection Combine with Machine Learning: an SVM-based Detection Approach
    Hu, Taifeng
    Wu, Liji
    Zhang, Xiangmin
    Yin, Yanzhao
    Yang, Yijun
    PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 202 - 206
  • [46] A multilingual SVM-based question classification system
    Bisbal, E
    Tomás, D
    Moreno, L
    Vicedo, JL
    Suárez, A
    MICAI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3789 : 806 - 815
  • [47] An application of SVM-based Classification in Landslide Stability
    Jiang, Tingyao
    Lei, Peng
    Qin, Qin
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2016, 22 (02): : 267 - 271
  • [48] Combined SVM-Based Feature Selection and Classification
    Julia Neumann
    Christoph Schnörr
    Gabriele Steidl
    Machine Learning, 2005, 61 : 129 - 150
  • [49] SVM-Based Classification of Digital Modulation Signals
    Tabatabaei, Talieh S.
    Krishnan, Sridhar
    Anpalagan, Alagan
    2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [50] SVM-based Reliability Analysis Method
    Li Wei
    Yu Xiaolin
    PROCEEDING OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES, 2009, : 584 - 588