An SVM-based machine learning method for accurate internet traffic classification

被引:177
|
作者
Yuan, Ruixi [3 ]
Li, Zhu [3 ]
Guan, Xiaohong [1 ,2 ,3 ]
Xu, Li [4 ,5 ]
机构
[1] Xi An Jiao Tong Univ, MOE KLINNS Lab, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, SKLMS Lab, Xian 710049, Peoples R China
[3] Tsinghua Univ, Ctr Intelligent & Networked Syst, TNLIST Lab, Beijing 100084, Peoples R China
[4] Beijing Jiaotong Univ, Coll Econ & Management, Beijing 100044, Peoples R China
[5] Old Dominion Univ, Dept Informat Technol & Decis Sci, Norfolk, VA 23529 USA
关键词
Internet traffic; Network traffic classification; Machine learning; Feature selection; SVM; SUPPORT VECTOR MACHINES; FEATURE-SELECTION; SPECIAL-ISSUE; SYSTEM; CHINA;
D O I
10.1007/s10796-008-9131-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate and timely traffic classification is critical in network security monitoring and traffic engineering. Traditional methods based on port numbers and protocols have proven to be ineffective in terms of dynamic port allocation and packet encapsulation. The signature matching methods, on the other hand, require a known signature set and processing of packet payload, can only handle the signatures of a limited number of IP packets in real-time. A machine learning method based on SVM (supporting vector machine) is proposed in this paper for accurate Internet traffic classification. The method classifies the Internet traffic into broad application categories according to the network flow parameters obtained from the packet headers. An optimized feature set is obtained via multiple classifier selection methods. Experimental results using traffic from campus backbone show that an accuracy of 99.42% is achieved with the regular biased training and testing samples. An accuracy of 97.17% is achieved when un-biased training and testing samples are used with the same feature set. Furthermore, as all the feature parameters are computable from the packet headers, the proposed method is also applicable to encrypted network traffic.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 50 条
  • [21] Using GMM and SVM-based Techniques for the Classification of SSH-Encrypted Traffic
    Dusi, Maurizio
    Este, Alice
    Gringoli, Francesco
    Salgarelli, Luca
    2009 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-8, 2009, : 702 - 707
  • [22] SVM-Based Classification of Moving Objects
    Li, Zhanchuang
    Jiang, Jianmin
    Xiao, Guoqiang
    MULTIMEDIA, COMPUTER GRAPHICS AND BROADCASTING, PROCEEDINGS, 2009, 60 : 37 - 45
  • [23] An SVM-based machine learning method for the separation of alteration zones in Sungun porphyry copper deposit
    Abbaszadeh, Maliheh
    Hezarkhani, Ardeshir
    Soltani-Mohammadi, Saeed
    CHEMIE DER ERDE-GEOCHEMISTRY, 2013, 73 (04) : 545 - 554
  • [24] Water Level Prediction of Rainwater Pipe Network Using an SVM-Based Machine Learning Method
    Wang, Hao
    Song, Lixiang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (02)
  • [25] SVM-based audio scene classification
    Jiang, HC
    Bai, JM
    Zhang, SW
    Xu, B
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 131 - 136
  • [26] Research of SVM-based document classification
    Zhang, ZhenNan
    Xu, Qian
    Cui, Junbo
    Pu, Duan
    Information, Management and Algorithms, Vol II, 2007, : 260 - 263
  • [27] SVM-based analysis and prediction on network traffic
    Luo, Weidong
    Liu, Xingwei
    Zhang, Jian
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
  • [28] SVM-Based Models for Predicting WLAN Traffic
    Feng, Huifang
    Shu, Yantai
    Wang, Shuyi
    Ma, Maode
    2006 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-12, 2006, : 597 - 602
  • [29] Enhanced Android Malware Detection: An SVM-based Machine Learning Approach
    Han, Hyoil
    Lim, SeungJin
    Suh, Kyoungwon
    Park, Seonghyun
    Cho, Seong-je
    Park, Minkyu
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 75 - 81
  • [30] EHPred: an SVM-based method for epoxide hydrolases recognition and classification.
    Jia J.
    Yang L.
    Zhang Z.Z.
    Journal of Zhejiang University SCIENCE B, 2006, 7 (1): : 1 - 6