Feature selection for optimizing traffic classification

被引:95
|
作者
Zhang, Hongli [1 ]
Lu, Gang [1 ]
Qassrawi, Mahmoud T. [1 ]
Zhang, Yu [1 ]
Yu, Xiangzhan [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Traffic classification; Class imbalance; Robust features; IDENTIFICATION;
D O I
10.1016/j.comcom.2012.04.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) algorithms have been widely applied in recent traffic classification. However, due to the imbalance in the number of traffic flows, ML based classifiers are prone to misclassify flows as the traffic type that occupies the majority of flows on the Internet. To address the problem, a novel feature selection metric named Weighted Symmetrical Uncertainty (WSU) is proposed. We design a hybrid feature selection algorithm named WSU_AUC, which prefilters most of features with WSU metric and further uses a wrapper method to select features for a specific classifier with Area Under roc Curve (AUC) metric. Additionally, to overcome the impacts of dynamic traffic flows on feature selection, we propose an algorithm named SRSF that Selects the Robust and Stable Features from the results achieved by WSU_AUC. We evaluate our approaches using three classifiers on the traces captured from entirely different networks. Experimental results obtained by our algorithms are promising in terms of true positive rate (TPR) and false positive rate (FPR). Moreover, our algorithms can achieve >94% flow accuracy and >80% byte accuracy on average. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1457 / 1471
页数:15
相关论文
共 50 条
  • [1] Optimizing Feature Selection for Efficient Encrypted Traffic Classification: A Systematic Approach
    Shen, Meng
    Liu, Yiting
    Zhu, Liehuang
    Xu, Ke
    Du, Xiaojiang
    Guizani, Nadra
    [J]. IEEE NETWORK, 2020, 34 (04): : 20 - 27
  • [2] A stable feature selection approach for optimizing traffic classification based on adaptive threshold
    Duan, Wenbei
    Wang, Yuanli
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS, NETWORK AND COMPUTER ENGINEERING (ICENCE 2016), 2016, 67 : 827 - 832
  • [3] Optimizing feature selection techniques for sentiment classification
    Uribe, Diego
    [J]. 2011 IEEE ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE (CERMA 2011), 2011, : 103 - 107
  • [4] Evaluation of feature selection on network traffic classification
    Wang, Yun
    Wang, Pan
    Wang, ZiXuan
    Wu, KaiLin
    [J]. 2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 813 - 818
  • [5] Optimizing IP flow classification using feature selection
    Lei, Dai
    You, Chen
    Yun Xiaochun
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2007, : 39 - +
  • [6] Feature Selection Toward Optimizing Internet Traffic Behavior Identification
    Chen, Zhenxiang
    Peng, Lizhi
    Zhao, Shupeng
    Zhang, Lei
    Jing, Shan
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT II, 2014, 8631 : 631 - 644
  • [7] Efficient and robust feature extraction and selection for traffic classification
    Shi, Hongtao
    Li, Hongping
    Zhang, Dan
    Cheng, Chaqiu
    Wu, Wei
    [J]. COMPUTER NETWORKS, 2017, 119 : 1 - 16
  • [8] A Survey on Feature Selection Techniques for Internet Traffic Classification
    Dhote, Yogesh
    Agrawal, Shikha
    Deen, Anjana Jayant
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 1375 - 1380
  • [9] Balanced feature selection method for Internet traffic classification
    Liu, Z.
    Liu, Q.
    [J]. IET NETWORKS, 2012, 1 (02) : 74 - 83
  • [10] Real-time feature selection in traffic classification
    ZHAO, Jing-jing
    HUANG, Xiao-hong
    SUN, Qiong
    MA, Yan
    [J]. Journal of China Universities of Posts and Telecommunications, 2008, 15 (SUPPL.): : 68 - 72