Feature selection for optimizing traffic classification

被引:95
|
作者
Zhang, Hongli [1 ]
Lu, Gang [1 ]
Qassrawi, Mahmoud T. [1 ]
Zhang, Yu [1 ]
Yu, Xiangzhan [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Traffic classification; Class imbalance; Robust features; IDENTIFICATION;
D O I
10.1016/j.comcom.2012.04.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) algorithms have been widely applied in recent traffic classification. However, due to the imbalance in the number of traffic flows, ML based classifiers are prone to misclassify flows as the traffic type that occupies the majority of flows on the Internet. To address the problem, a novel feature selection metric named Weighted Symmetrical Uncertainty (WSU) is proposed. We design a hybrid feature selection algorithm named WSU_AUC, which prefilters most of features with WSU metric and further uses a wrapper method to select features for a specific classifier with Area Under roc Curve (AUC) metric. Additionally, to overcome the impacts of dynamic traffic flows on feature selection, we propose an algorithm named SRSF that Selects the Robust and Stable Features from the results achieved by WSU_AUC. We evaluate our approaches using three classifiers on the traces captured from entirely different networks. Experimental results obtained by our algorithms are promising in terms of true positive rate (TPR) and false positive rate (FPR). Moreover, our algorithms can achieve >94% flow accuracy and >80% byte accuracy on average. (c) 2012 Elsevier B.V. All rights reserved.
引用
下载
收藏
页码:1457 / 1471
页数:15
相关论文
共 50 条
  • [21] An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification
    Shi, Hongtao
    Li, Hongping
    Zhang, Dan
    Cheng, Chaqiu
    Cao, Xuanxuan
    COMPUTER NETWORKS, 2018, 132 : 81 - 98
  • [22] Optimizing feature selection across a multimodality database in computerized classification of breast lesions
    Horsch, K
    Ceballos, AF
    Giger, IL
    Bonta, I
    Huo, ZM
    Vyborny, CJ
    Hendrick, E
    Lan, L
    MEDICAL IMAGING 2002: IMAGE PROCESSING, VOL 1-3, 2002, 4684 : 986 - 992
  • [23] Optimizing Feature Selection for Solar Park Classification: Approaches with OBIA and Machine Learning
    Ladisa, Claudio
    Capolupo, Alessandra
    Tarantino, Eufemia
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024 WORKSHOPS, PT V, 2024, 14819 : 286 - 301
  • [24] Optimizing text classification through efficient feature selection based on quality metric
    Jean-Charles Lamirel
    Pascal Cuxac
    Aneesh Sreevallabh Chivukula
    Kafil Hajlaoui
    Journal of Intelligent Information Systems, 2015, 45 : 379 - 396
  • [25] Optimizing text classification through efficient feature selection based on quality metric
    Lamirel, Jean-Charles
    Cuxac, Pascal
    Chivukula, Aneesh Sreevallabh
    Hajlaoui, Kafil
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2015, 45 (03) : 379 - 396
  • [26] Feature selection for classification
    Department of Information Systems and Computer Science, National University of Singapore, Singapore 119260, Singapore
    Intell. Data Anal., 3 (131-156):
  • [27] A machine learning approach for feature selection traffic classification using security analysis
    Shafiq, Muhammad
    Yu, Xiangzhan
    Bashir, Ali Kashif
    Chaudhry, Hassan Nazeer
    Wang, Dawei
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (10): : 4867 - 4892
  • [28] Explainable artificial intelligence for feature selection in network traffic classification: A comparative study
    Khani, Pouya
    Moeinaddini, Elham
    Abnavi, Narges Dehghan
    Shahraki, Amin
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (04)
  • [29] Effective Feature Selection for 5G IM Applications Traffic Classification
    Shafiq, Muhammad
    Yu, Xiangzhan
    Laghari, Asif Ali
    Wang, Dawei
    MOBILE INFORMATION SYSTEMS, 2017, 2017
  • [30] Mobile app traffic flow feature extraction and selection for improving classification robustness
    Liu, Zhen
    Wang, Ruoyu
    Japkowicz, Nathalie
    Cai, Yongming
    Tang, Deyu
    Cai, Xianfa
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2019, 125 : 190 - 208