A machine learning approach for feature selection traffic classification using security analysis

被引:71
|
作者
Shafiq, Muhammad [1 ]
Yu, Xiangzhan [1 ]
Bashir, Ali Kashif [2 ]
Chaudhry, Hassan Nazeer [3 ]
Wang, Dawei [4 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Univ Faroe Isl, Fac Sci & Technol, Torshavn, Faroe Islands, Denmark
[3] Politecn Milan, Dept Elect Informat & Bioengn, Milan, Italy
[4] Coordinat Ctr, Natl Comp Network Emergency Response Tech Team, Beijing, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2018年 / 74卷 / 10期
基金
中国国家自然科学基金;
关键词
Network traffic classification; Class imbalance; Feature selection; Machine learning; Security;
D O I
10.1007/s11227-018-2263-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Class imbalance has become a big problem that leads to inaccurate traffic classification. Accurate traffic classification of traffic flows helps us in security monitoring, IP management, intrusion detection, etc. To address the traffic classification problem, in literature, machine learning (ML) approaches are widely used. Therefore, in this paper, we also proposed an ML-based hybrid feature selection algorithm named WMI_AUC that make use of two metrics: weighted mutual information (WMI) metric and area under ROC curve (AUC). These metrics select effective features from a traffic flow. However, in order to select robust features from the selected features, we proposed robust features selection algorithm. The proposed approach increases the accuracy of ML classifiers and helps in detecting malicious traffic. We evaluate our work using 11 well-known ML classifiers on the different network environment traces datasets. Experimental results showed that our algorithms achieve more than 95% flow accuracy results.
引用
收藏
页码:4867 / 4892
页数:26
相关论文
共 50 条
  • [1] A machine learning approach for feature selection traffic classification using security analysis
    Muhammad Shafiq
    Xiangzhan Yu
    Ali Kashif Bashir
    Hassan Nazeer Chaudhry
    Dawei Wang
    [J]. The Journal of Supercomputing, 2018, 74 : 4867 - 4892
  • [2] Data Classification Using Feature Selection And kNN Machine Learning Approach
    Begum, Shemim
    Chakraborty, Debasis
    Sarkar, Ram
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 811 - 814
  • [3] Connected Devices Classification using Feature Selection with Machine Learning
    Fagroud, Fatima Zahra
    Toumi, Hicham
    Lahmar, El Habib Ben
    Achtaich, Khadija
    El Filali, Sanaa
    Baddi, Youssef
    [J]. IAENG International Journal of Computer Science, 2022, 49 (02)
  • [4] Feature Selection for Text Classification Using Machine Learning Approaches
    Thirumoorthy, K.
    Muneeswaran, K.
    [J]. NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2022, 45 (01): : 51 - 56
  • [5] Feature Selection for Text Classification Using Machine Learning Approaches
    K. Thirumoorthy
    K. Muneeswaran
    [J]. National Academy Science Letters, 2022, 45 : 51 - 56
  • [6] Robust Feature Selection for IM Applications at Early Stage Traffic Classification Using Machine Learning Algorithms
    Shafiq, Muhammad
    Yu, Xiangzhan
    Wang, Dawei
    [J]. 2017 19TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS (HPCC) / 2017 15TH IEEE INTERNATIONAL CONFERENCE ON SMART CITY (SMARTCITY) / 2017 3RD IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (DSS), 2017, : 239 - 245
  • [7] An Approach for Optimal Feature Selection in Machine Learning using Global Sensitivity Analysis
    Saranya, G.
    Pravin, A.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (06) : 647 - 656
  • [8] A Machine Learning Approach to Mass Spectra Classification with Unsupervised Feature Selection
    Ceccarelli, Michele
    d'Acierno, Antonio
    Facchiano, Angelo
    [J]. COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2009, 5488 : 242 - +
  • [9] Brain Neural Data Analysis Using Machine Learning Feature Selection and Classification Methods
    Bozhkov, Lachezar
    Georgieva, Petia
    Trifonov, Roumen
    [J]. ENGINEERING APPLICATIONS OF NEURAL NETWORKS (EANN 2014), 2014, 459 : 123 - 132
  • [10] An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification
    Shi, Hongtao
    Li, Hongping
    Zhang, Dan
    Cheng, Chaqiu
    Cao, Xuanxuan
    [J]. COMPUTER NETWORKS, 2018, 132 : 81 - 98