A machine learning approach for feature selection traffic classification using security analysis

Cited by: 2
Authors
Muhammad Shafiq
Xiangzhan Yu
Ali Kashif Bashir
Hassan Nazeer Chaudhry
Dawei Wang
Affiliations
[1] Harbin Institute of Technology, School of Computer Science and Technology
[2] University of Faroe Islands, Faculty of Science and Technology
[3] Politecnico di Milano, Department of Electronics, Information and Bioengineering
[4] National Computer Network Emergency Response Technical Team/Coordination Center
Keywords
Network traffic classification; Class imbalance; Feature selection; Machine learning; Security
DOI
Not available
Abstract
Class imbalance has become a major problem that leads to inaccurate traffic classification. Accurate classification of traffic flows helps in security monitoring, IP management, intrusion detection, and similar tasks. Machine learning (ML) approaches are widely used in the literature to address the traffic classification problem. In this paper, we therefore propose an ML-based hybrid feature selection algorithm named WMI_AUC that makes use of two metrics: weighted mutual information (WMI) and area under the ROC curve (AUC). These metrics select effective features from a traffic flow. To then select robust features from among the selected features, we propose a robust feature selection algorithm. The proposed approach increases the accuracy of ML classifiers and helps in detecting malicious traffic. We evaluate our work using 11 well-known ML classifiers on trace datasets from different network environments. Experimental results show that our algorithms achieve more than 95% flow accuracy.
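The abstract names the two metrics but does not give the WMI weighting or the selection rule, so the following is only a minimal sketch of the general idea and not the authors' WMI_AUC algorithm. It scores each discretized feature by plain (unweighted) mutual information with the class label and by the AUC obtained when the feature alone is used as a ranking score, then keeps features that pass both thresholds. The feature names, the toy data, and the thresholds `mi_min` and `auc_min` are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x, y) * log2(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def auc(scores, labels):
    """AUC for binary labels (1 = positive) via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the
    positive sample has the higher score; ties count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def select_features(columns, labels, mi_min=0.1, auc_min=0.7):
    """Keep features passing both thresholds, sorted by AUC (descending).

    The thresholds are illustrative assumptions, not values from the paper.
    """
    kept = []
    for name, values in columns.items():
        mi = mutual_information(values, labels)
        a = auc(values, labels)
        a = max(a, 1.0 - a)  # treat an inverted ranking as equally useful
        if mi >= mi_min and a >= auc_min:
            kept.append((name, mi, a))
    return sorted(kept, key=lambda t: t[2], reverse=True)


# Hypothetical toy flows: 3 positive (minority) and 5 negative samples.
labels = [1, 1, 1, 0, 0, 0, 0, 0]
columns = {
    "pkt_size_bin": [2, 2, 1, 0, 0, 0, 1, 0],  # tracks the class closely
    "port_parity": [0, 1, 0, 1, 0, 1, 0, 1],   # nearly unrelated to the class
}

selected = select_features(columns, labels)
print([name for name, _, _ in selected])
```

On this toy data only `pkt_size_bin` passes both thresholds. A class-weighted variant of the mutual information term, which is what the name WMI suggests, would need the weighting scheme from the full paper.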
Pages: 4867 - 4892
Page count: 25