A machine learning approach for feature selection traffic classification using security analysis

Cited by: 71
Authors
Shafiq, Muhammad [1]
Yu, Xiangzhan [1]
Bashir, Ali Kashif [2]
Chaudhry, Hassan Nazeer [3]
Wang, Dawei [4]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Univ Faroe Isl, Fac Sci & Technol, Torshavn, Faroe Islands, Denmark
[3] Politecn Milan, Dept Elect Informat & Bioengn, Milan, Italy
[4] Coordinat Ctr, Natl Comp Network Emergency Response Tech Team, Beijing, Peoples R China
Source
JOURNAL OF SUPERCOMPUTING | 2018 / Vol. 74 / Issue 10
Funding
National Natural Science Foundation of China;
Keywords
Network traffic classification; Class imbalance; Feature selection; Machine learning; Security;
DOI
10.1007/s11227-018-2263-3
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Class imbalance has become a major problem that leads to inaccurate traffic classification. Accurate classification of traffic flows supports security monitoring, IP management, intrusion detection, and similar tasks. Machine learning (ML) approaches are widely used in the literature to address the traffic classification problem. In this paper, we propose an ML-based hybrid feature selection algorithm named WMI_AUC that makes use of two metrics: weighted mutual information (WMI) and area under the ROC curve (AUC). These metrics select effective features from a traffic flow. To then select robust features from among the selected features, we propose a robust feature selection algorithm. The proposed approach increases the accuracy of ML classifiers and helps in detecting malicious traffic. We evaluate our work using 11 well-known ML classifiers on trace datasets from different network environments. Experimental results show that our algorithms achieve more than 95% flow accuracy.
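The abstract names the two metrics behind WMI_AUC but does not define them, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes inverse-frequency class weights for the weighted mutual information, a rank-based one-vs-rest AUC, and a simple rule that keeps features above an AUC threshold and orders them by WMI. The function names, the `auc_min` threshold, and the toy flow data are all hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys, class_weights=None):
    """Mutual information (bits) between a discrete feature and class labels.

    If class_weights is given, each joint term is scaled by the weight of its
    class (an assumed weighting scheme, not taken from the paper).
    """
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), count in pxy.items():
        p_joint = count / n
        weight = 1.0 if class_weights is None else class_weights[y]
        mi += weight * p_joint * math.log2(p_joint / ((px[x] / n) * (py[y] / n)))
    return mi


def auc(scores, labels, positive):
    """Rank-based AUC: chance that a positive sample outranks a negative one.

    Ties count as half a win.
    """
    pos = [s for s, lab in zip(scores, labels) if lab == positive]
    neg = [s for s, lab in zip(scores, labels) if lab != positive]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def select_features(features, labels, positive, auc_min=0.7):
    """Keep features whose AUC separability reaches auc_min, ordered by WMI.

    The combination rule is an assumption made for illustration.
    """
    counts = Counter(labels)
    # Inverse-frequency weights so that the minority class counts for more.
    weights = {y: len(labels) / (len(counts) * c) for y, c in counts.items()}
    ranked = []
    for name, xs in features.items():
        a = auc(xs, labels, positive)
        separability = max(a, 1.0 - a)  # ignore the direction of the feature
        if separability >= auc_min:
            ranked.append((mutual_information(xs, labels, weights), name))
    return [name for _, name in sorted(ranked, reverse=True)]


# Toy, imbalanced example: 6 "web" flows and 2 "attack" flows (made-up data).
labels = ["web"] * 6 + ["attack"] * 2
features = {
    "size_bin": [0, 0, 0, 0, 0, 0, 1, 1],  # separates the classes
    "noise": [0, 1, 0, 1, 0, 1, 0, 1],     # independent of the class
}
selected = select_features(features, labels, positive="attack")
```

On this toy data the uninformative feature has an AUC of 0.5 and is filtered out, while the separating feature is kept. Real traffic features are continuous and would need discretization before the mutual information step.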
Pages: 4867-4892
Number of pages: 26