An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification

被引:92
|
作者
Shi, Hongtao [1 ,2 ]
Li, Hongping [2 ]
Zhang, Dan [2 ]
Cheng, Chaqiu [2 ]
Cao, Xuanxuan [2 ]
机构
[1] Qingdao Agr Univ, Network Management Ctr, Qingdao 266109, Peoples R China
[2] Ocean Univ China, Coll Informat Sci & Engn, Qingdao 266100, Peoples R China
关键词
Feature selection; Deep learning; Multi-class imbalance; Concept drift; Machine learning; Traffic classification; ALGORITHMS; IMBALANCE;
D O I
10.1016/j.comnet.2018.01.007
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Substantial recent efforts have been made on the application of Machine Learning (ML) techniques to flow statistical features for traffic classification. However, the classification performance of ML techniques is severely degraded due to the high dimensionality and redundancy of flow statistical features, the imbalance in the number of traffic flows and concept drift of Internet traffic. With the aim of comprehensively solving these problems, this paper proposes a new feature optimization approach based on deep learning and Feature Selection (FS) techniques to provide the optimal and robust features for traffic classification. Firstly, symmetric uncertainty is exploited to remove the irrelevant features in network traffic data sets, then a feature generation model based on deep learning is applied to these relevant features for dimensionality reduction and feature generation, finally Weighted Symmetric Uncertainty (WSU) is exploited to select the optimal features by removing the redundant ones. Based on real traffic traces, experimental results show that the proposed approach can not only efficiently reduce the dimension of feature space, but also overcome the negative impacts of multi-class imbalance and concept drift problems on ML techniques. Furthermore, compared with the approaches used in the previous works, the proposed approach achieves the best classification performance and relatively higher runtime performance. (C) 2018 Elsevier BN. All rights reserved.
引用
收藏
页码:81 / 98
页数:18
相关论文
共 50 条
  • [31] Network Traffic Feature Engineering Based on Deep Learning
    Wang, Kai
    Chen, Liyun
    Wang, Shuai
    Wang, Zengguang
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [32] Leveraging Association Rules in Feature Selection for Deep Learning Classification
    Kharsa R.
    Al Aghbari Z.
    SN Computer Science, 5 (1)
  • [33] Genre Classification using Feature Extraction and Deep Learning Techniques
    Kumar, Akshi
    Rajpal, Arjun
    Rathore, Dushyant
    PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2018, : 175 - 180
  • [34] An efficient adaptive feature selection with deep learning model-based paddy plant leaf disease classification
    Dubey, Ratnesh Kumar
    Choubey, Dilip Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 22639 - 22661
  • [35] An efficient adaptive feature selection with deep learning model-based paddy plant leaf disease classification
    Ratnesh Kumar Dubey
    Dilip Kumar Choubey
    Multimedia Tools and Applications, 2024, 83 : 22639 - 22661
  • [36] Imbalanced Network Traffic Classification based on Ensemble Feature Selection
    Ding, Yaojun
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2016,
  • [37] A Particle Swarm Optimization based Feature Selection Approach to Transfer Learning in Classification
    Nguyen, Bach Hoai
    Xue, Bing
    Andreae, Peter
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 37 - 44
  • [38] Rice diseases classification using feature selection and rule generation techniques
    Phadikar, Santanu
    Sil, Jaya
    Das, Asit Kumar
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2013, 90 : 76 - 85
  • [39] Feature Selection for Efficient Gender Classification
    Nazir, M.
    Ishtiaq, Muhammad
    Batool, Anab
    Jaffar, M. Arfan
    Mirza, Anwar M.
    RECENT ADVANCES IN NEURAL NETWORKS, FUZZY SYSTEMS & EVOLUTIONARY COMPUTING, 2010, : 70 - 75
  • [40] A fine-tuning deep learning with multi-objective-based feature selection approach for the classification of text
    Dhal, Pradip
    Azad, Chandrashekhar
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (07): : 3525 - 3553