An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification

被引:92
|
作者
Shi, Hongtao [1 ,2 ]
Li, Hongping [2 ]
Zhang, Dan [2 ]
Cheng, Chaqiu [2 ]
Cao, Xuanxuan [2 ]
机构
[1] Qingdao Agr Univ, Network Management Ctr, Qingdao 266109, Peoples R China
[2] Ocean Univ China, Coll Informat Sci & Engn, Qingdao 266100, Peoples R China
关键词
Feature selection; Deep learning; Multi-class imbalance; Concept drift; Machine learning; Traffic classification; ALGORITHMS; IMBALANCE;
D O I
10.1016/j.comnet.2018.01.007
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Substantial recent efforts have been made on the application of Machine Learning (ML) techniques to flow statistical features for traffic classification. However, the classification performance of ML techniques is severely degraded due to the high dimensionality and redundancy of flow statistical features, the imbalance in the number of traffic flows and concept drift of Internet traffic. With the aim of comprehensively solving these problems, this paper proposes a new feature optimization approach based on deep learning and Feature Selection (FS) techniques to provide the optimal and robust features for traffic classification. Firstly, symmetric uncertainty is exploited to remove the irrelevant features in network traffic data sets, then a feature generation model based on deep learning is applied to these relevant features for dimensionality reduction and feature generation, finally Weighted Symmetric Uncertainty (WSU) is exploited to select the optimal features by removing the redundant ones. Based on real traffic traces, experimental results show that the proposed approach can not only efficiently reduce the dimension of feature space, but also overcome the negative impacts of multi-class imbalance and concept drift problems on ML techniques. Furthermore, compared with the approaches used in the previous works, the proposed approach achieves the best classification performance and relatively higher runtime performance. (C) 2018 Elsevier BN. All rights reserved.
引用
收藏
页码:81 / 98
页数:18
相关论文
共 50 条
  • [1] Optimizing Feature Selection for Efficient Encrypted Traffic Classification: A Systematic Approach
    Shen, Meng
    Liu, Yiting
    Zhu, Liehuang
    Xu, Ke
    Du, Xiaojiang
    Guizani, Nadra
    IEEE NETWORK, 2020, 34 (04): : 20 - 27
  • [2] Toward an efficient and scalable feature selection approach for internet traffic classification
    Fahad, Adil
    Tari, Zahir
    Khalil, Ibrahim
    Habib, Ibrahim
    Alnuweiri, Hussein
    COMPUTER NETWORKS, 2013, 57 (09) : 2040 - 2057
  • [3] Traffic Signs Recognition and Classification based on Deep Feature Learning
    Lai, Yan
    Wang, Nanxin
    Yang, Yusi
    Lin, Lan
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 622 - 629
  • [4] A Survey on Feature Selection Techniques for Internet Traffic Classification
    Dhote, Yogesh
    Agrawal, Shikha
    Deen, Anjana Jayant
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 1375 - 1380
  • [5] Efficient and robust feature extraction and selection for traffic classification
    Shi, Hongtao
    Li, Hongping
    Zhang, Dan
    Cheng, Chaqiu
    Wu, Wei
    COMPUTER NETWORKS, 2017, 119 : 1 - 16
  • [6] A machine learning approach for feature selection traffic classification using security analysis
    Shafiq, Muhammad
    Yu, Xiangzhan
    Bashir, Ali Kashif
    Chaudhry, Hassan Nazeer
    Wang, Dawei
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (10): : 4867 - 4892
  • [7] A machine learning approach for feature selection traffic classification using security analysis
    Muhammad Shafiq
    Xiangzhan Yu
    Ali Kashif Bashir
    Hassan Nazeer Chaudhry
    Dawei Wang
    The Journal of Supercomputing, 2018, 74 : 4867 - 4892
  • [8] Credit scoring with a feature selection approach based deep learning
    Van-Sang Ha
    Ha-Nam Nguyen
    2016 7TH INTERNATIONAL CONFERENCE ON MECHANICAL, INDUSTRIAL, AND MANUFACTURING TECHNOLOGIES (MIMT 2016), 2016, 54
  • [9] An efficient botnet detection approach based on feature learning and classification
    Padmavathi, B.
    Muthukumar, B.
    JOURNAL OF CONTROL AND DECISION, 2023, 10 (01) : 40 - 53
  • [10] Deep Learning Based Load Forecasting with Decomposition and Feature Selection Techniques
    Subbiah, Siva Sankari
    Kumar, Senthil P.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2022, 81 (05): : 505 - 517