The art of time-bending: Data augmentation and early prediction for efficient traffic classification

Cited by: 0
Authors
Hajaj, Chen [1, 4]
Aharon, Porat [2, 5]
Dubin, Ran [2, 5]
Dvir, Amit [3, 5]
Affiliations
[1] Dept Ind Engn & Management, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[2] Dept Comp Sci, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[3] Dept Comp & Software Engn, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[4] Data Sci & Artificial Intelligence Res Ctr, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[5] Ariel Cyber Innovat Ctr, 3 Kiryat Hamada, IL-40700 Ariel, Israel
Keywords
Internet traffic classification; Data augmentation; Long Short-Term Memory (LSTM) networks; FEATURE-SELECTION; SEQUENCE
DOI
10.1016/j.eswa.2024.124166
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy's practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems.
Pages: 11