The art of time-bending: Data augmentation and early prediction for efficient traffic classification

被引:0
|
作者
Hajaj, Chen [1 ,4 ]
Aharon, Porat [2 ,5 ]
Dubin, Ran [2 ,5 ]
Dvir, Amit [3 ,5 ]
机构
[1] Dept Ind Engn & Management, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[2] Dept Comp Sci, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[3] Dept Comp & Software Engn, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[4] Data Sci & Artificial Intelligence Res Ctr, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[5] Ariel Cyber Innovat Ctr, 3 Kiryat Hamada, IL-40700 Ariel, Israel
关键词
Internet traffic classification; Data augmentation; Long Short -Term Memory (LSTM) networks; FEATURE-SELECTION; SEQUENCE;
D O I
10.1016/j.eswa.2024.124166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy's practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Traffic congestion prediction and missing data: a classification approach using weather information
    Mystakidis, Aristeidis
    Tjortjis, Christos
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [42] Implementing WEKA for medical data classification and early disease prediction
    Kumar, Narander
    Khatri, Sabita
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [43] An efficient GS-RBFN framework for early prediction and classification of ad
    Haulath K.
    Mohamed Basheer K.P.
    Multimedia Tools and Applications, 2025, 84 (11) : 8593 - 8621
  • [44] Interpolation and Prediction of Piezometric Multivariate Time Series Based on Data Augmentation and Transformers
    Rabah, Mohamed Louay
    Mellouli, Nedra
    Farah, Imed Riadh
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 327 - 344
  • [45] Data Augmentation for Short-Term Time Series Prediction with Deep Learning
    Flores, Anibal
    Tito-Chura, Hugo
    Apaza-Alanoca, Honorio
    INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 492 - 506
  • [46] A travel time prediction method for urban road traffic sensors data
    Zhu, Guangyu
    Song, Kang
    Zhang, Peng
    2015 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION, AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2015, : 29 - 32
  • [47] Exploring the Value of Traffic Flow Data in Bus Travel Time Prediction
    Mazloumi, Ehsan
    Moridpour, Sara
    Currie, Graham
    Rose, Geoff
    JOURNAL OF TRANSPORTATION ENGINEERING, 2012, 138 (04) : 436 - 446
  • [48] Traffic Estimation And Prediction Based On Real Time Floating Car Data
    de Fabritiis, Corrado
    Ragona, Roberto
    Valenti, Gaetano
    PROCEEDINGS OF THE 11TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, 2008, : 197 - +
  • [49] An Efficient Data-Driven Traffic Prediction Framework for Network Digital Twin
    Nan, Haihan
    Li, Ruidong
    Zhu, Xiaoyan
    Ma, Jianfeng
    Niyato, Dusit
    IEEE NETWORK, 2024, 38 (01): : 22 - 29
  • [50] Prediction-time Efficient Classification Using Feature Computational Dependencies
    Zhao, Liang
    Alipour-Fanid, Amir
    Slawski, Martin
    Zeng, Kai
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2787 - 2796