The art of time-bending: Data augmentation and early prediction for efficient traffic classification

Authors
Hajaj, Chen [1, 4]
Aharon, Porat [2, 5]
Dubin, Ran [2, 5]
Dvir, Amit [3, 5]
Affiliations
[1] Dept Ind Engn & Management, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[2] Dept Comp Sci, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[3] Dept Comp & Software Engn, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[4] Data Sci & Artificial Intelligence Res Ctr, 3 Kiryat Hamada, IL-40700 Ariel, Israel
[5] Ariel Cyber Innovat Ctr, 3 Kiryat Hamada, IL-40700 Ariel, Israel
Keywords
Internet traffic classification; Data augmentation; Long Short-Term Memory (LSTM) networks; FEATURE-SELECTION; SEQUENCE
DOI
10.1016/j.eswa.2024.124166
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The accurate identification of internet traffic is crucial for network management. However, the use of encryption techniques and constant changes in network protocols make it difficult to extract useful features for traffic classification. Additionally, there may be limited data availability and a lack of diversity within the dataset, which poses further challenges. To address these issues, our research proposes a novel solution that uses an innovative data augmentation technique. This approach leverages the capabilities of LSTM networks to create synthetic data points that closely resemble real traffic data. By doing so, we can significantly enrich the dataset used for training and improve classification efficiency. We conducted thorough experiments to validate our approach and found that combining LSTM-generated data with actual traffic data leads to notable improvements in classification efficiency. We demonstrated the effectiveness of our methodology using academic and commercial datasets. Our classifier, trained on the generated data, showed a performance boost of 6%. Moreover, when classifying with only half of the time, thus utilizing half of the signal, our approach achieved a notable 4% improvement compared to the original classifier. The inclusion of augmented samples within the training set led to a noticeable improvement in both accuracy and F1-score. These findings compellingly demonstrate our data augmentation strategy's practical utility and efficiency in earlier prediction with improved performance for encrypted traffic classification systems.
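The two ideas in the abstract, enlarging the training set with synthetic flows and classifying from only the first half of a flow, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the flow representation or the LSTM generator, so the sketch assumes each flow is a list of per-time-bin byte counts and swaps in a simple noise-based generator (`jitter_generator`, a hypothetical placeholder) where the paper uses an LSTM. All function names here are illustrative.

```python
import random
from typing import Callable, List, Sequence, Tuple

Series = List[float]          # assumed representation: bytes observed per time bin
Sample = Tuple[Series, str]   # (flow series, class label)


def truncate_to_fraction(series: Sequence[float], fraction: float = 0.5) -> Series:
    """Keep only the leading part of a flow, emulating early prediction."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    keep = max(1, int(len(series) * fraction))
    return list(series[:keep])


def jitter_generator(series: Sequence[float], rng: random.Random,
                     scale: float = 0.05) -> Series:
    """Placeholder generator: small multiplicative noise per time bin.

    This is NOT the LSTM generator from the paper; it only stands in for
    "something that produces samples resembling real traffic".
    """
    return [max(0.0, x * (1.0 + rng.uniform(-scale, scale))) for x in series]


def augment(real: Sequence[Sample],
            generator: Callable[[Sequence[float], random.Random], Series],
            per_sample: int = 1,
            seed: int = 0) -> List[Sample]:
    """Return the real samples followed by `per_sample` synthetic copies of each."""
    rng = random.Random(seed)
    synthetic = [(generator(series, rng), label)
                 for series, label in real
                 for _ in range(per_sample)]
    return list(real) + synthetic


# Toy data: two labeled flows (values are made up for illustration).
real = [([120.0, 80.0, 0.0, 400.0], "video"),
        ([10.0, 12.0, 9.0, 11.0], "chat")]

train = augment(real, jitter_generator, per_sample=2)

# Early-prediction view: every training flow cut to its first half.
early = [(truncate_to_fraction(series, 0.5), label) for series, label in train]
```

A classifier would then be trained on `early` (or on `train` for full-length flows). Replacing `jitter_generator` with a trained sequence model is the step that would bring the sketch closer to what the abstract describes.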
Pages: 11