K-Dimensional Trees for Continuous Traffic Classification

被引:0
|
作者
Carela-Espanol, Valentin [1 ]
Barlet-Ros, Pere [1 ]
Sole-Simo, Marc [1 ]
Dainotti, Alberto [2 ]
de Donato, Walter [2 ]
Pescape, Antonio [2 ]
机构
[1] Univ Politecn Cataluna, Dept Comp Architecture, E-08028 Barcelona, Spain
[2] Univ Naples Federico II, Dept Comp Engn & Sys, Naples, Italy
基金
欧盟第七框架计划;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The network measurement community has proposed multiple machine learning (ML) methods for traffic classification during the last years. Although several research works have reported accuracies over 90%, most network operators still use either obsolete (e.g., port-based) or extremely expensive (e.g., pattern matching) methods for traffic classification. We argue that one of the barriers to the real deployment of ML-based methods is their time-consuming training phase. In this paper, we revisit the viability of using the Nearest Neighbor technique for traffic classification. We present an efficient; implementation of this well-known technique based on multiple K-dimensional trees, which is characterized by short training times and high classification speed. This allows us not only to run the classifier online but also to continuously retrain it, without requiring human intervention, as the training data become obsolete. The proposed solution achieves very promising accuracy (> 95%) while looking just at the size of the very first packets of a flow. We present an implementation of this method based on the TIE classification engine as a feasible and simple solution for network operators.
引用
收藏
页码:141 / +
页数:2
相关论文
共 50 条