Early Classification of Residential Networks Traffic using C5.0 Machine Learning Algorithm

被引:0
|
作者
Aouini, Zied [1 ,2 ]
Kortebi, Abdesselem [1 ]
Ghamri-Doudane, Yacine [2 ]
Cherif, Iyad Lahsen [1 ]
机构
[1] Orange Labs, Lannion, France
[2] Univ La Rochelle, Lab L3i, La Rochelle, France
关键词
Traffic classification; Residential Internet traffic; machine learning algorithms;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A reliable traffic identification engine is a key component for Internet Service Providers (ISPs) to tune up their networks to meet customers' requirements. The continuously evolving characteristics of Internet traffic along with traffic encryption are challenging the reliability of classical approaches (i.e. port-based, pattern matching). A large body of the literature aims to overcome these challenges using machine learning based methods. However, several gaps limit the deployment of these approaches. In this paper, we focus on providing a fine-grained early residential traffic classification approach considering the lessons learnt from the literature. Our machine learning approach can identify finely services based on the very first packets statistical features. Furthermore, the methodology we developed aims to overcome commonly identified validation issues. Our dataset consists of a real residential traffic capture collected in France and provided by a major ISP involving more than 34,000 customers. Moreover, we developed an extension for an existing open source tool to provide the community with a reliable data processing chain. Our solution achieves very promising accuracy (98.8%) while identifying encrypted services such as Facebook, Google Services or Skype.
引用
收藏
页码:46 / 53
页数:8
相关论文
共 50 条
  • [1] Classification of HTTP traffic based on C5.0 Machine Learning Algorithm
    Bujlow, Tomasz
    Riaz, Tahir
    Pedersen, Jens Myrup
    [J]. 2012 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2012, : 876 - 881
  • [2] Classification of Teak Wood Production in Central Java']Java Using the C5.0 Algorithm
    Susanti, Yuliana
    Respatiwulan
    Handajani, Sri Sulistijowati
    Pratiwi, Hasih
    Slamet, Isnandar
    Hartatik
    Istiqomah, Firstiana
    [J]. INTERNATIONAL CONFERENCE ON SCIENCE AND APPLIED SCIENCE (ICSAS) 2019, 2019, 2202
  • [3] Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees
    Williams, Philip H.
    Eyles, Rod
    Weiller, Georg
    [J]. JOURNAL OF NUCLEIC ACIDS, 2012, 2012
  • [4] Rule Optimization of Boosted C5.0 Classification Using Genetic Algorithm for Liver disease Prediction
    Hassoon, Mafazalyaqeen
    Zomorodi-Moghadam, Mariam
    Kouhi, Mikhak Samadi
    Abdar, Moloud
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA), 2017, : 299 - 305
  • [5] A NOVEL TRAFFIC CLASSIFICATION ALGORITHM USING MACHINE LEARNING
    Liu Huixian
    Li Xiaojuan
    [J]. PROCEEDINGS OF 2009 2ND IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY, 2009, : 340 - 344
  • [6] Application of C5.0 Algorithm to Flu Prediction Using Twitter Data
    Albances, L. Z.
    Bungar, Beatrice Anne
    Patio, Jannah Patrize
    Sevilla, Rio Jan Marty
    Acula, Donata
    [J]. 2018 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON18), 2018, : 141 - 143
  • [7] Breast Cancer Prediction by Using C5.0 Algorithm and BOOSTING Method
    Rafe, Vahid
    Farhoud, Sara Hashemi
    Rasoolzadeh, Siamak
    [J]. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2014, 4 (04) : 600 - 604
  • [8] Web-based classification application for forest fire data using the shiny framework and the C5.0 algorithm
    Siknun, Gita Puspita
    Sitanggang, Imas Sukaesih
    [J]. 2ND INTERNATIONAL SYMPOSIUM ON LAPAN-IPB SATELLITE (LISAT) FOR FOOD SECURITY AND ENVIRONMENTAL MONITORING, 2016, 33 : 332 - 339
  • [9] Constructing a Gaming Model for Professional Tennis Players Using the C5.0 Algorithm
    Chang, Che-Wei
    Qiu, Yu-Ran
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (16):
  • [10] Machine Learning Algorithm in Network Traffic Classification
    Rachmawati, Syifa Maliah
    Kim, Dong-Seong
    Lee, Jae-Min
    [J]. 12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1010 - 1013