m3: Accurate Flow-Level Performance Estimation using Machine Learning

Authors
Li, Chenning [1]
Nasr-Esfahany, Arash [1]
Zhao, Kevin [2]
Noorbakhsh, Kimia [1]
Goyal, Prateesh [3]
Alizadeh, Mohammad [1]
Anderson, Thomas E. [2]
Affiliations
[1] MIT CSAIL, Cambridge, MA 02139 USA
[2] Univ Washington, Seattle, WA USA
[3] Microsoft Res, Redmond, WA USA
Keywords
Network simulation; Data center networks; Approximation; Machine learning; Network modeling; NETWORK; PARALLEL
DOI
10.1145/3651890.3672243
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Data center network operators often need accurate estimates of aggregate network performance. Unfortunately, existing methods for estimating aggregate network statistics are either inaccurate or too slow to be practical at the data center scale. In this paper, we develop and evaluate a scale-free, fast, and accurate model for estimating data center network tail latency performance for a given workload, topology, and network configuration. First, we show that path-level simulations (simulations of traffic that intersects a given path) produce almost the same aggregate statistics as full network-wide packet-level simulations. We use a simple and fast flow-level fluid simulation in a novel way to capture and summarize essential elements of the path workload, including the effect of cross-traffic on flows on that path. We use this coarse simulation as input to a machine-learning model to predict path-level behavior, and run it on a sample of paths to produce accurate network-wide estimates. Our model generalizes over the choice of congestion control (CC) protocol, CC protocol parameters, and routing. Relative to Parsimon, a state-of-the-art system for rapidly estimating aggregate network tail latency, our approach is significantly faster (5.7×), more accurate (45.9% less error), and more robust.
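The flow-level fluid simulation mentioned in the abstract can be illustrated with a deliberately simplified sketch. This is not the authors' simulator: it assumes a single bottleneck link whose capacity is split equally among all active flows (processor sharing), with no packets, queues, or congestion control. The function names `fluid_fct` and `slowdowns` are hypothetical and chosen only for this illustration.

```python
from typing import Dict, List, Tuple


def fluid_fct(flows: List[Tuple[float, float]], capacity: float) -> List[float]:
    """Flow completion times on one link under equal-share fluid sharing.

    flows: (arrival_time_s, size_bytes) pairs; capacity: link rate in bytes/s.
    Returns the completion time (seconds since arrival) of each flow, in input order.
    Assumption: a single bottleneck link, which is far simpler than a real path.
    """
    order = sorted(range(len(flows)), key=lambda i: flows[i][0])
    remaining: Dict[int, float] = {}  # active flow index -> bytes left
    fct = [0.0] * len(flows)
    now = 0.0
    nxt = 0  # position in `order` of the next flow to arrive

    while nxt < len(order) or remaining:
        if not remaining:
            # Link is idle: jump straight to the next arrival.
            i = order[nxt]
            now = max(now, flows[i][0])
            remaining[i] = flows[i][1]
            nxt += 1
            continue

        rate = capacity / len(remaining)  # equal share per active flow
        j = min(remaining, key=remaining.get)  # flow that finishes first
        t_finish = now + remaining[j] / rate
        t_arrive = flows[order[nxt]][0] if nxt < len(order) else float("inf")
        t_next = min(t_finish, t_arrive)

        # Drain every active flow at the shared rate until the next event.
        drained = (t_next - now) * rate
        for k in remaining:
            remaining[k] = max(0.0, remaining[k] - drained)
        now = t_next

        if t_finish <= t_arrive:
            del remaining[j]
            fct[j] = now - flows[j][0]
        else:
            i = order[nxt]
            remaining[i] = flows[i][1]
            nxt += 1
    return fct


def slowdowns(flows: List[Tuple[float, float]], capacity: float) -> List[float]:
    """Completion time divided by the time the flow would need on an empty link."""
    return [t / (size / capacity) for t, (_, size) in zip(fluid_fct(flows, capacity), flows)]
```

For example, with a 100 bytes/s link, a 100-byte flow arriving at t = 0 and a 25-byte flow arriving at t = 0.5, `fluid_fct` gives completion times of 1.25 s and 0.5 s, and `slowdowns` gives 1.25 and 2.0. The coarse slowdowns from a simulation of this kind are the sort of summary that, per the abstract, serves as input to a learned model; the model itself is not sketched here.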
Pages: 813-827
Number of pages: 15