m3: Accurate Flow-Level Performance Estimation using Machine Learning

被引:2
|
作者
Li, Chenning [1 ]
Nasr-Esfahany, Arash [1 ]
Zhao, Kevin [2 ]
Noorbakhsh, Kimia [1 ]
Goyal, Prateesh [3 ]
Alizadeh, Mohammad [1 ]
Anderson, Thomas E. [2 ]
机构
[1] MIT CSAIL, Cambridge, MA 02139 USA
[2] Univ Washington, Seattle, WA USA
[3] Microsoft Res, Redmond, WA USA
来源
PROCEEDINGS OF THE 2024 ACM SIGCOMM 2024 CONFERENCE, ACM SIGCOMM 2024 | 2024年
关键词
Network simulation; Data center networks; Approximation; Machine learning; Network modeling; NETWORK; PARALLEL;
D O I
10.1145/3651890.3672243
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data center network operators often need accurate estimates of aggregate network performance. Unfortunately, existing methods for estimating aggregate network statistics are either inaccurate or too slow to be practical at the data center scale. In this paper, we develop and evaluate a scale-free, fast, and accurate model for estimating data center network tail latency performance for a given workload, topology, and network configuration. First, we show that path-level simulations-simulations of traffic that intersects a given path-produce almost the same aggregate statistics as full network-wide packet-level simulations. We use a simple and fast flow-level fluid simulation in a novel way to capture and summarize essential elements of the path workload, including the effect of cross-traffic on flows on that path. We use this coarse simulation as input to a machine-learning model to predict path-level behavior, and run it on a sample of paths to produce accurate network-wide estimates. Our model generalizes over the choice of congestion control (CC) protocol, CC protocol parameters, and routing. Relative to Parsimon, a state-of-the-art system for rapidly estimating aggregate network tail latency, our approach is significantly faster (5.7.), more accurate (45.9% less error), and more robust.
引用
收藏
页码:813 / 827
页数:15
相关论文
共 50 条
  • [21] Flow-level performance and capacity of wireless networks with user mobility
    Thomas Bonald
    Sem Borst
    Nidhi Hegde
    Matthieu Jonckheere
    Alexandre Proutiere
    Queueing Systems, 2009, 63
  • [22] An Analytical Model for Flow-Level Performance in Heterogeneous Wireless Networks
    Arvanitakis, George
    Spyropoulos, Thrasyvoulos
    Kaltenberger, Florian
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (03) : 1488 - 1501
  • [23] Flow-level Tail Latency Estimation and Verification based on Extreme Value Theory
    Helm, Max
    Wiedner, Florian
    Carle, Georg
    2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 359 - 363
  • [24] Accurate Performance and Power Prediction for FPGAs Using Machine Learning
    Sawalha, Lina
    Abuaita, Tawfiq
    Cowley, Martin
    Akhmatdinov, Sergei
    Dubs, Adam
    2022 IEEE 30TH INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2022), 2022, : 228 - 228
  • [25] Flow-level Spam Modelling using separate data sources
    Luckner, Marcin
    Filasiak, Robert
    2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2013, : 91 - 98
  • [26] How mobility impacts the flow-level performance of wireless data systems
    Bonald, T
    Borst, SC
    Proutière, A
    IEEE INFOCOM 2004: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-4, PROCEEDINGS, 2004, : 1872 - 1881
  • [27] Flow-level Performance of Opportunistic OFDM-TDMA and OFDMA Networks
    Lei, Lei
    Lin, Chuang
    Cai, Jun
    Shen, Xuemin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2008, 7 (12) : 5461 - 5472
  • [28] Flow-Level Performance of Intra-site Coordination in Cellular Networks
    Khlass, Ahlem
    Bonald, Thomas
    Elayoubi, Salah Eddine
    2013 11TH INTERNATIONAL SYMPOSIUM ON MODELING & OPTIMIZATION IN MOBILE, AD HOC & WIRELESS NETWORKS (WIOPT), 2013, : 216 - 223
  • [29] Fast and Accurate Estimation of Quality of Results in High-Level Synthesis with Machine Learning
    Dai, Steve
    Zhou, Yuan
    Zhang, Hang
    Ustun, Ecenur
    Young, Evangeline F. Y.
    Zhang, Zhiru
    PROCEEDINGS 26TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2018), 2018, : 129 - 132
  • [30] GPGPU Performance and Power Estimation Using Machine Learning
    Wu, Gene
    Greathouse, Joseph L.
    Lyashevsky, Alexander
    Jayasena, Nuwan
    Chiou, Derek
    2015 IEEE 21ST INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2015, : 564 - 576