m3: Accurate Flow-Level Performance Estimation using Machine Learning

Cited by: 2
Authors
Li, Chenning [1 ]
Nasr-Esfahany, Arash [1 ]
Zhao, Kevin [2 ]
Noorbakhsh, Kimia [1 ]
Goyal, Prateesh [3 ]
Alizadeh, Mohammad [1 ]
Anderson, Thomas E. [2 ]
Affiliations
[1] MIT CSAIL, Cambridge, MA 02139 USA
[2] Univ Washington, Seattle, WA USA
[3] Microsoft Res, Redmond, WA USA
Source
PROCEEDINGS OF THE 2024 ACM SIGCOMM 2024 CONFERENCE, ACM SIGCOMM 2024 | 2024
Keywords
Network simulation; Data center networks; Approximation; Machine learning; Network modeling; NETWORK; PARALLEL;
DOI
10.1145/3651890.3672243
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data center network operators often need accurate estimates of aggregate network performance. Unfortunately, existing methods for estimating aggregate network statistics are either inaccurate or too slow to be practical at data center scale. In this paper, we develop and evaluate a scale-free, fast, and accurate model for estimating data center network tail latency for a given workload, topology, and network configuration. First, we show that path-level simulations (simulations of the traffic that intersects a given path) produce almost the same aggregate statistics as full network-wide packet-level simulations. We use a simple and fast flow-level fluid simulation in a novel way to capture and summarize essential elements of the path workload, including the effect of cross-traffic on flows on that path. We use this coarse simulation as input to a machine-learning model to predict path-level behavior, and run it on a sample of paths to produce accurate network-wide estimates. Our model generalizes over the choice of congestion control (CC) protocol, CC protocol parameters, and routing. Relative to Parsimon, a state-of-the-art system for rapidly estimating aggregate network tail latency, our approach is significantly faster (5.7×), more accurate (45.9% less error), and more robust.
Pages: 813-827
Page count: 15