Tail Latency Prediction for Datacenter Applications in Consolidated Environments

被引:0
|
作者
Alesawi, Sami [1 ,2 ]
Minh Nguyen [1 ]
Che, Hao [1 ]
Singhal, Akshit [1 ]
机构
[1] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Rabigh, Saudi Arabia
关键词
tail latency; Fork-Join queuing networks; consolidated datacenters; resource provisioning; JOIN QUEUES; FORK; EFFICIENT; SYNCHRONIZATION;
D O I
10.1109/iccnc.2019.8685505
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Consolidating applications is a practical necessity in today's datacenters to reduce cost and improve resource utilization. However, resource sharing among different applications may result in high latency in responses to user requests. Due to the lack of a performance model for tail latency of Fork-Join structures, which underlay the workflows of lots of datacenter applications, the current practice is to overprovision resource in an attempt to satisfy as many user requests as possible. However, this practice leads to low resource utilization. Therefore, it is of importance to have a performance model that can accurately predict tail latency in such an environment, especially at high load regions, where resource provisioning is desired at most. In this paper, we propose an analytical solution for the prediction of tail latency of a target application in a consolidated environment where it is mixed with other background applications. The proposed model is validated against simulation through extensive case studies. The experimental results show the effectiveness of the proposed model in tail latency prediction at high load region, yielding all the prediction errors well within 10% at the load of 75% or higher, making the model a valuable tool for resource provisioning and supporting scheduling decisions in datacenter clusters to guarantee user satisfactions.
引用
收藏
页码:265 / 269
页数:5
相关论文
共 50 条
  • [1] On IO Latency Prediction Accuracy and Automated Load Balancing in Consolidated VM Environments
    Nemoto, Jun
    Ganger, Gregory R.
    PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2016, : 224 - 225
  • [2] TimeTrader: Exploiting Latency Tail to Save Datacenter Energy for Online Search
    Vamanan, Balajee
    Bin Sohail, Hamza
    Hasan, Jahangir
    Vijaykumar, T. N.
    PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, : 585 - 597
  • [3] A Mechanism Achieving Low Latency for Wireless Datacenter Applications
    Huang, Tao
    Zhang, Jiao
    Liu, Yunjie
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2016, 13 (02) : 639 - 658
  • [4] WorkloadCompactor: Reducing datacenter cost while providing tail latency SLO guarantees
    Zhu, Timothy
    Kozuch, Michael A.
    Harchol-Balter, Mor
    PROCEEDINGS OF THE 2017 SYMPOSIUM ON CLOUD COMPUTING (SOCC '17), 2017, : 598 - 610
  • [5] Fast Convergence to Fairness for Reduced Long Flow Tail Latency in Datacenter Networks
    Snyder, John
    Lebeck, Alvin R.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1007 - 1017
  • [6] Reducing tail latency for multi-bottleneck in datacenter networks: A compound approach
    Zhang, Yuxiang
    Cui, Lin
    Tso, Fung Po
    Lei, Xiaolin
    COMPUTER NETWORKS, 2025, 257
  • [7] ForkTail: A Black-Box Fork-Join Tail Latency Prediction Model for User-Facing Datacenter Workloads
    Minh Nguyen
    Alesawi, Sami
    Li, Ning
    Che, Hao
    Jiang, Hong
    HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2018, : 206 - 217
  • [8] Precise Power Capping for Latency-Sensitive Applications in Datacenter
    Wu, Song
    Chen, Yang
    Wang, Xinhou
    Jin, Hai
    Liu, Fangming
    Chen, Haibao
    Yan, Chuxiong
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2021, 6 (03): : 469 - 480
  • [9] Managing Tail Latency in Datacenter-Scale File Systems Under Production Constraints
    Misra, Pulkit A.
    Borge, Maria F.
    Goiri, Inigo
    Lebeck, Alvin R.
    Zwaenepoel, Willy
    Bianchini, Ricardo
    PROCEEDINGS OF THE FOURTEENTH EUROSYS CONFERENCE 2019 (EUROSYS '19), 2019,
  • [10] Profiling, Prediction, and Capping of Power Consumption in Consolidated Environments
    Choi, Jeonghwan
    Govindan, Sriram
    Urgaonkar, Bhuvan
    Sivasubramaniam, Anand
    2008 IEEE INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2008, : 65 - 74