Tail Latency Prediction for Datacenter Applications in Consolidated Environments

被引:0
|
作者
Alesawi, Sami [1 ,2 ]
Minh Nguyen [1 ]
Che, Hao [1 ]
Singhal, Akshit [1 ]
机构
[1] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Rabigh, Saudi Arabia
关键词
tail latency; Fork-Join queuing networks; consolidated datacenters; resource provisioning; JOIN QUEUES; FORK; EFFICIENT; SYNCHRONIZATION;
D O I
10.1109/iccnc.2019.8685505
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Consolidating applications is a practical necessity in today's datacenters to reduce cost and improve resource utilization. However, resource sharing among different applications may result in high latency in responses to user requests. Due to the lack of a performance model for tail latency of Fork-Join structures, which underlay the workflows of lots of datacenter applications, the current practice is to overprovision resource in an attempt to satisfy as many user requests as possible. However, this practice leads to low resource utilization. Therefore, it is of importance to have a performance model that can accurately predict tail latency in such an environment, especially at high load regions, where resource provisioning is desired at most. In this paper, we propose an analytical solution for the prediction of tail latency of a target application in a consolidated environment where it is mixed with other background applications. The proposed model is validated against simulation through extensive case studies. The experimental results show the effectiveness of the proposed model in tail latency prediction at high load region, yielding all the prediction errors well within 10% at the load of 75% or higher, making the model a valuable tool for resource provisioning and supporting scheduling decisions in datacenter clusters to guarantee user satisfactions.
引用
收藏
页码:265 / 269
页数:5
相关论文
共 50 条
  • [21] Preemptive and Low Latency Datacenter Scheduling via Lightweight Containers
    Chen, Wei
    Zhou, Xiaobo
    Rao, Jia
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (12) : 2749 - 2762
  • [22] Aquila: A unified, low-latency fabric for datacenter networks
    Gibson, Dan
    Hariharan, Hema
    Lance, Eric
    McLaren, Moray
    Montazeri, Behnam
    Singh, Arjun
    Wang, Stephen
    Wassel, Hassan M. G.
    Wu, Zhehua
    Yoo, Sunghwan
    Balasubramanian, Raghuraman
    Chandra, Prashant
    Cutforth, Michael
    Cuy, Peter
    Decotigny, David
    Gautam, Rakesh
    Iriza, Alex
    Martin, Milo M. K.
    Roy, Rick
    Shen, Zuowei
    Tan, Ming
    Tang, Ye
    Monica, Wong-Chan
    Zbiciak, Joe
    Vahdat, Amin
    PROCEEDINGS OF THE 19TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI '22), 2022, : 1249 - 1266
  • [23] DeTail: Reducing the Flow Completion Time Tail in Datacenter Networks
    Zats, David
    Das, Tathagata
    Mohan, Prashanth
    Borthakur, Dhruba
    Katz, Randy
    ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2012, 42 (04) : 139 - 150
  • [24] Applications: Consolidated nanostructures
    Siegel, RW
    Kear, B
    NANOTECHNOLOGY RESEARCH DIRECTIONS: IWGN WORKSHOP REPORT: VISION FOR NANOTECHNOLOGY R&D IN THE NEXT DECADE, 2000, : 139 - 152
  • [25] Delayed-Dynamic-Selective (DDS) Prediction for Reducing Extreme Tail Latency in Web Search
    Kim, Saehoon
    He, Yuxiong
    Hwang, Seung-won
    Elnikety, Sameh
    Choi, Seungjin
    WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 7 - 16
  • [26] Amdahl's Law for Tail Latency
    Delimitrou, Christina
    Kozyrakis, Christos
    COMMUNICATIONS OF THE ACM, 2018, 61 (08) : 65 - 72
  • [27] A Prediction Based Replica Selection Strategy for Reducing Tail Latency in Geo-Distributed Systems
    Shithil, Santa Maria
    Adnan, Muhammad Abdullah
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (03) : 2954 - 2965
  • [28] A Gittins Policy for Optimizing Tail Latency
    Harlev, Amit
    Yu, George
    Scully, Ziv
    Performance Evaluation Review, 2024, 52 (02): : 15 - 17
  • [29] BlueJay: A Platform to Quantifying the Impact of Memory Latency on Datacenter Application Performance
    Qin, Jingchang
    Chen, Yiquan
    Cai, Shishun
    Lin, Wenhai
    Xu, Jiexiong
    Jin, Then
    Cao, Lifa
    Zheng, Zijie
    Zhang, Yuzhong
    Chen, Yi
    Chen, Wenzhi
    2024 IEEE 24TH INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID 2024, 2024, : 489 - 495
  • [30] Virtual Machine Contracts for Datacenter and Cloud Computing Environments
    Matthews, Jeanna
    Garfinkel, Tal
    Hoff, Christofer
    Wheeler, Jeff
    FIRST WORKSHOP ON AUTOMATED CONTROL FOR DATACENTERS AND CLOUDS (ACDC '09), 2009, : 25 - 30