SpotWeb: Running Latency-sensitive Distributed Web Services on Transient Cloud Servers

Cited by: 6
Authors
Ali-Eldin, Ahmed [1]
Westin, Jonathan [1]
Wang, Bin [1]
Sharma, Prateek [2]
Shenoy, Prashant [1]
Affiliations
[1] UMass Amherst, Amherst, MA 01003 USA
[2] Indiana Univ, Bloomington, IN 47405 USA
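Funding
National Science Foundation (USA)
Keywords
WORKLOAD
DOI
10.1145/3307681.3325397
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract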
Many cloud providers offer servers with transient availability at a reduced cost. These servers can be unilaterally revoked by the provider, usually after a warning period given to the user. Until recently, these servers were thought to be unsuitable for latency-sensitive workloads because of their transient availability. In this paper, we introduce SpotWeb, a framework for running latency-sensitive web workloads on transient computing platforms while maintaining the Quality-of-Service (QoS) of the running applications. SpotWeb is based on three novel concepts: multi-period optimization (a novel approach developed in finance) for server selection; transiency-aware load balancing; and intelligent capacity over-provisioning. We implement SpotWeb and evaluate its performance in both simulations and testbed experiments. Our results show that SpotWeb reduces costs by up to 50% compared to state-of-the-art solutions while scaling to hundreds of cloud server configurations.
Pages: 1 / 12
Number of pages: 12