SpotWeb: Running Latency-sensitive Distributed Web Services on Transient Cloud Servers

被引:6
|
作者
Ali-Eldin, Ahmed [1 ]
Westin, Jonathan [1 ]
Wang, Bin [1 ]
Sharma, Prateek [2 ]
Shenoy, Prashant [1 ]
机构
[1] UMass Amherst, Amherst, MA 01003 USA
[2] Indiana Univ, Bloomington, IN 47405 USA
基金
美国国家科学基金会;
关键词
WORKLOAD;
D O I
10.1145/3307681.3325397
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many cloud providers offer servers with transient availability at a reduced cost. These servers can be unilaterally revoked by the provider, usually after a warning period to the user. Until recently, it has been thought that these servers are not suitable to run latency-sensitive workloads due to their transient availability. In this paper, we introduce SpotWeb, a framework for running latency-sensitive web workloads on transient computing platforms while maintaining the Quality-of-Service (QoS) of the running applications. SpotWeb is based on three novel concepts; using multi-period optimization-a novel approach developed in finance-for server selection; transiency-aware load-balancing; and using intelligent capacity over-provisioning. We implement SpotWeb and evaluate its performance in both simulations and testbed experiments. Our results show that SpotWeb reduces costs by up to 50% compared to state-of-the-art solutions while being scalable to hundreds of cloud server configurations.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [21] Impact of Distributed Rate Limiting on Load Distribution in a Latency-sensitive Messaging Service
    Li, Chong
    Liu, Jiangnan
    Lu, Chenyang
    Guerin, Roch
    Gill, Christopher D.
    2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 367 - 377
  • [22] Latency-Sensitive Web Service Workflows: A Case for a Software-Defined Internet
    Kathiravelu, Pradeeban
    Van Roy, Peter
    Veiga, Luis
    Benkhelifa, Elhadj
    2020 SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS), 2020, : 115 - 122
  • [23] Efficiency of Distributed Selection of Edge or Cloud Servers under Latency Constraints
    Mancuso, Vincenzo
    Badia, Leonardo
    Castagno, Paolo
    Sereno, Matteo
    Marsan, Marco Ajmone
    2023 21ST MEDITERRANEAN COMMUNICATION AND COMPUTER NETWORKING CONFERENCE, MEDCOMNET, 2023, : 158 - 166
  • [24] Characterizing and Modeling Distributed Training with Transient Cloud GPU Servers
    Li, Shijian
    Walls, Robert J.
    Guo, Tian
    2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2020, : 943 - 953
  • [25] IADA: A dynamic interference-aware cloud scheduling architecture for latency-sensitive workloads
    Meyer, Vinicius
    da Silva, Matheus L.
    Kirchoff, Dionatra F.
    De Rose, Cesar A. F.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2022, 194
  • [26] DKNNS: Scalable and accurate distributed K nearest neighbor search for latency-sensitive applications
    Fu YongQuan
    Wang YiJie
    SCIENCE CHINA-INFORMATION SCIENCES, 2013, 56 (03) : 1 - 17
  • [27] DKNNS:Scalable and accurate distributed K nearest neighbor search for latency-sensitive applications
    FU YongQuan
    WANG YiJie
    ScienceChina(InformationSciences), 2013, 56 (03) : 123 - 139
  • [28] A Near Optimal Reliable Orchestration Approach for Geo-Distributed Latency-Sensitive SFCs
    Chemodanov, Dmitrii
    Calyam, Prasad
    Esposito, Flavio
    McGarvey, Ronald
    Palaniappan, Kannappan
    Pescape, Antonio
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 2730 - 2745
  • [29] DKNNS: Scalable and accurate distributed K nearest neighbor search for latency-sensitive applications
    YongQuan Fu
    YiJie Wang
    Science China Information Sciences, 2013, 56 : 1 - 17
  • [30] LEASE: Leveraging Energy-Awareness in Serverless Edge for Latency-Sensitive IoT Services
    Verma, Aastik
    Satpathy, Anurag
    Das, Sajal. K.
    Addya, Sourav Kanti
    2024 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS, PERCOM WORKSHOPS, 2024, : 302 - 307