SpotWeb: Running Latency-sensitive Distributed Web Services on Transient Cloud Servers

被引:6
|
作者
Ali-Eldin, Ahmed [1 ]
Westin, Jonathan [1 ]
Wang, Bin [1 ]
Sharma, Prateek [2 ]
Shenoy, Prashant [1 ]
机构
[1] UMass Amherst, Amherst, MA 01003 USA
[2] Indiana Univ, Bloomington, IN 47405 USA
基金
美国国家科学基金会;
关键词
WORKLOAD;
D O I
10.1145/3307681.3325397
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many cloud providers offer servers with transient availability at a reduced cost. These servers can be unilaterally revoked by the provider, usually after a warning period to the user. Until recently, it has been thought that these servers are not suitable to run latency-sensitive workloads due to their transient availability. In this paper, we introduce SpotWeb, a framework for running latency-sensitive web workloads on transient computing platforms while maintaining the Quality-of-Service (QoS) of the running applications. SpotWeb is based on three novel concepts; using multi-period optimization-a novel approach developed in finance-for server selection; transiency-aware load-balancing; and using intelligent capacity over-provisioning. We implement SpotWeb and evaluate its performance in both simulations and testbed experiments. Our results show that SpotWeb reduces costs by up to 50% compared to state-of-the-art solutions while being scalable to hundreds of cloud server configurations.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [11] Latency-Sensitive Data Allocation and Workload Consolidation for Cloud Storage
    Yang, Song
    Wieder, Philipp
    Aziz, Muzzamil
    Yahyapour, Ramin
    Fu, Xiaoming
    Chen, Xu
    IEEE ACCESS, 2018, 6 : 76098 - 76110
  • [12] Elastic Scaling for Distributed Latency-sensitive Data Stream Operators
    De Matteis, Tiziano
    Mencagli, Gabriele
    2017 25TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2017), 2017, : 61 - 68
  • [13] Mobile Edge Computing: An enabler for latency-sensitive mobile services
    Mobile Edge Computing: Ein Enabler für latenzsensitive Mobilfunk-Services
    Beck, Michael Till, 1600, Springer Verlag (39):
  • [14] Exploring In-Memory Accelerators and FPGAs for Latency-Sensitive DNN Inference on Edge Servers
    Suvizi, Ali
    Subramaniam, Suresh
    Lan, Tian
    Venkataramani, Guru
    2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024, : 1 - 6
  • [15] Distributed Ordering Transmissions for Latency-Sensitive Estimation in Wireless Sensor Networks
    Yang, Liu
    Zhu, Hongbin
    Zhu, Zhenghang
    Luo, Xiliang
    Qian, Hua
    2019 IEEE 90TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2019-FALL), 2019,
  • [16] Intelligent and Agile Control of Edge Resources for Latency-Sensitive IoT Services
    Kafle, Ved P.
    Al Muktadir, Abu Hena
    IEEE ACCESS, 2020, 8 (207991-208002) : 207991 - 208002
  • [17] A Genetic Algorithm-based approach for placement in the fog of latency-sensitive multiplayer game servers
    Benamer, Amira-Rayane
    Hadj-Alouane, Nejib Ben
    Boussetta, Khaled
    Hadj-Alouane, Atidel B.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 11249 - 11275
  • [18] Cloud vs Fog: assessment of alternative deployments for a latency-sensitive IoT application
    Gomes, Marcus
    Pardal, Miguel L.
    9TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2018) / THE 8TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2018) / AFFILIATED WORKSHOPS, 2018, 130 : 488 - 495
  • [19] Nomad: An Efficient Consensus Approach for Latency-Sensitive Edge-Cloud Applications
    Hao, Zijiang
    Yi, Shanhe
    Li, Qun
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, : 2539 - 2547
  • [20] Space4time: Optimization latency-sensitive content service in cloud
    Zeng, Lingfang
    Veeravalli, Bharadwaj
    Wei, Qingsong
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2014, 41 : 358 - 368