Backup or Not: An Online Cost Optimal Algorithm for Data Analysis Jobs Using Spot Instances

被引:4
|
作者
Lin, Liduo [1 ]
Pan, Li [1 ]
Liu, Shijun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Spot instance; online algorithm; back up; abrupt termination; BIG DATA;
D O I
10.1109/ACCESS.2020.3014978
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, large-scale public cloud providers begin to offer spot instances. This type of instance has become popular with more and more cloud users in the light of its convenient access mode and low price, especially for those big data analysis jobs with high performance computation requirements. However, using spot instances may carry the risk of being interrupted and lead to extra costs for job re-executions because these instances are generally unstable. Yet, such cost can be greatly reduced if a backup can be made at the right time before interruptions. For convenience and cost efficiency, users can choose the StaaS (Storage-as-a-Service) storage provided by the same cloud provider, whose spot instances are used by the users, to store backup data files for future job execution recovery. Since making backups too often will incur increased costs, users need to make the backup decisions appropriately considering the condition when an abrupt interruption will occur in the future. However, it is hard to know or predict precisely when such an interruption will occur. For solving this problem, in this article, we propose an online algorithm to guide cloud users to make backups when using spot instances to execute big data analysis jobs, without requiring any information about future interruptions. We prove theoretically that our proposed online algorithm can guarantee a bounded competitive ratio less than 2. Finally, according to extensive experiments, we verify the effectiveness of our online algorithm in reducing the additional cost caused by interruptions in using spot instances and find that our online algorithm can still achieve a stable cost optimization even if interruptions occur frequently.
引用
收藏
页码:144945 / 144956
页数:12
相关论文
共 50 条
  • [1] An Online Algorithm Based on Replication for Using Spot Instances in IaaS Clouds
    Xu, Zhi-Wei
    Pan, Li
    Liu, Shi-Jun
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (01) : 103 - 115
  • [2] Optimal Static Bidding Strategy for Running Jobs with Hard Deadline Constraints on Spot Instances
    Wang, Kai-Siang
    Hsieh, Cheng-Han
    Chou, Jerry
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, CLOSER 2023, 2023, : 123 - 130
  • [3] An online algorithm for scheduling big data analysis jobs in cloud environments
    Kang, Youyou
    Pan, Li
    Liu, Shijun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 245
  • [4] Reducing Cloud provisioning Cost Using Spot Instances hopping
    Sanad, Ali Jassim
    Hammad, Mustafa
    [J]. 2019 INTERNATIONAL CONFERENCE ON INNOVATION AND INTELLIGENCE FOR INFORMATICS, COMPUTING, AND TECHNOLOGIES (3ICT), 2019,
  • [5] To store or not: Online cost optimization for running big data jobs on the cloud
    Fu, Xiankun
    Pan, Li
    Liu, Shijun
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 156 : 42 - 52
  • [6] A Cost-Efficient Workflow as a Service Broker Using On-demand and Spot Instances
    Taghavi, Bahareh
    Zolfaghari, Behrooz
    Abrishami, Saeid
    [J]. JOURNAL OF GRID COMPUTING, 2023, 21 (03)
  • [7] A Cost-Efficient Workflow as a Service Broker Using On-demand and Spot Instances
    Bahareh Taghavi
    Behrooz Zolfaghari
    Saeid Abrishami
    [J]. Journal of Grid Computing, 2023, 21
  • [8] A tight analysis and near-optimal instances of the algorithm of Anderson and Woll
    Malewicz, G
    [J]. THEORETICAL COMPUTER SCIENCE, 2004, 329 (1-3) : 285 - 301
  • [9] Caching or not: An online cost optimization algorithm for geodistributed data analysis in cloud environments
    Yang, Weitao
    Pan, Li
    Liu, Shijun
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2024, 229
  • [10] Are student jobs flexible jobs? Using online data to study employers' preferences in Slovakia
    Kurekova, Lucia Mytna
    Zilincikova, Zuzana
    [J]. IZA JOURNAL OF EUROPEAN LABOR STUDIES, 2016, 5