Cloud application deployment with transient failure recovery

被引:0
|
作者
Ioannis Giannakopoulos
Ioannis Konstantinou
Dimitrios Tsoumakos
Nectarios Koziris
机构
[1] Computing Systems Laboratory,
[2] School of ECE,undefined
[3] National Technical University of Athens,undefined
[4] Department of Informatics,undefined
[5] Ionian University,undefined
关键词
Cloud application deployment; Resource configuration; Transient failure; Error-recovery; Filesystem snapshot;
D O I
暂无
中图分类号
学科分类号
摘要
Application deployment is a crucial operation for modern cloud providers. The ability to dynamically allocate resources and deploy a new application instance based on a user-provided description in a fully automated manner is of great importance for the cloud users as it facilitates the generation of fully reproducible application environments with minimum effort. However, most modern deployment solutions do not consider the error-prone nature of the cloud: Network glitches, bad synchronization between different services and other software or infrastructure related failures with transient characteristics are frequently encountered. Even if these failures may be tolerable during an application’s lifetime, during the deployment phase they can cause severe errors and lead it to failure. In order to tackle this challenge, in this work we propose AURA, an open source system that enables cloud application deployment with transient failure recovery capabilities. AURA formulates the application deployment as a Directed Acyclic Graph. Whenever a transient failure occurs, it traverses the graph, identifies the parts of it that failed and re-executes the respective scripts, based on the fact that when the transient failure disappears the script execution will succeed. Moreover, in order to guarantee that each script execution is idempotent, AURA adopts a lightweight filesystem snapshot mechanism that aims at canceling the side effects of the failed scripts. Our thorough evaluation indicated that AURA is capable of deploying diverse real-world applications to environments exhibiting high error probabilities, introducing a minimal time overhead, proportional to the failure probability of the deployment scripts.
引用
收藏
相关论文
共 50 条
  • [31] The AppScale Cloud Platform Enabling Portable, Scalable Web Application Deployment
    Krintz, Chandra
    IEEE INTERNET COMPUTING, 2013, 17 (02) : 72 - 75
  • [32] Cloud toolkit for Provider assessment, optimized Application Cloudification and deployment on IaaS
    Psychas, A.
    Violos, J.
    Aisopos, F.
    Evangelinou, A.
    Kousiouris, G.
    Bouras, I
    Varvarigou, T.
    Xidas, G.
    Charilas, D.
    Stavroulas, Y.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 109 : 657 - 667
  • [33] Towards Automated IoT Application Deployment by a Cloud-based Approach
    Li, Fei
    Voegler, Michael
    Claessens, Markus
    Dustdar, Schahram
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA), 2013, : 61 - 68
  • [34] Automatic deployment system of computer program application based on cloud computing
    Zhai, Hui
    Wang, Jia
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2021, 12 (04) : 731 - 740
  • [35] Formal Approach to Workflow Application Fragmentations Over Cloud Deployment Models
    Ahn, Hyun
    Kim, Kwanghoon Pio
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (03): : 3071 - 3088
  • [36] A study on virtual machine deployment for application outsourcing in mobile cloud computing
    Shiraz, Muhammad
    Abolfazli, Saeid
    Sanaei, Zohreh
    Gani, Abdullah
    JOURNAL OF SUPERCOMPUTING, 2013, 63 (03): : 946 - 964
  • [37] OptiSpot: minimizing application deployment cost using spot cloud resources
    Daniel J. Dubois
    Giuliano Casale
    Cluster Computing, 2016, 19 : 893 - 909
  • [38] Deployment of a Web Application for Fitting Experimental Data at the JINR Cloud Infrastructure
    Soloviev, A.
    Solovjeva, T.
    Lukyanov, K.
    PHYSICS OF PARTICLES AND NUCLEI, 2024, 55 (03) : 489 - 491
  • [39] Service Deployment in Cloud
    Haji, Amel
    Ben Letaifa, Asma
    Tabbane, Sami
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 268 - 271
  • [40] Industrial Deployment of Reactive Transport Simulation: An Application to Uranium In situ Recovery
    Lagneau, Vincent
    Regnault, Olivier
    Descostes, Michael
    REACTIVE TRANSPORT IN NATURAL AND ENGINEERED SYSTEMS, 2019, 85 : 499 - 528