Cloud application deployment with transient failure recovery

被引:0
|
作者
Ioannis Giannakopoulos
Ioannis Konstantinou
Dimitrios Tsoumakos
Nectarios Koziris
机构
[1] Computing Systems Laboratory,
[2] School of ECE,undefined
[3] National Technical University of Athens,undefined
[4] Department of Informatics,undefined
[5] Ionian University,undefined
关键词
Cloud application deployment; Resource configuration; Transient failure; Error-recovery; Filesystem snapshot;
D O I
暂无
中图分类号
学科分类号
摘要
Application deployment is a crucial operation for modern cloud providers. The ability to dynamically allocate resources and deploy a new application instance based on a user-provided description in a fully automated manner is of great importance for the cloud users as it facilitates the generation of fully reproducible application environments with minimum effort. However, most modern deployment solutions do not consider the error-prone nature of the cloud: Network glitches, bad synchronization between different services and other software or infrastructure related failures with transient characteristics are frequently encountered. Even if these failures may be tolerable during an application’s lifetime, during the deployment phase they can cause severe errors and lead it to failure. In order to tackle this challenge, in this work we propose AURA, an open source system that enables cloud application deployment with transient failure recovery capabilities. AURA formulates the application deployment as a Directed Acyclic Graph. Whenever a transient failure occurs, it traverses the graph, identifies the parts of it that failed and re-executes the respective scripts, based on the fact that when the transient failure disappears the script execution will succeed. Moreover, in order to guarantee that each script execution is idempotent, AURA adopts a lightweight filesystem snapshot mechanism that aims at canceling the side effects of the failed scripts. Our thorough evaluation indicated that AURA is capable of deploying diverse real-world applications to environments exhibiting high error probabilities, introducing a minimal time overhead, proportional to the failure probability of the deployment scripts.
引用
收藏
相关论文
共 50 条
  • [1] Cloud application deployment with transient failure recovery
    Giannakopoulos, Ioannis
    Konstantinou, Ioannis
    Tsoumakos, Dimitrios
    Koziris, Nectarios
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2018, 7
  • [2] Application Deployment and Management in the Cloud
    Miglierina, Marco
    16TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2014), 2014, : 422 - 428
  • [3] Cloud Standby Deployment: A Model-Driven Deployment Method for Disaster Recovery in the Cloud
    Lenk, Alexander
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 933 - 940
  • [4] SCALABLE CLOUD APPLICATION DEPLOYMENT SERVICE FOR VERSATILE CLOUD SERVICE DEPLOYMENT AND CONFIGURATION
    Habala, Ondrej
    Seleng, Martin
    Habala, Michal
    Stuhl, Lubor
    Stano, Michal
    Hluchy, Ladislav
    COMPUTING AND INFORMATICS, 2024, 43 (06) : 1416 - 1431
  • [5] Towards Reliable Application Deployment in the Cloud
    Chen, Ruichuan
    Akkus, Istemi Ekin
    Viswanath, Bimal
    Rimac, Ivica
    Hilt, Volker
    CONEXT'17: PROCEEDINGS OF THE 2017 THE 13TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, 2017, : 464 - 477
  • [6] Quality function deployment in failure recovery and prevention
    Chen, Hsi-Tien
    SERVICE INDUSTRIES JOURNAL, 2016, 36 (13-14): : 615 - 637
  • [7] A System for Application Deployment Automation on Cloud Environment
    Deshmukh, Sagar Narendrasing
    Khandagale, H. P.
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [8] An Application Deployment Approach based on Hybrid Cloud
    Huang, Fengtao
    Li, Hao
    Yuan, Zhihao
    Li, Xian
    2017 IEEE 3RD INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY, IEEE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) AND 2ND IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2017, : 74 - 79
  • [9] AutoCADep: An Approach for Automatic Cloud Application Deployment
    Hiba, Saddam Hocine
    Belguidoum, Meriem
    SERVICE-ORIENTED COMPUTING, ICSOC 2019, 2020, 12019 : 82 - 94
  • [10] On the modeling of optimal and automatized cloud application deployment
    de Gouw, Stijn
    Mauro, Jacopo
    Zavattaro, Gianluigi
    JOURNAL OF LOGICAL AND ALGEBRAIC METHODS IN PROGRAMMING, 2019, 107 : 108 - 135