Optimizing virtual machine allocation for parallel scientific workflows in federated clouds

Cited by: 41

Authors:

Coutinho, Rafaelli de C. [1]

Drummond, Lucia M. A. [1]

Frota, Yuri [1]

de Oliveira, Daniel [1]

Affiliations:

[1] Univ Fed Fluminense, Inst Comp, Niteroi, RJ, Brazil

Source:

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2015 / Vol. 46

Keywords:

Cloud computing; Federated cloud; Parallel scientific workflow; Virtual machine allocation; MULTIPLE SEQUENCE ALIGNMENT; RESOURCE-ALLOCATION; PROVENANCE; MATRIX; ENGINE

DOI:

10.1016/j.future.2014.10.009

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Cloud computing has established itself as an interesting computational model that provides a wide range of resources, such as storage, databases and computing power, for several types of users. Recently, the concept of cloud computing was extended to federated clouds, in which resources from different cloud providers are interconnected to perform a common action (e.g., execute a scientific workflow). Users can benefit from both single-provider and federated cloud environments to execute their scientific workflows, since they can obtain the necessary amount of resources on demand. Many of these workflows demand high performance and parallelism techniques, since many activities are data- and compute-intensive and can run for hours, days or even weeks. Some Scientific Workflow Management Systems (SWfMS) already provide parallelism capabilities for scientific workflows in single-provider clouds. Most of them rely on creating a virtual cluster to execute the workflow in parallel. However, they also rely on the user to estimate the number of virtual machines to allocate to this virtual cluster, and most SWfMS use this initial user-defined configuration for the entire workflow execution. Dimensioning the virtual cluster is therefore a top-priority task: an under- or over-dimensioned virtual cluster can degrade workflow performance or unnecessarily increase financial costs. This dimensioning is far from trivial in a single-provider cloud, and especially in federated clouds, due to the huge number of virtual machine types to choose from in each location and provider. In this article, we propose an approach named GraspCC-fed to produce the optimal (or near-optimal) estimate of the number of virtual machines to allocate for each workflow. GraspCC-fed extends a previously proposed heuristic based on GRASP for executing standalone applications to consider scientific workflows executed in both single-provider and federated clouds. For the experiments, GraspCC-fed was coupled to an adapted version of the SciCumulus workflow engine for federated clouds. We believe that GraspCC-fed can be an important decision support tool for users and can help determine an optimal configuration of the virtual cluster for parallel cloud-based scientific workflows. (C) 2014 Elsevier B.V. All rights reserved.

Pages: 51-68

Page count: 18
