Optimizing virtual machine allocation for parallel scientific workflows in federated clouds

被引:41
|
作者
Coutinho, Rafaelli de C. [1 ]
Drummond, Lucia M. A. [1 ]
Frota, Yuri [1 ]
de Oliveira, Daniel [1 ]
机构
[1] Univ Fed Fluminense, Inst Comp, Niteroi, RJ, Brazil
关键词
Cloud computing; Federated cloud; Parallel scientific workflow; Virtual machine allocation; MULTIPLE SEQUENCE ALIGNMENT; RESOURCE-ALLOCATION; PROVENANCE; MATRIX; ENGINE;
D O I
10.1016/j.future.2014.10.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing has established itself as an interesting computational model that provides a wide range of resources such as storage, databases and computing power for several types of users. Recently, the concept of cloud computing was extended with the concept of federated clouds where several resources from different cloud providers are inter-connected to perform a common action (e.g. execute a scientific workflow). Users can benefit from both single-provider and federated cloud environment to execute their scientific workflows since they can get the necessary amount of resources on demand. In several of these workflows, there is a demand for high performance and parallelism techniques since many activities are data and computing intensive and can execute for hours, days or even weeks. There are some Scientific Workflow Management Systems (SWfMS) that already provide parallelism capabilities for scientific workflows in single-provider cloud. Most of them rely on creating a virtual cluster to execute the workflow in parallel. However, they also rely on the user to estimate the amount of virtual machines to be allocated to create this virtual cluster. Most SWfMS use this initial virtual cluster configuration made by the user for the entire workflow execution. Dimensioning the virtual cluster to execute the workflow in parallel is then a top priority task since if the virtual cluster is under or over dimensioned it can impact on the workflow performance or increase (unnecessarily) financial costs. This dimensioning is far from trivial in a single-provider cloud and specially in federated clouds due to the huge number of virtual machine types to choose in each location and provider. In this article, we propose an approach named GraspCC-fed to produce the optimal (or near-optimal) estimation of the amount of virtual machines to allocate for each workflow. GraspCC-fed extends a previously proposed heuristic based on GRASP for executing standalone applications to consider scientific workflows executed in both single-provider and federated clouds. For the experiments, GraspCC-fed was coupled to an adapted version of SciCumulus workflow engine for federated clouds. This way, we believe that GraspCC-fed can be an important decision support tool for users and it can help determining an optimal configuration for the virtual cluster for parallel cloud-based scientific workflows. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:51 / 68
页数:18
相关论文
共 50 条
  • [1] Handling Failures in Parallel Scientific Workflows Using Clouds
    Costa, Flavio
    de Oliveira, Daniel
    Ocana, Kary
    Ogasawara, Eduardo
    Dias, Jonas
    Mattoso, Marta
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 129 - 139
  • [2] Computational resource and cost prediction service for scientific workflows in federated clouds
    Rosa, Michel J. F.
    Ralha, Celia Ghedini
    Holanda, Maristela
    Araujo, Aleteia P. F.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 844 - 858
  • [3] Survey of Virtual Machine Placement in Federated Clouds
    Gahlawat, Monica
    Sharma, Priyanka
    [J]. SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2014, : 735 - 738
  • [4] Optimizing Resource Allocation for Scientific Workflows Using Advance Reservations
    Langguth, Christoph
    Schuldt, Heiko
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2010, 6187 : 434 - 451
  • [5] Networking Introspection and Analysis for Virtual Machine Migration in Federated Clouds
    Andronico, Giuseppe
    Bua, Filippo
    Fargetta, Marco
    Giorgio, Emidio
    Guglielmo, Alessio
    Monforte, Salvatore
    Paone, Maurizio
    Villari, Massimo
    [J]. ADVANCES IN SERVICE-ORIENTED AND CLOUD COMPUTING (ESOCC 2015), 2016, 567 : 353 - 362
  • [6] A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
    Daniel de Oliveira
    Kary A. C. S. Ocaña
    Fernanda Baião
    Marta Mattoso
    [J]. Journal of Grid Computing, 2012, 10 : 521 - 552
  • [7] A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
    de Oliveira, Daniel
    Ocana, Kary A. C. S.
    Baiao, Fernanda
    Mattoso, Marta
    [J]. JOURNAL OF GRID COMPUTING, 2012, 10 (03) : 521 - 552
  • [8] Virtual Machine Placement Optimization in SDN-Aware Federated Clouds
    Somasundaram, Thamarai Selvi
    Govindarajan, Kannan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2015, : 379 - 385
  • [9] Power efficient virtual machine migration in a scientific federated cloud
    Amol Jaikar
    Dada Huang
    Gyeong-Ryoon Kim
    Seo-Young Noh
    [J]. Cluster Computing, 2015, 18 : 609 - 618
  • [10] Power efficient virtual machine migration in a scientific federated cloud
    Jaikar, Amol
    Huang, Dada
    Kim, Gyeong-Ryoon
    Noh, Seo-Young
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 609 - 618