WaaS: Workflow-as-a-Service for the Cloud with Scheduling of Continuous and Data-Intensive Workflows

被引:13
|
作者
Esteves, Sergio [1 ]
Veiga, Luis [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID Lisboa, P-1699 Lisbon, Portugal
来源
COMPUTER JOURNAL | 2016年 / 59卷 / 03期
关键词
data processing workflow; data-intensive; scheduling; continuous processing; cloud computing; quality-of-service; SCIENTIFIC WORKFLOWS; ALGORITHM;
D O I
10.1093/comjnl/bxu158
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data-intensive and long-lasting applications running in the form of workflows are being increasingly dispatched to cloud computing systems. Current scheduling approaches for graphs of dependencies fail to deliver high resource efficiency while keeping computation costs low, especially for continuous data processing workflows, where the scheduler does not perform any reasoning about the impact new input data may have in the workflow final output. To face such a challenge, we introduce a new scheduling criterion, Quality-of-Data (QoD), which describes the requirements about the data that are worthy of the triggering of tasks in workflows. Based on the QoD notion, we propose a novel service-oriented scheduler planner, for continuous data processing workflows, that is capable of enforcing QoD constraints and guide the scheduling to attain resource efficiency, overall controlled performance and task prioritization. To contrast the advantages of our scheduling model against others, we developed WaaS (Workflow-as-a-Service), a workflow coordinator system for the Cloud where data is shared among tasks via cloud columnar database.
引用
收藏
页码:371 / 383
页数:13
相关论文
共 50 条
  • [1] Data-Intensive Workflow Scheduling in Cloud on Budget and Deadline Constraints
    Xin, Zhang
    Wu, Changze
    Wu, Kaigui
    [J]. COLLABORATE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2016, 2017, 201 : 262 - 272
  • [2] Running Data-Intensive Scientific Workflows in the Cloud
    Sato, Chiaki
    Leslie, Luke M.
    Lee, Young Choon
    Zomaya, Albert Y.
    Ranjan, Rajiv
    [J]. 2014 15TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2014), 2014, : 180 - 185
  • [3] An incremental reinforcement learning scheduling strategy for data-intensive scientific workflows in the cloud
    Nascimento, Andre
    Silva, Vitor
    Paes, Aline
    de Oliveira, Daniel
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (11):
  • [4] Scheduling Data-Intensive Scientific Workflows with Reduced Communication
    Pietri, Ilia
    Sakellariou, Rizos
    [J]. 30TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2018), 2018,
  • [5] Adaptive Caching for Data-Intensive Scientific Workflows in the Cloud
    Heidsieck, Gaetan
    de Oliveira, Daniel
    Pacitti, Esther
    Pradal, Christophe
    Tardieu, Francois
    Valduriez, Patrick
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, 2019, 11707 : 452 - 466
  • [6] A Data Placement Strategy for Data-Intensive Scientific Workflows in Cloud
    Zhao, Qing
    Xiong, Congcong
    Zhao, Xi
    Yu, Ce
    Xiao, Jian
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 928 - 934
  • [7] DynaSched: a dynamic Web service scheduling and deployment framework for data-intensive Grid workflows
    Shahand, Shayan
    Turner, Stephen J.
    Cai, Wentong
    Khademi H, Maryam
    [J]. ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 593 - 602
  • [8] A Data-Intensive Workflow Scheduling Algorithm for Grid Computing
    Xu, Meng
    Cui, Lizhen
    Wang, Haiyang
    Bi, Yanbing
    Bian, Ji
    [J]. FOURTH CHINAGRID ANNUAL CONFERENCE, PROCEEDINGS, 2009, : 110 - 115
  • [9] Science in the Cloud: Allocation and Execution of Data-Intensive Scientific Workflows
    Claudia Szabo
    Quan Z. Sheng
    Trent Kroeger
    Yihong Zhang
    Jian Yu
    [J]. Journal of Grid Computing, 2014, 12 : 245 - 264
  • [10] Dynamic Task Allocation for Data-Intensive Workflows in Cloud Environment
    Liu, Xiping
    Zheng, Liyang
    Chen Junyu
    Lei Shang
    [J]. SERVICE-ORIENTED COMPUTING, ICSOC 2018, 2019, 11434 : 269 - 280