WaaS: Workflow-as-a-Service for the Cloud with Scheduling of Continuous and Data-Intensive Workflows

被引:13
|
作者
Esteves, Sergio [1 ]
Veiga, Luis [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID Lisboa, P-1699 Lisbon, Portugal
来源
COMPUTER JOURNAL | 2016年 / 59卷 / 03期
关键词
data processing workflow; data-intensive; scheduling; continuous processing; cloud computing; quality-of-service; SCIENTIFIC WORKFLOWS; ALGORITHM;
D O I
10.1093/comjnl/bxu158
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data-intensive and long-lasting applications running in the form of workflows are being increasingly dispatched to cloud computing systems. Current scheduling approaches for graphs of dependencies fail to deliver high resource efficiency while keeping computation costs low, especially for continuous data processing workflows, where the scheduler does not perform any reasoning about the impact new input data may have in the workflow final output. To face such a challenge, we introduce a new scheduling criterion, Quality-of-Data (QoD), which describes the requirements about the data that are worthy of the triggering of tasks in workflows. Based on the QoD notion, we propose a novel service-oriented scheduler planner, for continuous data processing workflows, that is capable of enforcing QoD constraints and guide the scheduling to attain resource efficiency, overall controlled performance and task prioritization. To contrast the advantages of our scheduling model against others, we developed WaaS (Workflow-as-a-Service), a workflow coordinator system for the Cloud where data is shared among tasks via cloud columnar database.
引用
收藏
页码:371 / 383
页数:13
相关论文
共 50 条
  • [21] Optimizing Distributed Data-Intensive Workflows
    Friese, Ryan D.
    Tallent, Nathan R.
    Schram, Malachi
    Halappanavar, Mahantesh
    Barker, Kevin J.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 279 - 289
  • [22] PFPMine: A parallel approach for discovering interacting data entities in data-intensive cloud workflows
    Huang, Yuze
    Huang, Jiwei
    Liu, Cong
    Zhang, Chengning
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 113 : 474 - 487
  • [23] Scheduling Technique of Data Intensive Application Workflows in Cloud Computing
    Lakhani, Jignesh
    Bheda, Hitesh
    [J]. 3RD NIRMA UNIVERSITY INTERNATIONAL CONFERENCE ON ENGINEERING (NUICONE 2012), 2012,
  • [24] An Energy-Aware Heuristic Scheduling for Data-Intensive Workflows in Virtualized Datacenters
    Xiao, Peng
    Hu, Zhi-Gang
    Zhang, Yan-Ping
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2013, 28 (06) : 948 - 961
  • [25] An Energy-Aware Heuristic Scheduling for Data-Intensive Workflows in Virtualized Datacenters
    Peng Xiao
    Zhi-Gang Hu
    Yan-Ping Zhang
    [J]. Journal of Computer Science and Technology, 2013, 28 : 948 - 961
  • [26] Data-intensive application scheduling on Mobile Edge Cloud Computing
    Alkhalaileh, Mohammad
    Calheiros, Rodrigo N.
    Quang Vinh Nguyen
    Javadi, Bahman
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2020, 167
  • [27] An Energy-Aware Heuristic Scheduling for Data-Intensive Workflows in Virtualized Datacenters
    肖鹏
    胡志刚
    张艳平
    [J]. Journal of Computer Science & Technology, 2013, 28 (06) : 948 - 961
  • [28] Scheduling Method of Data-Intensive Applications in Cloud Computing Environments
    Fu, Xiong
    Cang, Yeliang
    Zhu, Xinxin
    Deng, Song
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [29] Dynamic Resource Provisioning With Fault Tolerance for Data-Intensive Meteorological Workflows in Cloud
    Xu, Xiaolong
    Mo, Ruichao
    Dai, Fei
    Lin, Wenmin
    Wan, Shaohua
    Dou, Wanchun
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (09) : 6172 - 6181
  • [30] Data-intensive workflow management: For clouds and data-intensive and scalable computing environments
    De Oliveira, Daniel C.M.
    Liu, Ji
    Pacitti, Esther
    [J]. Synthesis Lectures on Data Management, 2019, 14 (04): : 1 - 179