Distributed Scientific Workflow Management for Data-Intensive Applications

被引:4
|
作者
Shumilov, S. [1 ]
Leng, Y. [1 ]
El-Gayyar, M. [1 ]
Cremers, A. B. [1 ]
机构
[1] Univ Bonn, Comp Sci Dept 3, D-5300 Bonn, Germany
关键词
D O I
10.1109/FTDCS.2008.39
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Grids and service-oriented technologies are emerging as dominant approaches fir distributed systems. Rising complexity of grid applications places new requirements and increases needs to improve the design and reusability of grid workflow systems. In particular, scientific data-intensive workflows require new approaches for integration and coordination of distributed resources. Traditional centralized approaches for workflow execution can be quite efficient for such workflows. In order to clarify these inefficient, the paper surveys existing workflow management systems evaluating them practically on some use cases for management of natural resources from the multidisciplinary research project GLOWA Volta. Subsequently, the most important obstacles are identified and a new approach facilitating semantic oriented composition, reuse and distributed execution of workflows is proposed.
引用
收藏
页码:65 / 73
页数:9
相关论文
共 50 条
  • [1] Integrating Policy with Scientific Workflow Management for Data-Intensive Applications
    Chervenak, Ann L.
    Smith, David E.
    Chen, Weiwei
    Deelman, Ewa
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 140 - 149
  • [2] A Survey of Data-Intensive Scientific Workflow Management
    Liu, Ji
    Pacitti, Esther
    Valduriez, Patrick
    Mattoso, Marta
    [J]. JOURNAL OF GRID COMPUTING, 2015, 13 (04) : 457 - 493
  • [3] A Survey of Data-Intensive Scientific Workflow Management
    Ji Liu
    Esther Pacitti
    Patrick Valduriez
    Marta Mattoso
    [J]. Journal of Grid Computing, 2015, 13 : 457 - 493
  • [4] Data-intensive workflow management: For clouds and data-intensive and scalable computing environments
    De Oliveira, Daniel C.M.
    Liu, Ji
    Pacitti, Esther
    [J]. Synthesis Lectures on Data Management, 2019, 14 (04): : 1 - 179
  • [5] Data-intensive Workflow Execution using Distributed Compute Resources
    Pandey, Ashish
    Wang, Songjie
    Calyam, Prasad
    [J]. 2019 IEEE 27TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (IEEE ICNP), 2019,
  • [6] Data-Intensive Scalable Computing for Scientific Applications
    Bryant, Randal E.
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2011, 13 (06) : 25 - 33
  • [7] Citus: Distributed PostgreSQL for Data-Intensive Applications
    Cubukcu, Umur
    Erdogan, Ozgun
    Pathak, Sumedh
    Sannakkayala, Sudhakar
    Slot, Marco
    [J]. SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2490 - 2502
  • [8] Understanding performance of distributed data-intensive applications
    Miceli, Christopher
    Miceli, Michael
    Rodriguez-Milla, Bety
    Jha, Shantenu
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2010, 368 (1926): : 4089 - 4102
  • [9] Data Management Challenges of Data-Intensive Scientific Workflows
    Deelman, Ewa
    Chervenak, Ann
    [J]. CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS, 2008, : 687 - 692
  • [10] CoLoc: Distributed Data and Container Colocation for Data-Intensive Applications
    Renner, Thomas
    Thamsen, Lauritz
    Kao, Odej
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3008 - 3015