Reusability First: Toward FAIR Workflows

被引:9
|
作者
Wolf, Matthew [1 ]
Logan, Jeremy [1 ]
Mehta, Kshitij [1 ]
Jacobson, Daniel [1 ,2 ]
Cashman, Mikaela [1 ]
Walker, Angelica M. [2 ]
Eisenhauer, Greg [4 ]
Widener, Patrick [3 ]
Cliff, Ashley [2 ]
机构
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37830 USA
[2] Univ Tennessee, Bredesen Ctr Interdisciplinary Res & Grad Educ, Knoxville, TN USA
[3] Sandia Natl Labs, POB 5800, Albuquerque, NM 87185 USA
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
Workflows; FAIR; Reusability; Distributed Information systems; Middleware; SCIENCE;
D O I
10.1109/Cluster48925.2021.00053
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The FAIR principles of open science (Findable, Accessible, Interoperable, and Reusable) have had transformative effects on modern large-scale computational science. In particular, they have encouraged more open access to and use of data, an important consideration as collaboration among teams of researchers accelerates and the use of workflows by those teams to solve problems increases. How best to apply the FAIR principles to workflows themselves, and software more generally, is not yet well understood. We argue that the software engineering concept of technical debt management provides a useful guide for application of those principles to workflows, and in particular that it implies reusability should be considered as 'first among equals'. Moreover, our approach recognizes a continuum of reusability where we can make explicit and selectable the trade-offs required in workflows for both their users and developers. To this end, we propose a new abstraction approach for reusable workflows, with demonstrations for both synthetic workloads and real-world computational biology workflows. Through application of novel systems and tools that are based on this abstraction, these experimental workflows are refactored to right-size the granularity of workflow components to efficiently fill the gap between end-user simplicity and general customizability. Our work makes it easier to selectively reason about and automate the connections between trade-offs across user and developer concerns when exposing degrees of freedom for reuse. Additionally, by exposing fine-grained reusability abstractions we enable performance optimizations, as we demonstrate on both institutional-scale and leadership-class HPC resources.
引用
收藏
页码:444 / 455
页数:12
相关论文
共 50 条
  • [21] Fair-Share Methods for Scheduling Scientific Workflows in Cloud
    Aldabaybah, Balqees
    Alrawashdeh, Tawfiq
    Butt, Talal Ashraf
    Almiani, Khaled
    [J]. 2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
  • [22] Automatic, Efficient and Scalable Provenance Registration for FAIR HPC Workflows
    Sirvent, Raul
    Conejero, Javier
    Lordan, Francesc
    Ejarque, Jorge
    Rodriguez-Navas, Laura
    Fernandez, Jose M.
    Capella-Gutierrez, Salvador
    Badia, Rosa M.
    [J]. 2022 IEEE/ACM WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE, WORKS, 2022, : 1 - 9
  • [23] FAIR Skin Lesion Classification Workflows using Transfer Learning
    Walshe, David
    O'Reilly, Ruairi
    [J]. 2022 33RD IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2022,
  • [24] Sim2Ls: FAIR simulation workflows and data
    Hunt, Martin
    Clark, Steven
    Mejia, Daniel
    Desai, Saaketh
    Strachan, Alejandro
    [J]. PLOS ONE, 2022, 17 (03):
  • [25] User-friendly Composition of FAIR Workflows in a Notebook Environment
    Richardson, Robin A.
    Celebi, Remzi
    van der Burg, Sven
    Smits, Djura
    Ridder, Lars
    Dumontier, Michel
    Kuhn, Tobias
    [J]. PROCEEDINGS OF THE 11TH KNOWLEDGE CAPTURE CONFERENCE (K-CAP '21), 2021, : 1 - 8
  • [26] Ubermag: Toward More Effective Micromagnetic Workflows
    Beg, Marijan
    Lang, Martin
    Fangohr, Hans
    [J]. IEEE TRANSACTIONS ON MAGNETICS, 2022, 58 (02)
  • [27] TOWARD A SEMANTIC MANAGEMENT OF GEOLOGICAL MODELING WORKFLOWS
    Belaid, Nabil
    Ait-Ameur, Yamine
    Jean, Stephane
    Rainaud, Jean-Francois
    [J]. KEOD 2010: Proceedings of the International Conference on Knowledge Engineering and Ontology Development, 2010, : 282 - 287
  • [28] INFLATION - TOWARD A FAIR PROGRAM
    不详
    [J]. AFL-CIO AMERICAN FEDERATIONIST, 1978, 85 (11): : 1 - 3
  • [29] adCFS: adaptive Completely Fair Scheduling Policy for Containerised Workflows Systems
    Alzahrani, Eidah J.
    Tari, Zahir
    Lee, Young Choon
    Alsadie, Deafallah
    Zomaya, Albert Y.
    [J]. 2017 IEEE 16TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2017, : 245 - 252
  • [30] Implementation of FAIR Practices in Computational Metabolomics Workflows-A Case Study
    Zulfiqar, Mahnoor
    Crusoe, Michael R.
    Koenig-Ries, Birgitta
    Steinbeck, Christoph
    Peters, Kristian
    Gadelha, Luiz
    [J]. METABOLITES, 2024, 14 (02)