Reusability First: Toward FAIR Workflows

被引:9
|
作者
Wolf, Matthew [1 ]
Logan, Jeremy [1 ]
Mehta, Kshitij [1 ]
Jacobson, Daniel [1 ,2 ]
Cashman, Mikaela [1 ]
Walker, Angelica M. [2 ]
Eisenhauer, Greg [4 ]
Widener, Patrick [3 ]
Cliff, Ashley [2 ]
机构
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37830 USA
[2] Univ Tennessee, Bredesen Ctr Interdisciplinary Res & Grad Educ, Knoxville, TN USA
[3] Sandia Natl Labs, POB 5800, Albuquerque, NM 87185 USA
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
Workflows; FAIR; Reusability; Distributed Information systems; Middleware; SCIENCE;
D O I
10.1109/Cluster48925.2021.00053
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The FAIR principles of open science (Findable, Accessible, Interoperable, and Reusable) have had transformative effects on modern large-scale computational science. In particular, they have encouraged more open access to and use of data, an important consideration as collaboration among teams of researchers accelerates and the use of workflows by those teams to solve problems increases. How best to apply the FAIR principles to workflows themselves, and software more generally, is not yet well understood. We argue that the software engineering concept of technical debt management provides a useful guide for application of those principles to workflows, and in particular that it implies reusability should be considered as 'first among equals'. Moreover, our approach recognizes a continuum of reusability where we can make explicit and selectable the trade-offs required in workflows for both their users and developers. To this end, we propose a new abstraction approach for reusable workflows, with demonstrations for both synthetic workloads and real-world computational biology workflows. Through application of novel systems and tools that are based on this abstraction, these experimental workflows are refactored to right-size the granularity of workflow components to efficiently fill the gap between end-user simplicity and general customizability. Our work makes it easier to selectively reason about and automate the connections between trade-offs across user and developer concerns when exposing degrees of freedom for reuse. Additionally, by exposing fine-grained reusability abstractions we enable performance optimizations, as we demonstrate on both institutional-scale and leadership-class HPC resources.
引用
收藏
页码:444 / 455
页数:12
相关论文
共 50 条
  • [1] Reusability and modularity in transactional workflows
    Puustjarvi, J
    Tirri, H
    Veijalainen, J
    [J]. INFORMATION SYSTEMS, 1997, 22 (2-3) : 101 - 120
  • [2] Dealing with Reusability and Reproducibility for Scientific Workflows
    Lifschitz, Sergio
    Gomes, Luciana
    Rehen, Stevens K.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 625 - 632
  • [3] FAIR Computational Workflows
    Goble, Carole
    Cohen-Boulakia, Sarah
    Soiland-Reyes, Stian
    Garijo, Daniel
    Gil, Yolanda
    Crusoe, Michael R.
    Peters, Kristian
    Schober, Daniel
    [J]. DATA INTELLIGENCE, 2020, 2 (1-2) : 108 - 121
  • [4] The First Step Toward Reducing Documentation Defining Ideal Workflows
    Mosier, Sammie
    Englebright, Jane
    [J]. CIN-COMPUTERS INFORMATICS NURSING, 2019, 37 (02) : 57 - 59
  • [5] Progress toward reusability
    Dissel, Adam
    Hellman, Barry
    Poulos, Dennis
    [J]. AEROSPACE AMERICA, 2013, 51 (11) : 47 - 47
  • [6] Reusability Challenges of Scientific Workflows: A Case Study for Galaxy
    Alam, Khairul
    Roy, Banani
    Serebrenik, Alexander
    [J]. PROCEEDINGS OF THE 2023 30TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC 2023, 2023, : 289 - 298
  • [7] Canonical Workflows to Make Data FAIR
    Wittenburg, Peter
    Hardisty, Alex
    Le Franc, Yann
    Mozaffari, Amirpasha
    Peer, Limor
    Skvortsov, Nikolay A.
    Zhao, Zhiming
    Spinuso, Alessandro
    [J]. DATA INTELLIGENCE, 2022, 4 (02) : 286 - 305
  • [8] Canonical Workflows to Make Data FAIR
    Peter Wittenburg
    Alex Hardisty
    Yann Le Franc
    Amirpasha Mozaffari
    Limor Peer
    Nikolay A.Skvortsov
    Zhiming Zhao
    Alessandro Spinuso
    [J]. Data Intelligence, 2022, 4 (02) : 286 - 305
  • [9] Semantic workflows for benchmark challenges: Enhancing comparability, reusability and reproducibility
    Srivastava, Arunima
    Adusumilli, Ravali
    Boyce, Hunter
    Garijo, Daniel
    Ratnakar, Varun
    Mayani, Rajiv
    Yu, Thomas
    Machiraju, Raghu
    Gil, Yolanda
    Mallick, Parag
    [J]. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019, 2019, : 208 - 219
  • [10] F*** workflows: when parts of FAIR are missing
    Wilkinson, Sean R.
    Eisenhauer, Greg
    Kapadia, Anuj J.
    Knight, Kathryn
    Logan, Jeremy
    Widener, Patrick
    Wolf, Matthew
    [J]. 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022), 2022, : 507 - 512