Facilitating Asynchronous Collaboration in Scientific Workflow Composition Using Provenance

被引:1
|
作者
Abediniala M. [1 ]
Roy B. [1 ]
机构
[1] University of Saskatchewan, Saskatoon
来源
关键词
asynchronous collaboration; group awareness; groupware; provenance; SWfMS; version control system; workflow;
D O I
10.1145/3534520
中图分类号
学科分类号
摘要
Advances in scientific domains are led to an increase in the complexity of the experiments. To address this growing complexity, scientists from different domains require to work collaboratively. Scientific Workflow Management Systems (SWfMSs) are popular tools for data-intensive experiments. To the best of our knowledge, very few of the existing SWfMSs support collaboration, and it is not efficient in many cases. Researchers share a single version of the workflow in existing collaborative data analysis systems, which increases the chance of interference as the number of collaborators grows. Moreover, for effective collaboration, contributors require a clear view of the project's status, the information that existing SWfMSs do not provide. Another significant problem is most scientists are not capable of adding collaborative tools to existing SWfMSs, and they need software engineers to take on this responsibility. Even for software engineers such tasks could be challenging and time consuming. In this paper, we attempted to address this crucial issue in scientific workflow composition and doing so in a collaborative setting. Hence, we propose a tool to facilitate collaborative workflow composition. This tool provides branching and versioning, which are standard version control system features to allow multiple researchers to contribute to the project asynchronously. We also suggest some visualizations and a variety of reports to increase group awareness and help the scientists to realize the project's status and issues. As a proof of concept, we developed an API to capture the provenance data and provide collaborative tools. This API is developed as an example for software engineers to help them understand how to integrate collaborative tools into any SWfMS. We collect provenance information during workflow composition and then employ it to track workflow versions using the proposed collaborative tool. Prior to implementing the visualizations, we surveyed to discover how much the proposed visualizations could contribute to group awareness. Moreover, in the survey we investigated to what extent the proposed version control system could help address shortcomings in collaborative experiments. The survey participants provided us with valuable feedback. In future, we will use the survey responses to enhance the proposed version control system and visualizations. © 2022 ACM.
引用
收藏
相关论文
共 50 条
  • [1] A survey of provenance in scientific workflow
    Lin, Songhai
    Xiao, Hong
    Jiang, Wenchao
    Li, Dafeng
    Liang, Jiaben
    Li, Zelin
    JOURNAL OF HIGH SPEED NETWORKS, 2023, 29 (02) : 129 - 145
  • [2] Storing and querying scientific workflow provenance metadata using an RDBMS
    Chebotko, Artem
    Fei, Xubo
    Lin, Cui
    Lu, Shiyong
    Fotouhi, Farshad
    E-SCIENCE 2007: THIRD IEEE INTERNATIONAL CONFERENCE ON E-SCIENCE AND GRID COMPUTING, PROCEEDINGS, 2007, : 611 - 618
  • [3] LabelFlow: Exploiting Workflow Provenance to Surface Scientific Data Provenance
    Alper, Pinar
    Belhajjame, Khalid
    Goble, Carole A.
    Karagoz, Pinar
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES (IPAW 2014), 2015, 8628 : 84 - 96
  • [4] Provenance Browser: Displaying and Querying Scientific Workflow Provenance Graphs
    Anand, Manish Kumar
    Bowers, Shawn
    Ludaescher, Bertram
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 1201 - 1204
  • [5] Revision Provenance in Text Documents of Asynchronous Collaboration
    Zhang, Jing
    Jagadish, H. V.
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 889 - 900
  • [6] Provenance-based Scientific Workflow Search
    Abu Jabal, Amani
    Bertino, Elisa
    de Mel, Geeth
    2017 IEEE 13TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2017, : 119 - 127
  • [7] Workflow provenance in the lifecycle of scientific machine learning
    Souza, Renan
    Azevedo, Leonardo G.
    Lourenco, Vitor
    Soares, Elton
    Thiago, Raphael
    Brandao, Rafael
    Civitarese, Daniel
    Brazil, Emilio Vital
    Moreno, Marcio
    Valduriez, Patrick
    Mattoso, Marta
    Cerqueira, Renato
    Netto, Marco A. S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (14):
  • [8] Challenges of Provenance in Scientific Workflow Management Systems
    Alam, Khairul
    Roy, Banani
    2022 IEEE/ACM WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE, WORKS, 2022, : 10 - 18
  • [9] Mechanisms for provenance collection in scientific workflow systems
    Mehdi Sarikhani
    Andrew Wendelborn
    Computing, 2018, 100 : 439 - 472
  • [10] Mechanisms for provenance collection in scientific workflow systems
    Sarikhani, Mehdi
    Wendelborn, Andrew
    COMPUTING, 2018, 100 (05) : 439 - 472