Facilitating Asynchronous Collaboration in Scientific Workflow Composition Using Provenance

被引:1
|
作者
Abediniala M. [1 ]
Roy B. [1 ]
机构
[1] University of Saskatchewan, Saskatoon
来源
关键词
asynchronous collaboration; group awareness; groupware; provenance; SWfMS; version control system; workflow;
D O I
10.1145/3534520
中图分类号
学科分类号
摘要
Advances in scientific domains are led to an increase in the complexity of the experiments. To address this growing complexity, scientists from different domains require to work collaboratively. Scientific Workflow Management Systems (SWfMSs) are popular tools for data-intensive experiments. To the best of our knowledge, very few of the existing SWfMSs support collaboration, and it is not efficient in many cases. Researchers share a single version of the workflow in existing collaborative data analysis systems, which increases the chance of interference as the number of collaborators grows. Moreover, for effective collaboration, contributors require a clear view of the project's status, the information that existing SWfMSs do not provide. Another significant problem is most scientists are not capable of adding collaborative tools to existing SWfMSs, and they need software engineers to take on this responsibility. Even for software engineers such tasks could be challenging and time consuming. In this paper, we attempted to address this crucial issue in scientific workflow composition and doing so in a collaborative setting. Hence, we propose a tool to facilitate collaborative workflow composition. This tool provides branching and versioning, which are standard version control system features to allow multiple researchers to contribute to the project asynchronously. We also suggest some visualizations and a variety of reports to increase group awareness and help the scientists to realize the project's status and issues. As a proof of concept, we developed an API to capture the provenance data and provide collaborative tools. This API is developed as an example for software engineers to help them understand how to integrate collaborative tools into any SWfMS. We collect provenance information during workflow composition and then employ it to track workflow versions using the proposed collaborative tool. Prior to implementing the visualizations, we surveyed to discover how much the proposed visualizations could contribute to group awareness. Moreover, in the survey we investigated to what extent the proposed version control system could help address shortcomings in collaborative experiments. The survey participants provided us with valuable feedback. In future, we will use the survey responses to enhance the proposed version control system and visualizations. © 2022 ACM.
引用
收藏
相关论文
共 50 条
  • [31] Storing, reasoning, and querying OPM-compliant scientific workflow provenance using relational databases
    Lim, Chunhyeok
    Lu, Shiyong
    Chebotko, Artem
    Fotouhi, Farshad
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (06): : 781 - 789
  • [32] Exploring Scientific Workflow Provenance Using Hybrid Queries over Nested Data and Lineage Graphs
    Anand, Manish Kumar
    Bowers, Shawn
    McPhillips, Timothy
    Ludaescher, Bertram
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2009, 5566 : 237 - +
  • [33] A scientific workflow framework integrated with object deputy model for data provenance
    Wang, Liwei
    Peng, Zhiyong
    Luo, Min
    Ji, Wenhao
    Huang, Zeqian
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2006, 4016 : 569 - 580
  • [34] RDFPROV: A relational RDF store for querying and managing scientific workflow provenance
    Chebotko, Artem
    Lu, Shiyong
    Fei, Xubo
    Fotouhi, Farshad
    DATA & KNOWLEDGE ENGINEERING, 2010, 69 (08) : 836 - 865
  • [35] Kepler/pPOD: Scientific Workflow and Provenance Support for Assembling the Tree of Life
    Bowers, Shawn
    McPhillips, Timothy
    Riddle, Sean
    Anand, Manish Kumar
    Ludaescher, Bertram
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2008, 5272 : 70 - +
  • [36] Data provenance in a scientific workflow service framework integrated with object deputy database
    International School of Software, Wuhan University, Wuhan 430072, China
    不详
    不详
    Jisuanji Xuebao, 2008, 5 (721-732):
  • [37] SciLedger: A Blockchain-based Scientific Workflow Provenance and Data Sharing Platform
    Hoopes, Reagan
    Hardy, Hamilton
    Long, Min
    Dagher, Gaby G.
    2022 IEEE 8TH INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING, CIC, 2022, : 125 - 134
  • [38] Provenance-Based Workflow Composition with Virtual Simulation Objects Technology
    Smirnov, Pavel A.
    Kovalchuk, Sergey V.
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 930 - 934
  • [39] Improving Workflow Design Using Abstract Provenance Graphs
    Song, Tianhong
    Dey, Saumen
    Bowers, Shawn
    Ludaescher, Bertram
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES (IPAW 2014), 2015, 8628 : 226 - 228
  • [40] Bridging Workflow and Data Provenance Using Strong Links
    Koop, David
    Santos, Emanuele
    Bauer, Bela
    Troyer, Matthias
    Freire, Juliana
    Silva, Claudio T.
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2010, 6187 : 397 - +