Datatrack: An R package for managing data in a multi-stage experimental workflow

被引:0
|
作者
Eichinski, Philip [1 ]
Roe, Paul [1 ]
机构
[1] Queensland Univ Technol, Sci & Engn Fac, Brisbane, Qld, Australia
关键词
computational science; data provenance; R language; R package; workflow;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In experimental research using computation, a workflow is a sequence of steps involving some data processing or analysis where the output of one step may be used as the input of another. The processing steps may involve user-supplied parameters, that when modified, result in a new version of input to the downstream steps, in turn generating new versions of their own output. As more experimentation is done, the results of these various steps can become numerous. It is important to keep track of which data output is dependent on which other generated data, and which parameters were used. In many situations, scientific workflow management systems solve this problem, but these systems are best suited to collaborative, distributed experiments using a variety of services, possibly batch processing parameter sweeps. This paper presents an R package for managing and navigating a network of interdependent data. It is intended as a lightweight tool that provides some visual data provenance information to the experimenter to allow them to manage their generated data as they run experiments within their familiar scripting environment, where it may not be desirable to commit to a fully-blown comprehensive workflow manager. The package consists of wrapper functions for writing and reading output data that can be called from within the R analysis scripts, as well as a visualization of the data-output dependency graph rendered within the R-studio console. Thus, it offers benefit to the experimenter while requiring minimal commitment for integration in their existing working environment.
引用
收藏
页码:147 / 154
页数:8
相关论文
共 50 条
  • [11] REPRESENTATION AND ANALYSIS OF MULTI-STAGE PROBLEMS IN R AND D
    LOCKETT, AG
    GEAR, AE
    MANAGEMENT SCIENCE SERIES B-APPLICATION, 1973, 19 (08): : 947 - 960
  • [12] Limited multi-stage stochastic programming for managing water supply systems
    Housh, Mashor
    Ostfeld, Avi
    Shamir, Uri
    ENVIRONMENTAL MODELLING & SOFTWARE, 2013, 41 : 53 - 64
  • [13] Experimental Investigation of A Multi-Stage Impedance Pumping System
    Lee, Vincent Chieng-Chen
    Abakr, Yousif Abdalla
    Woo, Ko-Choong
    4TH INTERNATIONAL MEETING OF ADVANCES IN THERMOFLUIDS (IMAT 2011), PT 1 AND 2, 2012, 1440 : 1206 - 1211
  • [14] Experimental investigations on a multi-stage water desalination prototype
    Tigrine, Z.
    Aburideh, H.
    Abbas, M.
    Zioui, D.
    Bellatreche, R.
    Merzouk, N. Kasbadji
    Hout, S.
    Belhout, D.
    DESALINATION AND WATER TREATMENT, 2015, 56 (10) : 2612 - 2617
  • [15] An experimental study of hydrodynamics in a multi-stage flotation column
    Gu, XQ
    Chiang, SH
    ADVANCES IN FILTRATION AND SEPARATION TECHNOLOGY, VOLS 13A AND 13B, 1999: ADVANCING FILTRATION AND SEPARATION SOLUTIONS FOR THE MILLENNIUM, 1999, : 245 - 251
  • [16] Numerical and experimental characterization of multi-stage Savonius rotors
    Frikha, Sobhi
    Driss, Zied
    Ayadi, Emna
    Masmoudi, Zied
    Abid, Mohamed Salah
    ENERGY, 2016, 114 : 382 - 404
  • [17] Design and experimental study of novel multi-stage gearbox
    Dong, Peng
    Wang, Yongfei
    Zhao, Shengdun
    Gao, Zhuoneng
    Zhao, Yongqiang
    Cao, Yangfeng
    Zhang, Peng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2021, 235 (01) : 105 - 113
  • [18] Experimental study of multi-stage and multi-channel closing switch
    Huang, Jian-Jun
    Zhang, Yong-Min
    Lai, Ding-Guo
    Ren, Shu-Qing
    Cheng, Liang
    Xie, Lin-Shen
    Yang, Li
    Zhang, Yu-Ying
    Huang, Yong
    Sun, Feng-Ju
    Qiu, Ai-Ci
    Kuai, Bin
    Qiangjiguang Yu Lizishu/High Power Laser and Particle Beams, 2008, 20 (02): : 339 - 342
  • [19] Multi-Stage data envelopment analysis congestion model
    Mithun J. Sharma
    Song Jin Yu
    Operational Research, 2013, 13 : 399 - 413
  • [20] A Multi-Stage Clustering Framework for Automotive Radar Data
    Scheiner, Nicolas
    Appenrodt, Nils
    Dickmann, Juergen
    Sick, Bernhard
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2060 - 2067