KNIME for reproducible cross-domain analysis of life science data

被引:102
|
作者
Fillbrunn, Alexander [1 ,5 ]
Dietz, Christian [1 ]
Pfeuffer, Julianus [2 ]
Rahn, Rene [3 ]
Landrum, Gregory A. [4 ]
Berthold, Michael R. [1 ,4 ,5 ]
机构
[1] Univ Konstanz, Univ Str 10, D-78457 Constance, Germany
[2] Eberhard Karls Univ Tubingen, Geschwister Scholl Pl, D-72074 Tubingen, Germany
[3] Free Univ Berlin, Kaiserswerther Str 16-18, D-14195 Berlin, Germany
[4] KNIME, Technopk Str 1, CH-8005 Zurich, Switzerland
[5] Konstanz Res Sch Chem Biol, Constance, Germany
关键词
KNIME; Workflow systems; Life science; HIGH-THROUGHPUT; MASS; EXPRESSION; PLATFORM; OPENMS; TANDEM;
D O I
10.1016/j.jbiotec.2017.07.028
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Experiments in the life sciences often involve tools from a variety of domains such as mass spectrometry, next generation sequencing, or image processing. Passing the data between those tools often involves complex scripts for controlling data flow, data transformation, and statistical analysis. Such scripts are not only prone to be platform dependent, they also tend to grow as the experiment progresses and are seldomly well documented, a fact that hinders the reproducibility of the experiment. Workflow systems such as KNIME Analytics Platform aim to solve these problems by providing a platform for connecting tools graphically and guaranteeing the same results on different operating systems. As an open source software, KNIME allows scientists and programmers to provide their own extensions to the scientific community. In this review paper we present selected extensions from the life sciences that simplify data exploration, analysis, and visualization and are interoperable due to KNIME's unified data model. Additionally, we name other workflow systems that are commonly used in the life sciences and highlight their similarities and differences to KNIME.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 50 条
  • [1] Cross-Domain Data Fusion
    Yang, Qiang
    [J]. COMPUTER, 2016, 49 (04) : 18 - 18
  • [2] Interactive and reproducible data analysis with the open-source KNIME Analytics Platform
    Landrum, Gregory
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [3] A Cross-domain Data Marketplace for Data Sharing
    Mavrogiorgou, Argyro
    Koukos, Vasileios
    Kouremenou, Eleftheria
    Kiourtis, Athanasios
    Raikos, Alexandros
    Manias, George
    Kyriazis, Dimosthenis
    [J]. PROCEEDINGS OF 2022 THE 3RD EUROPEAN SYMPOSIUM ON SOFTWARE ENGINEERING, ESSE 2022, 2022, : 72 - 79
  • [4] Cross-domain Sound Recognition for Efficient Underwater Data Analysis
    Park, Jeongsoo
    Han, Dong-Gyun
    La, Hyoung Sul
    Lee, Sangmin
    Han, Yoonchang
    Yang, Eun-Jin
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2081 - 2086
  • [5] User Behavior Analysis by Cross-Domain Log Data Fusion
    Tao, Ye
    Guo, Shuaitong
    Shi, Cao
    Chu, Dianhui
    [J]. IEEE ACCESS, 2020, 8 : 400 - 406
  • [6] Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets
    Cortes, Andoni
    Rodriguez, Clemente
    Velez, Gorka
    Barandiaran, Javier
    Nieto, Marcos
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) : 190 - 199
  • [7] Unsupervised Domain Adaptation with Imbalanced Cross-Domain Data
    Hsu, Tzu-Ming Harry
    Chen, Wei-Yu
    Hou, Cheng-An
    Tsai, Yao-Hung Hubert
    Yeh, Yi-Ren
    Wang, Yu-Chiang Frank
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4121 - 4129
  • [8] A cross-domain analysis of change in students' attitudes toward science and attitudes about the utility of science
    George, R
    [J]. INTERNATIONAL JOURNAL OF SCIENCE EDUCATION, 2006, 28 (06) : 571 - 589
  • [9] Data-driven modeling of collaboration networks: a cross-domain analysis
    Tomasello, Mario V.
    Vaccario, Giacomo
    Schweitzer, Frank
    [J]. EPJ DATA SCIENCE, 2017, 6
  • [10] Cross-Domain Data Extraction and Knowledge Graph Construction for Dispute Analysis
    Guo, Qinglang
    Chen, Xiaolu
    Zhou, Pengyuan
    Liao, Yong
    [J]. 2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 959 - 960