SMITH: a LIMS for handling next-generation sequencing workflows

被引:11
|
作者
Venco, Francesco [1 ]
Vaskin, Yuriy [1 ]
Ceol, Arnaud [2 ]
Muller, Heiko [2 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, I-20133 Milan, Italy
[2] IIT, Ctr Genom Sci IIT SEMM, I-20139 Milan, Italy
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
MANAGEMENT-SYSTEM; BIOLOGY RESEARCH; GENOMIC DATA; VISUALIZATION; EXPLORATION; ENVIRONMENT; SOFTWARE; GALAXY;
D O I
10.1186/1471-2105-15-S14-S3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Life-science laboratories make increasing use of Next Generation Sequencing (NGS) for studying bio-macromolecules and their interactions. Array-based methods for measuring gene expression or protein-DNA interactions are being replaced by RNA-Seq and ChIP-Seq. Sequencing is generally performed by specialized facilities that have to keep track of sequencing requests, trace samples, ensure quality and make data available according to predefined privileges. An integrated tool helps to troubleshoot problems, to maintain a high quality standard, to reduce time and costs. Commercial and non-commercial tools called LIMS (Laboratory Information Management Systems) are available for this purpose. However, they often come at prohibitive cost and/or lack the flexibility and scalability needed to adjust seamlessly to the frequently changing protocols employed. In order to manage the flow of sequencing data produced at the Genomic Unit of the Italian Institute of Technology (IIT), we developed SMITH (Sequencing Machine Information Tracking and Handling). Methods: SMITH is a web application with a MySQL server at the backend. Wet-lab scientists of the Centre for Genomic Science and database experts from the Politecnico of Milan in the context of a Genomic Data Model Project developed SMITH. The data base schema stores all the information of an NGS experiment, including the descriptions of all protocols and algorithms used in the process. Notably, an attribute-value table allows associating an unconstrained textual description to each sample and all the data produced afterwards. This method permits the creation of metadata that can be used to search the database for specific files as well as for statistical analyses. Results: SMITH runs automatically and limits direct human interaction mainly to administrative tasks. SMITH data-delivery procedures were standardized making it easier for biologists and analysts to navigate the data. Automation also helps saving time. The workflows are available through an API provided by the workflow management system. The parameters and input data are passed to the workflow engine that performs de-multiplexing, quality control, alignments, etc. Conclusions: SMITH standardizes, automates, and speeds up sequencing workflows. Annotation of data with key-value pairs facilitates meta-analysis.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] SMITH: a LIMS for handling next-generation sequencing workflows
    Francesco Venco
    Yuriy Vaskin
    Arnaud Ceol
    Heiko Muller
    [J]. BMC Bioinformatics, 15
  • [2] Galaxy LIMS for next-generation sequencing
    Scholtalbers, Jelle
    Roessler, Jasmin
    Sorn, Patrick
    de Graaf, Jos
    Boisguerin, Valesca
    Castle, John
    Sahin, Ugur
    [J]. BIOINFORMATICS, 2013, 29 (09) : 1233 - 1234
  • [3] Automation Enhances Next-Generation Sequencing Workflows
    Varineau, Pierre
    Siembieda, Steve
    [J]. AMERICAN LABORATORY, 2016, 48 (01) : 32 - 33
  • [4] Spotlight on next-generation LIMS platforms
    Schneider, I
    [J]. GENETIC ENGINEERING NEWS, 2004, 24 (05): : 10 - +
  • [5] Next-generation sequencing of the next generation
    Darren J. Burgess
    [J]. Nature Reviews Genetics, 2011, 12 : 78 - 79
  • [6] LIMS software tools for next-generation data
    不详
    [J]. PHARMACOGENOMICS, 2011, 12 (04) : 463 - 463
  • [7] Next-Generation Sequencing: Next-Generation Quality in Pediatrics
    Wortmann, Saskia B.
    Spenger, Johannes
    Preisel, Martin
    Koch, Johannes
    Rauscher, Christian
    Bader, Ingrid
    Mayr, Johannes A.
    Sperl, Wolfgang
    [J]. PADIATRIE UND PADOLOGIE, 2018, 53 (06): : 278 - 283
  • [8] Next-generation sequencing for next-generation breeding, and more
    Tsai, Chung-Jui
    [J]. NEW PHYTOLOGIST, 2013, 198 (03) : 635 - 637
  • [9] Next-Generation Sequencing Demands Next-Generation Phenotyping
    Hennekam, Raoul C. M.
    Biesecker, Leslie G.
    [J]. HUMAN MUTATION, 2012, 33 (05) : 884 - 886
  • [10] Challenges in the Setup of Large-scale Next-Generation Sequencing Analysis Workflows
    Kulkarni, Pranav
    Frommolt, Peter
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2017, 15 : 471 - 477