An Open-source Azure Solution for Scalable Genomics Workflows

被引:1
|
作者
Yang-Turner, Fan [1 ]
Gripper, Lawrence [2 ]
Swann, Jeremy [1 ]
Do, Trien [1 ]
Foster, Dona [1 ]
Volk, Denis [1 ]
Ramanan, Anita [2 ]
Robinson, Marcus [2 ]
Peto, Tim [1 ]
Crook, Derrick [1 ]
机构
[1] Univ Oxford, Nuffield Dept Clin Med, Oxford, England
[2] Microsoft CSE, London, England
关键词
scientific workflow; container; cluster system; genomic data analytics; bioinformatic pipeline; cloud computing;
D O I
10.1109/SERVICES.2018.00033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an open-source Azure solution for running scalable genomics workflows. It benefits from state-of-art distributed workflow framework, container and cloud technologies and allows users to create a cluster that is scaled to suit their workload in minutes. We describe the design decisions, solution testing and automation options to support a variety of users for their genomic data analytics. The solution demonstrates a generic and customizable approach to run genomic data analytics workflows on a cloud environment
引用
收藏
页码:39 / 40
页数:2
相关论文
共 50 条
  • [1] The open-source solution
    Constantine, Larry
    [J]. TECHNOLOGY REVIEW, 2007, 110 (01) : 26 - 26
  • [2] Cyber Arena: An Open-Source Solution for Scalable Cybersecurity Labs in the Cloud
    Huff, Philip
    Leiterman, Sandra
    Springer, Jan P.
    [J]. PROCEEDINGS OF THE 54TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, VOL 1, SIGCSE 2023, 2023, : 221 - 227
  • [3] NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations
    Valiev, M.
    Bylaska, E. J.
    Govind, N.
    Kowalski, K.
    Straatsma, T. P.
    Van Dam, H. J. J.
    Wang, D.
    Nieplocha, J.
    Apra, E.
    Windus, T. L.
    de Jong, Wa.
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2010, 181 (09) : 1477 - 1489
  • [4] Building Forecasting Solutions Using Open-Source and Azure Machine Learning
    Hu, Chenhui
    Paunic, Vanja
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3497 - 3498
  • [5] OpenWorkstation: A modular open-source technology for automated in vitro workflows
    Eggert, Sebastian
    Mieszczanek, Pawel
    Meinert, Christoph
    Hutmacher, Dietmar W.
    [J]. HARDWAREX, 2020, 8
  • [6] The importance of open-source integrative genomics to drug discovery
    Chesler, Elissa J.
    Baker, Erich J.
    [J]. CURRENT OPINION IN DRUG DISCOVERY & DEVELOPMENT, 2010, 13 (03) : 310 - 316
  • [7] Scalable Open-Source System-on-Chip Design
    Carloni, Luca P.
    [J]. 2020 IFIP/IEEE 28TH INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2020, : 7 - 9
  • [8] Fostering Human Activity Recognition Workflows: An Open-Source Baseline Framework
    Demrozi, Florenc
    Turetta, Cristian
    Pravadelli, Graziano
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, ICDH, 2023, : 75 - 80
  • [9] PyPop: a mature open-source software pipeline for population genomics
    Lancaster, Alexander K.
    Single, Richard M.
    Mack, Steven J.
    Sochat, Vanessa
    Mariani, Michael P.
    Webster, Gordon D.
    [J]. FRONTIERS IN IMMUNOLOGY, 2024, 15
  • [10] Open-source based integration solution for hospitals
    Oliveira, Raphael
    Ferreira, Duarte
    Ferreira, Ricardo
    Cruz-Correia, Ricardo
    [J]. 2016 IEEE 29TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2016, : 294 - 299