Secure and robust cloud computing for high-throughput forensic microsatellite sequence analysis and databasing

被引:9
|
作者
Bailey, Sarah F. [1 ,2 ]
Scheible, Melissa K. [1 ,2 ]
Williams, Christopher [1 ,2 ]
Silva, Deborah S. B. S. [1 ,2 ]
Hoggan, Marina [1 ]
Eichman, Christopher [3 ]
Faith, Seth A. [1 ,2 ]
机构
[1] NC State Univ, Mol Biomed Sci, 1060 William Moore Dr, Raleigh, NC 27607 USA
[2] NC State Univ, Forens Sci Inst, 1060 William Moore Dr, Raleigh, NC 27607 USA
[3] NC State Univ, Coll Vet Med, Off Informat Technol, 1060 William Moore Dr, Raleigh, NC 27607 USA
关键词
Cloud; Bioinformatics; Microsatellite; Database; Sequencing; Security; SHORT TANDEM REPEATS; SIGNATURE PREP KIT; STRAIT RAZOR; MPS DATA; SYSTEM; MISEQ; TOOL;
D O I
10.1016/j.fsigen.2017.08.008
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Next-generation Sequencing (NGS) is a rapidly evolving technology with demonstrated benefits for forensic genetic applications, and the strategies to analyze and manage the massive NGS datasets are currently in development. Here, the computing, data storage, connectivity, and security resources of the Cloud were evaluated as a model for forensic laboratory systems that produce NGS data. A complete front-to-end Cloud system was developed to upload, process, and interpret raw NGS data using a web browser dashboard. The system was extensible, demonstrating analysis capabilities of autosomal and Y-STRs from a variety of NGS instrumentation (Illumina MiniSeq and MiSeq, and Oxford Nanopore MinION). NGS data for STRs were concordant with standard reference materials previously characterized with capillary electrophoresis and Sanger sequencing. The computing power of the Cloud was implemented with on-demand auto-scaling to allow multiple file analysis in tandem. The system was designed to store resulting data in a relational database, amenable to downstream sample interpretations and databasing applications following the most recent guidelines in nomenclature for sequenced alleles. Lastly, a multilayered Cloud security architecture was tested and showed that industry standards for securing data and computing resources were readily applied to the NGS system without disadvantageous effects for bioinformatic analysis, connectivity or data storage/retrieval. The results of this study demonstrate the feasibility of using Cloud-based systems for secured NGS data analysis, storage, databasing, and multiuser distributed connectivity. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:40 / 47
页数:8
相关论文
共 50 条
  • [1] High-Throughput Encryption for Cloud Computing Storage System
    Jararweh, Yaser
    Al-Sharqawi, Ola
    Abdulla, Nawaf
    Tawalbeh, Lo'ai
    Alhammouri, Mohammad
    INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2014, 4 (02) : 1 - 14
  • [2] MICROSATELLITE DEVELOPMENT IN RHODOPHYTA USING HIGH-THROUGHPUT SEQUENCE DATA
    Couceiro, Lucia
    Maneiro, Isabel
    Mauger, Stephane
    Valero, Myriam
    Miguel Ruiz, Jose
    Barreiro, Rodolfo
    JOURNAL OF PHYCOLOGY, 2011, 47 (06) : 1258 - 1265
  • [3] The Role of High Performance, Grid and Cloud Computing in High-Throughput Sequencing
    Lightbody, Gaye
    Browne, Fiona
    Zheng, Huiru
    Haberland, Valeriia
    Blayney, Jaine
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 890 - 895
  • [4] High-Throughput Cloud Computing with the Cloudscheduler VM Provisioning Service
    Berghaus F.
    Casteels K.
    Driemel C.
    Ebert M.
    Galindo F.F.
    Leavett-Brown C.
    MacDonell D.
    Paterson M.
    Seuster R.
    Sobie R.J.
    Tolkamp S.
    Weldon J.
    Computing and Software for Big Science, 2020, 4 (1)
  • [5] CMS@home: Integrating the Volunteer Cloud and High-Throughput Computing
    Field L.
    Spiga D.
    Reid I.
    Riahi H.
    Cristella L.
    Computing and Software for Big Science, 2018, 2 (1)
  • [6] Cloud computing platform for high-throughput virtual screening and drug discovery
    Wein, Samuel
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 244
  • [7] The rise of high-throughput computing
    Ning-Hui Sun
    Yun-Gang Bao
    Dong-Rui Fan
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 1245 - 1250
  • [8] The rise of high-throughput computing
    Sun, Ning-Hui
    Bao, Yun-Gang
    Fan, Dong-Rui
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (10) : 1245 - 1250
  • [9] HIGH-THROUGHPUT COMPUTING IN THE SCIENCES
    Morgan, Mark
    Grimshaw, Andrew
    METHODS IN ENZYMOLOGY: COMPUTER METHODS, PART B, 2009, 467 : 197 - 227
  • [10] A High-throughput Gene Sequence Alignment Strategy Using Parallel Computing
    Yang, Rui
    Zhao, Yinhong
    Su, Yuncong
    Pan, Chao
    Duan, Huilong
    Deng, Ning
    2014 7TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2014), 2014, : 638 - 642