Comprehensive simulation of metagenomic sequencing data with non-uniform sampling distribution

被引:2
|
作者
Liu, Shansong
Hua, Kui
Chen, Sijie
Zhang, Xuegong [1 ]
机构
[1] Tsinghua Univ, TNLIST, Bioinformat Div, Key Lab Bioinformat,MOE, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
simulation; metagenomic sequencing data; non-uniform sampling; nuMetaSim;
D O I
10.1007/s40484-018-0142-9
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
BackgroundMetagenomic sequencing is a complex sampling procedure from unknown mixtures of many genomes. Having metagenome data with known genome compositions is essential for both benchmarking bioinformatics software and for investigating influences of various factors on the data. Compared to data from real microbiome samples or from defined microbial mock community, simulated data with proper computational models are better for the purpose as they provide more flexibility for controlling multiple factors.MethodsWe developed a non-uniform metagenomic sequencing simulation system (nuMetaSim) that is capable of mimicking various factors in real metagenomic sequencing to reflect multiple properties of real data with customizable parameter settings.ResultsWe generated 9 comprehensive metagenomic datasets with different composition complexity from of 203 bacterial genomes and 2 archaeal genomes related with human intestine system.ConclusionThe data can serve as benchmarks for comparing performance of different methods at different situations, and the software package allows users to generate simulation data that can better reflect the specific properties in their scenarios.
引用
收藏
页码:175 / 185
页数:11
相关论文
共 50 条
  • [1] Comprehensive simulation of metagenomic sequencing data with non-uniform sampling distribution
    Shansong Liu
    Kui Hua
    Sijie Chen
    Xuegong Zhang
    Quantitative Biology, 2018, 6 (02) : 175 - 185
  • [2] Non-uniform sampling of NMR relaxation data
    Troels E. Linnet
    Kaare Teilum
    Journal of Biomolecular NMR, 2016, 64 : 165 - 173
  • [3] Non-uniform sampling of NMR relaxation data
    Linnet, Troels E.
    Teilum, Kaare
    JOURNAL OF BIOMOLECULAR NMR, 2016, 64 (02) : 165 - 173
  • [4] Data Sampling and Processing: Uniform vs. Non-Uniform Schemes
    Beyrouthy, Taha
    Fesquet, Laurent
    Rolland, Robin
    PROCEEDINGS OF FIRST INTERNATIONAL CONFERENCE ON EVENT-BASED CONTROL, COMMUNICATION AND SIGNAL PROCESSING EBCCSP 2015, 2015,
  • [5] On non-uniform sampling of signals
    Brueller, NN
    Peterfreund, N
    Porat, M
    IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE 98) - PROCEEDINGS, VOLS 1 AND 2, 1998, : 249 - 252
  • [6] Non-uniform frequency domain for optimal exploitation of non-uniform sampling
    Kazimierczuk, Krzysztof
    Zawadzka-Kazimierczuk, Anna
    Kozminski, Wiktor
    JOURNAL OF MAGNETIC RESONANCE, 2010, 205 (02) : 286 - 292
  • [7] A neural algorithm for the non-uniform and adaptive sampling of biomedical data
    Mesin, Luca
    COMPUTERS IN BIOLOGY AND MEDICINE, 2016, 71 : 223 - 230
  • [8] Use of Non-uniform Grids for Reduced Spatial Sampling in FDTD Simulation
    Ramachandran, Aravind
    Cangellaris, Andreas C.
    FREQUENZ, 2008, 62 (7-8) : 156 - 159
  • [9] Non-uniform systematic sampling in stereology
    Dorph-Petersen, KA
    Gundersen, HJG
    Jensen, EBV
    JOURNAL OF MICROSCOPY-OXFORD, 2000, 200 (02): : 148 - 157
  • [10] Non-uniform sampling in biomolecular NMR
    Billeter, Martin
    JOURNAL OF BIOMOLECULAR NMR, 2017, 68 (02) : 65 - 66