Comprehensive simulation of metagenomic sequencing data with non-uniform sampling distribution

Cited by: 2
Authors
Liu, Shansong
Hua, Kui
Chen, Sijie
Zhang, Xuegong [1]
Affiliations
[1] Tsinghua Univ, TNLIST, Bioinformat Div, Key Lab Bioinformat, MOE, Beijing 100084, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
simulation; metagenomic sequencing data; non-uniform sampling; nuMetaSim;
DOI
10.1007/s40484-018-0142-9
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: Metagenomic sequencing is a complex sampling procedure from unknown mixtures of many genomes. Metagenome data with known genome compositions are essential both for benchmarking bioinformatics software and for investigating the influence of various factors on the data. Compared with data from real microbiome samples or from defined microbial mock communities, simulated data generated with proper computational models are better suited to this purpose because they provide more flexibility for controlling multiple factors.
Methods: We developed a non-uniform metagenomic sequencing simulation system (nuMetaSim) that is capable of mimicking various factors in real metagenomic sequencing to reflect multiple properties of real data with customizable parameter settings.
Results: We generated 9 comprehensive metagenomic datasets of differing compositional complexity from 203 bacterial genomes and 2 archaeal genomes related to the human intestinal system.
Conclusion: The data can serve as benchmarks for comparing the performance of different methods in different situations, and the software package allows users to generate simulation data that better reflect the specific properties of their own scenarios.
Pages: 175-185
Number of pages: 11