Assessing and removing the effect of unwanted technical variations in microbiome data

被引:0
|
作者
Muhamad Fachrul
Guillaume Méric
Michael Inouye
Sünje Johanna Pamp
Agus Salim
机构
[1] Baker Heart and Diabetes Institute,Cambridge Baker Systems Genomics Initiative
[2] University of Melbourne,Department of Clinical Pathology
[3] Monash University,Department of Infectious Diseases, Central Clinical School
[4] University of Cambridge,Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care
[5] University of Cambridge,British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care
[6] University of Cambridge,British Heart Foundation Centre of Research Excellence
[7] Health Data Research UK Cambridge,Novo Nordisk Foundation Center for Biosustainability
[8] Wellcome Genome Campus and University of Cambridge,Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health
[9] National Food Institute,School of Mathematics and Statistics
[10] Technical University of Denmark,Department of Population Health
[11] Technical University of Denmark,Department Mathematics and Statistics
[12] The University of Melbourne,undefined
[13] The University of Melbourne,undefined
[14] Baker Heart and Diabetes Institute,undefined
[15] La Trobe University,undefined
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Varying technologies and experimental approaches used in microbiome studies often lead to irreproducible results due to unwanted technical variations. Such variations, often unaccounted for and of unknown source, may interfere with true biological signals, resulting in misleading biological conclusions. In this work, we aim to characterize the major sources of technical variations in microbiome data and demonstrate how in-silico approaches can minimize their impact. We analyzed 184 pig faecal metagenomes encompassing 21 specific combinations of deliberately introduced factors of technical and biological variations. Using the novel Removing Unwanted Variations-III-Negative Binomial (RUV-III-NB), we identified several known experimental factors, specifically storage conditions and freeze–thaw cycles, as likely major sources of unwanted variation in metagenomes. We also observed that these unwanted technical variations do not affect taxa uniformly, with freezing samples affecting taxa of class Bacteroidia the most, for example. Additionally, we benchmarked the performances of different correction methods, including ComBat, ComBat-seq, RUVg, RUVs, and RUV-III-NB. While RUV-III-NB performed consistently robust across our sensitivity and specificity metrics, most other methods did not remove unwanted variations optimally. Our analyses suggest that a careful consideration of possible technical confounders is critical during experimental design of microbiome studies, and that the inclusion of technical replicates is necessary to efficiently remove unwanted variations computationally.
引用
收藏
相关论文
共 50 条
  • [1] Assessing and removing the effect of unwanted technical variations in microbiome data
    Fachrul, Muhamad
    Meric, Guillaume
    Inouye, Michael
    Pamp, Sunje Johanna
    Salim, Agus
    SCIENTIFIC REPORTS, 2022, 12 (01):
  • [2] SUPPRESSION OF UNWANTED VARIATIONS IN DATA
    BERNHARDT, PA
    SCHLAPP, DM
    JOURNAL OF GEOPHYSICAL RESEARCH-SPACE PHYSICS, 1976, 81 (19): : 3428 - 3430
  • [3] Assessing Unwanted Variations in Rheumatology Clinic Previsit Rooming
    Ramly, Edmond
    Stroik, Brad
    Lauver, Diane R.
    Johnson, Heather M.
    McBride, Patrick
    Steffen Lewicki, Kristin
    Arnason, Jon
    Bartels, Christie M.
    JCR-JOURNAL OF CLINICAL RHEUMATOLOGY, 2019, 25 (03) : E1 - E7
  • [4] Removing Unwanted Noise from Operational Modal Analysis Data
    Bonness, William K.
    Jenkins, David M.
    TOPICS IN MODAL ANALYSIS, VOL 10, 2015, : 115 - 122
  • [5] Removing unwanted fluctuations in the AVHRR thermal calibration data using robust techniques
    Trishchenko, AP
    JOURNAL OF ATMOSPHERIC AND OCEANIC TECHNOLOGY, 2002, 19 (12) : 1939 - 1954
  • [6] svaseq: removing batch effects and other unwanted noise from sequencing data
    Leek, Jeffrey T.
    NUCLEIC ACIDS RESEARCH, 2014, 42 (21) : e161
  • [7] Removing unwanted variation from large-scale RNA sequencing data with PRPS
    Molania, Ramyar
    Foroutan, Momeneh
    Gagnon-Bartsch, Johann A.
    Gandolfo, Luke C.
    Jain, Aryan
    Sinha, Abhishek
    Olshansky, Gavriel
    Dobrovic, Alexander
    Papenfuss, Anthony T.
    Speed, Terence P.
    NATURE BIOTECHNOLOGY, 2023, 41 (01) : 82 - +
  • [8] Removing unwanted variation from large-scale RNA sequencing data with PRPS
    Ramyar Molania
    Momeneh Foroutan
    Johann A. Gagnon-Bartsch
    Luke C. Gandolfo
    Aryan Jain
    Abhishek Sinha
    Gavriel Olshansky
    Alexander Dobrovic
    Anthony T. Papenfuss
    Terence P. Speed
    Nature Biotechnology, 2023, 41 : 82 - 95
  • [9] Removing unwanted variation in a differential methylation analysis of Illumina HumanMethylation450 array data
    Maksimovic, Jovana
    Gagnon-Bartsch, Johann A.
    Speed, Terence P.
    Oshlack, Alicia
    NUCLEIC ACIDS RESEARCH, 2015, 43 (16)
  • [10] Assessing Technical Progress Via Data Envelopment Analysis
    LI Guanajin
    LIU Yongqing(Automation Dept.
    Journal of Systems Science and Systems Engineering, 1996, (03) : 257 - 265