Combining compositional data sets introduces error in covariance network reconstruction

Cited by: 2
Authors
Brunner, James D. [1 ,2 ,3 ]
Robinson, Aaron J. [1 ]
Chain, Patrick S. G. [1 ]
Affiliations
[1] Los Alamos Natl Lab, Biosci Div, Los Alamos, NM 87545 USA
[2] Los Alamos Natl Lab, Ctr Nonlinear Studies, Los Alamos, NM 87545 USA
[3] Los Alamos Natl Lab, POB 1663, Los Alamos, NM 87545 USA
Source
ISME COMMUNICATIONS | 2024 / Vol. 4 / Issue 01
Keywords
transkingdom network inference; microbiome; bacterial fungal interaction; STATISTICAL-ANALYSIS; COMMUNITIES; BACTERIAL;
DOI
10.1093/ismeco/ycae057
Chinese Library Classification
Q14 [Ecology (Bioecology)];
Discipline classification code
071012 ; 0713 ;
Abstract
Microbial communities are diverse biological systems that include taxa from across multiple kingdoms of life. Notably, interactions between bacteria and fungi play a significant role in determining community structure. However, these statistical associations across kingdoms are more difficult to infer than intra-kingdom associations due to the nature of the data involved using standard network inference techniques. We quantify the challenges of cross-kingdom network inference from both theoretical and practical points of view using synthetic and real-world microbiome data. We detail the theoretical issue presented by combining compositional data sets drawn from the same environment, e.g. 16S and ITS sequencing of a single set of samples, and we survey common network inference techniques for their ability to handle this error. We then test these techniques for the accuracy and usefulness of their intra- and inter-kingdom associations by inferring networks from a set of simulated samples for which a ground-truth set of associations is known. We show that while the two methods mitigate the error of cross-kingdom inference, there is little difference between techniques for key practical applications including identification of strong correlations and identification of possible keystone taxa (i.e. hub nodes in the network). Furthermore, we identify a signature of the error caused by transkingdom network inference and demonstrate that it appears in networks constructed using real-world environmental microbiome data.
Pages: 12