coda4microbiome: compositional data analysis for microbiome cross-sectional and longitudinal studies

被引:23
|
作者
Calle, M. Luz [1 ]
Pujolassos, Meritxell [1 ]
Susin, Antoni [2 ]
机构
[1] Univ Vic, Cent Univ Catalonia, Fac Sci Tech Engn, Biosci Dept, Carrer Laura 13, Vic 08500, Spain
[2] UPC Barcelona Tech, Math Dept, Barcelona, Spain
关键词
Compositional data analysis; Log-ratio analysis; Longitudinal studies; Microbiome analysis; Microbial signatures; Penalized regression; DIFFERENTIAL ABUNDANCE ANALYSIS; LINEAR-MODELS; GUT;
D O I
10.1186/s12859-023-05205-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: One of the main challenges of microbiome analysis is its compositional nature that if ignored can lead to spurious results. Addressing the compositional structure of microbiome data is particularly critical in longitudinal studies where abundances measured at different times can correspond to different sub-compositions.Results: We developed coda4microbiome, a new R package for analyzing microbiome data within the Compositional Data Analysis (CoDA) framework in both, cross-sectional and longitudinal studies. The aim of coda4microbiome is prediction, more specifically, the method is designed to identify a model (microbial signature) containing the minimum number of features with the maximum predictive power. The algorithm relies on the analysis of log-ratios between pairs of components and variable selection is addressed through penalized regression on the "all-pairs log-ratio model ", the model containing all possible pairwise log-ratios. For longitudinal data, the algorithm infers dynamic microbial signatures by performing penalized regression over the summary of the log-ratio trajectories (the area under these trajectories). In both, cross-sectional and longitudinal studies, the inferred microbial signature is expressed as the (weighted) balance between two groups of taxa, those that contribute positively to the microbial signature and those that contribute negatively. The package provides several graphical representations that facilitate the interpretation of the analysis and the identified microbial signatures. We illustrate the new method with data from a Crohn's disease study (cross-sectional data) and on the developing microbiome of infants (longitudinal data).Conclusions: coda4microbiome is a new algorithm for identification of microbial signatures in both, cross-sectional and longitudinal studies. The algorithm is implemented as an R package that is available at CRAN () and is accompanied with a vignette with a detailed description of the functions.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] coda4microbiome: compositional data analysis for microbiome cross-sectional and longitudinal studies
    M. Luz Calle
    Meritxell Pujolassos
    Antoni Susin
    BMC Bioinformatics, 24
  • [2] Microbiome compositional data analysis for survival studies
    Pujolassos, Meritxell
    Susin, Antoni
    Calle, M. Luz
    NAR GENOMICS AND BIOINFORMATICS, 2024, 6 (02)
  • [3] COMPOSITIONAL MEDIATION ANALYSIS FOR MICROBIOME STUDIES
    Sohn, Michael B.
    Li, Hongzhe
    ANNALS OF APPLIED STATISTICS, 2019, 13 (01): : 661 - 681
  • [4] REGRESSION ANALYSIS FOR MICROBIOME COMPOSITIONAL DATA
    Shi, Pixu
    Zhang, Anru
    Li, Hongzhe
    ANNALS OF APPLIED STATISTICS, 2016, 10 (02): : 1019 - 1040
  • [5] A cross-sectional analysis of the urine microbiome of children with neuropathic bladders
    Forster, Catherine S.
    Panchapakesan, Karuna
    Stroud, Crystal
    Banerjee, Payal
    Gordish-Dressman, Heather
    Hsieh, Michael H.
    JOURNAL OF PEDIATRIC UROLOGY, 2020, 16 (05) : 593.e1 - 593.e8
  • [6] Bayesian Hierarchical Compositional Models for Analysing Longitudinal Abundance Data from Microbiome Studies
    Marti, I. Creus
    Moya, A.
    Santonja, F. J.
    COMPLEXITY, 2022, 2022
  • [7] Variable selection in microbiome compositional data analysis
    Susin, Antoni
    Wang, Yiwen
    Cao, Kim-Anh Le
    Calle, M. Luz
    NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (02)
  • [8] The Dynamics of the Gut Microbiome in Rheumatoid Arthritis Susceptibility: A Cross-Sectional and Longitudinal Observational Study
    Rooney, Christopher
    Jeffery, Ian
    Mankia, Kulveer
    Wilcox, Mark
    Emery, Paul
    ARTHRITIS & RHEUMATOLOGY, 2023, 75 : 3511 - 3514
  • [9] COMPONENT ANALYSIS IN CROSS-SECTIONAL AND LONGITUDINAL DATA
    MILLSAP, RE
    MEREDITH, W
    PSYCHOMETRIKA, 1988, 53 (01) : 123 - 134
  • [10] Compositional data analysis of the microbiome: fundamentals, tools, and challenges
    Tsilimigras, Matthew C. B.
    Fodor, Anthony A.
    ANNALS OF EPIDEMIOLOGY, 2016, 26 (05) : 330 - 335