Mixtures of logistic normal multinomial regression models for microbiome data

被引:0
|
作者
Dai, Wenshu [1 ]
Fang, Yuan [2 ]
Subedi, Sanjeena [3 ]
机构
[1] Binghamton Univ, Dept Math & Stat, Binghamton, NY USA
[2] Binghamton Univ, Dept Pharmaceut Sci, Binghamton, NY USA
[3] Carleton Univ, Sch Math & Stat, Ottawa, ON, Canada
关键词
Clustering; regression-based mixture models; microbiome data; logistic-normal multinomial model; variational Gaussian approximation; LIKELIHOOD;
D O I
10.1080/02664763.2024.2383286
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the realm of bioinformatics, we frequently encounter discrete data, particularly microbiome taxa count data obtained through 16S rRNA sequencing. These microbiome datasets are commonly characterized by their high dimensionality and the ability to provide insights solely into relative abundance, necessitating their classification as compositional data. Analyzing such data presents challenges due to their confinement within a simplex. Additionally, microbiome taxa counts are subject to influence by various biological and environmental factors like age, gender, and diet. Thus, we have developed a novel approach involving regression-based mixtures of logistic normal multinomial models for clustering microbiome data. These models effectively categorize samples into more homogeneous subpopulations, enabling the exploration of relationships between bacterial abundance and biological or environmental covariates within each identified group. To enhance the accuracy and efficiency of parameter estimation, we employ a robust framework based on variational Gaussian approximations (VGA). Our proposed method's effectiveness is demonstrated through its application to simulated and real datasets.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Clustering microbiome data using mixtures of logistic normal multinomial models
    Yuan Fang
    Sanjeena Subedi
    [J]. Scientific Reports, 13
  • [2] Clustering microbiome data using mixtures of logistic normal multinomial models
    Fang, Yuan
    Subedi, Sanjeena
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [3] A Logistic Normal Multinomial Regression Model for Microbiome Compositional Data Analysis
    Xia, Fan
    Chen, Jun
    Fung, Wing Kam
    Li, Hongzhe
    [J]. BIOMETRICS, 2013, 69 (04) : 1053 - 1063
  • [4] Cluster analysis of microbiome data by using mixtures of Dirichlet-multinomial regression models
    Subedi, Sanjeena
    Neish, Drew
    Bak, Stephen
    Feng, Zeny
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2020, 69 (05) : 1163 - 1187
  • [5] Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data
    Tu, Wangshu
    Subedi, Sanjeena
    [J]. JOURNAL OF CLASSIFICATION, 2023, 40 (03) : 638 - 667
  • [6] Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data
    Wangshu Tu
    Sanjeena Subedi
    [J]. Journal of Classification, 2023, 40 : 638 - 667
  • [7] Multinomial logistic regression
    Kwak, C
    Clayton-Matthews, A
    [J]. NURSING RESEARCH, 2002, 51 (06) : 406 - 412
  • [8] Confidence intervals for multinomial logistic regression in sparse data
    Bull, Shelley B.
    Lewinger, Juan Pablo
    Lee, Sophia S. F.
    [J]. STATISTICS IN MEDICINE, 2007, 26 (04) : 903 - 918
  • [9] Multinomial Principal Component Logistic Regression on Shape Data
    Moghimbeygi, Meisam
    Nodehi, Anahita
    [J]. JOURNAL OF CLASSIFICATION, 2022, 39 (03) : 578 - 599
  • [10] Multinomial Principal Component Logistic Regression on Shape Data
    Meisam Moghimbeygi
    Anahita Nodehi
    [J]. Journal of Classification, 2022, 39 : 578 - 599