BAYESIAN MODELING OF INTERACTION BETWEEN FEATURES IN SPARSE MULTIVARIATE COUNT DATA WITH APPLICATION TO MICROBIOME STUDY

被引:0
|
作者
Zhang, Shuangjie [1 ]
Shen, Yuning [2 ]
Chen, Irene A. [2 ]
Lee, Juhee [1 ]
机构
[1] Univ Calif Santa Cruz, Dept Stat, Santa Cruz, CA 95064 USA
[2] Univ Calif Los Angeles, Dept Chem & Biomol Engn, Los Angeles, CA USA
来源
ANNALS OF APPLIED STATISTICS | 2023年 / 17卷 / 03期
关键词
Covariance matrix; differential abundance; factor model; joint sparsity; kernel model; zero inflation; multivariate count data; MULTINOMIAL REGRESSION-MODEL; POSTERIOR CONTRACTION; COMPOSITIONAL DATA; COVARIANCE; RATES;
D O I
10.1214/22-AOAS1690
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many statistical methods have been developed for the analysis of microbial community profiles, but due to the complexity of typical microbiome measurements, inference of interactions between microbial features remains challenging. We develop a Bayesian zero-inflated rounded log-normal kernel method to model interaction between microbial features in a community using multivariate count data in the presence of covariates and excess zeros. The model carefully constructs the interaction structure by imposing joint sparsity on the covariance matrix of the kernel and obtains a reliable estimate of the structure with a small sample size. The model also includes zero inflation to account for excess zeros observed in data and infers differential abundance of microbial features associated with covariates through log-linear regression. We provide simulation studies and real data analysis examples to demonstrate the developed model. Comparison of the model to a simpler model and popular alternatives in simulation studies shows that, in addition to an added and important insight on the feature interaction, it yields superior parameter estimates and model fit in various settings.
引用
收藏
页码:1861 / 1883
页数:23
相关论文
共 50 条
  • [41] Bayesian modeling of multivariate spatial binary data with applications to dental caries
    Bandyopadhyay, Dipankar
    Reich, Brian J.
    Slate, Elizabeth H.
    [J]. STATISTICS IN MEDICINE, 2009, 28 (28) : 3492 - 3508
  • [42] A NEW UTILITY-CONSISTENT ECONOMETRIC APPROACH TO MULTIVARIATE COUNT DATA MODELING
    Bhat, Chandra R.
    Paleti, Rajesh
    Castro, Marisol
    [J]. JOURNAL OF APPLIED ECONOMETRICS, 2015, 30 (05) : 806 - 825
  • [43] IRT-ZIP Modeling for Multivariate Zero-Inflated Count Data
    Wang, Lijuan
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2010, 35 (06) : 671 - 692
  • [44] Genomic Bayesian Prediction Model for Count Data with Genotype x Environment Interaction
    Montesinos-Lopez, Abelardo
    Montesinos-Lopez, Osval A.
    Crossa, Jose
    Burgueno, Juan
    Eskridge, Kent M.
    Falconi-Castillo, Esteban
    He, Xinyao
    Singh, Pawan
    Cichy, Karen
    [J]. G3-GENES GENOMES GENETICS, 2016, 6 (05): : 1165 - 1177
  • [45] BAYESIAN SPARSE GRAPHICAL MODELS FOR CLASSIFICATION WITH APPLICATION TO PROTEIN EXPRESSION DATA
    Baladandayuthapani, Veerabhadran
    Talluri, Rajesh
    Ji, Yuan
    Coombes, Kevin R.
    Lu, Yiling
    Hennessy, Bryan T.
    Davies, Michael A.
    Mallick, Bani K.
    [J]. ANNALS OF APPLIED STATISTICS, 2014, 8 (03): : 1443 - 1468
  • [46] Joint Modeling of Multivariate Survival Data With an Application to Retirement
    Li, Grace
    Lesperance, Mary
    Wu, Zheng
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2022, 51 (04) : 1920 - 1946
  • [47] A Bayesian Analysis in the Presence of Covariates for Multivariate Survival Data: An example of Application
    Santos, Carlos Aparecido
    Achcar, Jorge Alberto
    [J]. REVISTA COLOMBIANA DE ESTADISTICA, 2011, 34 (01): : 111 - 131
  • [48] SuRF: A new method for sparse variable selection, with application in microbiome data analysis
    Liu, Lihui
    Gu, Hong
    Van Limbergen, Johan
    Kenney, Toby
    [J]. STATISTICS IN MEDICINE, 2021, 40 (04) : 897 - 919
  • [49] Probabilistic outlier detection for sparse multivariate geotechnical site investigation data using Bayesian learning
    Shuo Zheng
    Yu-Xin Zhu
    Dian-Qing Li
    Zi-Jun Cao
    Qin-Xuan Deng
    Kok-Kwang Phoon
    [J]. Geoscience Frontiers, 2021, 12 (01) : 425 - 439
  • [50] Probabilistic outlier detection for sparse multivariate geotechnical site investigation data using Bayesian learning
    Zheng, Shuo
    Zhu, Yu-Xin
    Li, Dian-Qing
    Cao, Zi-Jun
    Deng, Qin-Xuan
    Phoon, Kok-Kwang
    [J]. GEOSCIENCE FRONTIERS, 2021, 12 (01) : 425 - 439