A Bayesian multivariate mixture model for high throughput spatial transcriptomics

被引:6
|
作者
Allen, Carter [1 ,4 ]
Chang, Yuzhou [1 ,4 ]
Neelon, Brian [2 ]
Chang, Won [3 ]
Kim, Hang J. [3 ]
Li, Zihai [4 ]
Ma, Qin [1 ,4 ]
Chung, Dongjun [1 ,4 ]
机构
[1] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[2] Med Univ South Carolina, Dept Publ Hlth Sci, Charleston, SC 29425 USA
[3] Univ Cincinnati, Div Stat & Data Sci, Cincinnati, OH USA
[4] Ohio State Univ, Comprehens Canc Ctr, Pelotonia Inst Immunooncol, Columbus, OH 43210 USA
关键词
Bayesian models; conditionally autoregressive models; mixture models; skew-normal; spatial transcriptomics; SINGLE-CELL; INFERENCE;
D O I
10.1111/biom.13727
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
High throughput spatial transcriptomics (HST) is a rapidly emerging class of experimental technologies that allow for profiling gene expression in tissue samples at or near single-cell resolution while retaining the spatial location of each sequencing unit within the tissue sample. Through analyzing HST data, we seek to identify sub-populations of cells within a tissue sample that may inform biological phenomena. Existing computational methods either ignore the spatial heterogeneity in gene expression profiles, fail to account for important statistical features such as skewness, or are heuristic-based network clustering methods that lack the inferential benefits of statistical modeling. To address this gap, we develop SPRUCE: a Bayesian spatial multivariate finite mixture model based on multivariate skew-normal distributions, which is capable of identifying distinct cellular sub-populations in HST data. We further implement a novel combination of Polya-Gamma data augmentation and spatial random effects to infer spatially correlated mixture component membership probabilities without relying on approximate inference techniques. Via a simulation study, we demonstrate the detrimental inferential effects of ignoring skewness or spatial correlation in HST data. Using publicly available human brain HST data, SPRUCE outperforms existing methods in recovering expertly annotated brain layers. Finally, our application of SPRUCE to human breast cancer HST data indicates that SPRUCE can distinguish distinct cell populations within the tumor microenvironment. An R package spruce for fitting the proposed models is available through The Comprehensive R Archive Network.
引用
收藏
页码:1775 / 1787
页数:13
相关论文
共 50 条
  • [1] Recent advances in high-throughput single-cell transcriptomics and spatial transcriptomics
    Shen, Xiaohan
    Zhao, Yichun
    Wang, Zhuo
    Shi, Qihui
    LAB ON A CHIP, 2022, 22 (24) : 4774 - 4791
  • [2] Bayesian Inference on a Mixture Model With Spatial Dependence
    Cucala, Lionel
    Marin, Jean-Michel
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2013, 22 (03) : 584 - 597
  • [3] Bayesian analysis of a multivariate spatial ordered probit model
    Gao, Ping
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (04) : 1300 - 1317
  • [4] A Bayesian mixture model to quantify parameters of spatial clustering
    Schaefer, Martin
    Radon, Yvonne
    Klein, Thomas
    Herrmann, Sabrina
    Schwender, Holger
    Verveer, Peter J.
    Ickstadt, Katja
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 92 : 163 - 176
  • [5] High-throughput transcriptomics
    Nunzio D’Agostino
    Wenli Li
    Dapeng Wang
    Scientific Reports, 12
  • [6] Transcriptomics in a high throughput world
    Chad Nusbaum
    Genome Biology, 11 (Suppl 1)
  • [7] Transcriptomics in a high throughput world
    Nusbaum, Chad
    GENOME BIOLOGY, 2010, 11
  • [8] High-throughput transcriptomics
    D'Agostino, Nunzio
    Li, Wenli
    Wang, Dapeng
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [9] Bayesian Inference in Infinite Multivariate McDonald's Beta Mixture Model
    Forouzanfar, Darya
    Manouchehri, Narges
    Bouguila, Nizar
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 320 - 330
  • [10] Bayesian Dirichlet mixture model for multivariate extremes: A re-parametrization
    Sabourin, Anne
    Naveau, Philippe
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 542 - 567