scVAEBGM: Clustering Analysis of Single-Cell ATAC-seq Data Using a Deep Generative Model

Cited by: 2
Authors
Duan, Hongyu [1 ]
Li, Feng [1 ]
Shang, Junliang [1 ]
Liu, Jinxing [1 ]
Li, Yan [2 ]
Liu, Xikui [2 ]
Affiliations
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Peoples R China
[2] Shandong Univ Sci & Technol, Dept Elect Engn & Informat Technol, Jinan 250031, Shandong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
scATAC-seq; Clustering; Deep learning; Variational autoencoder; Bayesian Gaussian-mixture model;
DOI
10.1007/s12539-022-00536-w
Chinese Library Classification number
Q [Biological Sciences];
Subject classification code
07 ; 0710 ; 09 ;
Abstract
Recent advances in single-cell technologies have driven a surge in research. In particular, the single-cell Assay for Transposase-Accessible Chromatin with high-throughput sequencing (scATAC-seq) is a popular approach for analyzing differences in chromatin accessibility at single-cell resolution, either within or between groups. It makes it possible to examine cell heterogeneity at an unprecedented level and to identify both known and unknown cell types. However, the ever-increasing number of cells produced by technological development, together with data characteristics such as high noise, sparsity and dimensionality, makes distinguishing cell types challenging. We propose scVAEBGM, which integrates a Variational Autoencoder (VAE) with a Bayesian Gaussian-mixture model (BGM) to process and analyze scATAC-seq data. The method uses the Bayesian Gaussian-mixture model to estimate the number of cell types without fixing the number of clusters beforehand. In other words, the number of clusters is inferred from the data, which avoids the bias introduced by subjective judgment when the number of clusters is set manually. In addition, the method is more robust to noise and better represents single-cell data in lower dimensions. We also devise a further clustering strategy: experiments indicate that a second round of clustering applied to the completed clustering can further improve clustering accuracy. We test on six public datasets, and scVAEBGM outperforms various dimension-reduction baselines. In downstream applications, scVAEBGM can reveal biological cell types.
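The idea of letting a Bayesian Gaussian-mixture model infer the number of clusters can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes scikit-learn's `BayesianGaussianMixture` as a stand-in for the BGM step, uses synthetic points in place of VAE latent embeddings, and every parameter value (latent dimension, upper bound of 10 components, prior of 0.01) is a hypothetical choice made for illustration.

```python
import numpy as np
from sklearn.mixture import BayesianGaussianMixture

rng = np.random.default_rng(0)

# Assumption: stand-in for VAE latent embeddings.
# Three synthetic "cell types" in a 10-dimensional latent space.
n_per_type, latent_dim = 100, 10
centers = rng.normal(scale=8.0, size=(3, latent_dim))
latent = np.vstack(
    [c + rng.normal(size=(n_per_type, latent_dim)) for c in centers]
)

# Only an upper bound on the number of components is supplied.
# A small weight concentration prior pushes unused components
# toward near-zero weight, so the effective count comes from the data.
max_components = 10
bgm = BayesianGaussianMixture(
    n_components=max_components,
    weight_concentration_prior_type="dirichlet_process",
    weight_concentration_prior=0.01,
    covariance_type="diag",
    max_iter=500,
    random_state=0,
)
labels = bgm.fit_predict(latent)

# Number of clusters that actually received at least one cell.
n_clusters = len(np.unique(labels))
print("clusters used:", n_clusters)
print("mixture weights:", np.round(bgm.weights_, 3))
```

In this sketch the cluster count is read off as the number of components that receive at least one cell, rather than being fixed in advance; components that the data do not support end up with negligible mixture weight.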
Pages: 917-928
Page count: 12
Related papers
50 in total
  • [1] scVAEBGM: Clustering Analysis of Single-Cell ATAC-seq Data Using a Deep Generative Model
    Hongyu Duan
    Feng Li
    Junliang Shang
    Jinxing Liu
    Yan Li
    Xikui Liu
    Interdisciplinary Sciences: Computational Life Sciences, 2022, 14 : 917 - 928
  • [2] A deep generative model for multi-view profiling of single-cell RNA-seq and ATAC-seq data
    Li, Gaoyang
    Fu, Shaliu
    Wang, Shuguang
    Zhu, Chenyu
    Duan, Bin
    Tang, Chen
    Chen, Xiaohan
    Chuai, Guohui
    Wang, Ping
    Liu, Qi
    GENOME BIOLOGY, 2022, 23 (01)
  • [3] A deep generative model for multi-view profiling of single-cell RNA-seq and ATAC-seq data
    Gaoyang Li
    Shaliu Fu
    Shuguang Wang
    Chenyu Zhu
    Bin Duan
    Chen Tang
    Xiaohan Chen
    Guohui Chuai
    Ping Wang
    Qi Liu
    Genome Biology, 23
  • [4] Assessment of computational methods for the analysis of single-cell ATAC-seq data
    Chen, Huidong
    Lareau, Caleb A.
    Andreani, Tommaso
    Vinyard, Michael E.
    Garcia, Sara P.
    Clement, Kendell
    Andrade-Navarro, Miguel
    Buenrostro, Jason D.
    Pinello, Luca
    GENOME BIOLOGY, 2019, 20 (01)
  • [5] Assessment of computational methods for the analysis of single-cell ATAC-seq data
    Huidong Chen
    Caleb Lareau
    Tommaso Andreani
    Michael E. Vinyard
    Sara P. Garcia
    Kendell Clement
    Miguel A. Andrade-Navarro
    Jason D. Buenrostro
    Luca Pinello
    Genome Biology, 20
  • [6] Simultaneous dimensionality reduction and integration for single-cell ATAC-seq data using deep learning
    Kopp, Wolfgang
    Akalin, Altuna
    Ohler, Uwe
    NATURE MACHINE INTELLIGENCE, 2022, 4 (02) : 162 - +
  • [7] Simultaneous dimensionality reduction and integration for single-cell ATAC-seq data using deep learning
    Wolfgang Kopp
    Altuna Akalin
    Uwe Ohler
    Nature Machine Intelligence, 2022, 4 : 162 - 168
  • [8] Single-cell ATAC-seq: strength in numbers
    Pott, Sebastian
    Lieb, Jason D.
    GENOME BIOLOGY, 2015, 16
  • [9] Single-cell ATAC-seq: strength in numbers
    Sebastian Pott
    Jason D. Lieb
    Genome Biology, 16
  • [10] Fundamental and practical approaches for single-cell ATAC-seq analysis
    Shi, Peiyu
    Nie, Yage
    Yang, Jiawen
    Zhang, Weixing
    Tang, Zhongjie
    Xu, Jin
    ABIOTECH, 2022, 3 (03) : 212 - 223