scVAEBGM: Clustering Analysis of Single-Cell ATAC-seq Data Using a Deep Generative Model

被引:2
|
作者
Duan, Hongyu [1 ]
Li, Feng [1 ]
Shang, Junliang [1 ]
Liu, Jinxing [1 ]
Li, Yan [2 ]
Liu, Xikui [2 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Peoples R China
[2] Shandong Univ Sci & Technol, Dept Elect Engn & Informat Technol, Jinan 250031, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
scATAC-seq; Clustering; Deep learning; Variational autoencoder; Bayesian Gaussian-mixture model;
D O I
10.1007/s12539-022-00536-w
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A surge in research has occurred because of current developments in single-cell technologies. Above all, single-cell Assay for Transposase-Accessible Chromatin with high throughput sequencing (scATAC-seq) is a popular approach of analyzing chromatin accessibility differences at the level of single cell, either within or between groups. As a result, it is critical to examine cell heterogeneity at a previously unseen level and to identify both recognized and unknown cell types. However, with the ever-increasing number of cells engendered by technological development and the characteristics of the data, such as high noise, sparsity and dimension, challenges in distinguishing cell types have emerged. We propose scVAEBGM, which integrates a Variational Autoencoder (VAE) with a Bayesian Gaussian-mixture model (BGM) to process and analyze scATAC-seq data. This method combines and takes benefits of a Bayesian Gaussian mixture model to estimate the number of cell types without determining the cluster number in a beforehand. In other words, the size of the clusters is inferred from the data, thus avoiding biases introduced by subjective assessments when manually determining the size of the clusters. Additionally, the method is more robust to noise and can better represent single-cell data in lower dimensions. We also create a further clustering strategy. It is indicated by experiments that further clustering based on the already completed clustering can improve the clustering accuracy again. We test on six public datasets, and scVAEBGM outperforms various dimension reduction baselines. In downstream applications, scVAEBGM can reveal biological cell types. [GRAPHICS] .
引用
下载
收藏
页码:917 / 928
页数:12
相关论文
共 50 条
  • [41] epiAneufinder identifies copy number alterations from single-cell ATAC-seq data
    Akshaya Ramakrishnan
    Aikaterini Symeonidi
    Patrick Hanel
    Katharina T. Schmid
    Maria L. Richter
    Michael Schubert
    Maria Colomé-Tatché
    Nature Communications, 14
  • [42] SCALE method for single-cell ATAC-seq analysis via latent feature extraction
    Lei Xiong
    Kui Xu
    Kang Tian
    Yanqiu Shao
    Lei Tang
    Ge Gao
    Michael Zhang
    Tao Jiang
    Qiangfeng Cliff Zhang
    Nature Communications, 10
  • [43] SCALE method for single-cell ATAC-seq analysis via latent feature extraction
    Xiong, Lei
    Xu, Kui
    Tian, Kang
    Shao, Yanqiu
    Tang, Lei
    Gao, Ge
    Zhang, Michael
    Jiang, Tao
    Zhang, Qiangfeng Cliff
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [44] Integrative Single-Cell RNA-Seq and ATAC-Seq Analysis of Mouse Corneal Epithelial Cells
    Lu, Zhao-Jing
    Ye, Jin-Guo
    Wang, Dong-Liang
    Li, Meng-Ke
    Zhang, Qi-Kai
    Liu, Zhong
    Huang, Yan-Jing
    Pan, Cai-Neng
    Lin, Yu-Heng
    Shi, Zhuo-Xing
    Zheng, Ying-Feng
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2023, 64 (03)
  • [45] Benchmarking algorithms for joint integration of unpaired and paired single-cell RNA-seq and ATAC-seq data
    Lee M.Y.Y.
    Kaestner K.H.
    Li M.
    Genome Biology, 24 (1)
  • [46] Integrative Analysis of Single-Cell RNA-Seq and ATAC-Seq Data across Treatment Time Points in Pediatric AML
    Wei, Lisa
    Trinh, Diane
    Ries, Rhonda E.
    Jin, Dan
    Corbett, Richard D.
    Smith, Jenny L.
    Furlan, Scott N.
    Meshinchi, Soheil
    Marra, Marco A.
    BLOOD, 2020, 136
  • [47] Epi-Impute: Single-Cell RNA-seq Imputation via Integration with Single-Cell ATAC-seq
    Raevskiy, Mikhail
    Yanvarev, Vladislav
    Jung, Sascha
    Del Sol, Antonio
    Medvedeva, Yulia A.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (07)
  • [48] ATACAmp: a tool for detecting ecDNA/HSRs from bulk and single-cell ATAC-seq data
    Hansen Cheng
    Wenhao Ma
    Kun Wang
    Han Chu
    Guangchao Bao
    Yu Liao
    Yawen Yuan
    Yixiong Gou
    Liting Dong
    Jian Yang
    Haoyang Cai
    BMC Genomics, 24
  • [49] ATACAmp: a tool for detecting ecDNA/HSRs from bulk and single-cell ATAC-seq data
    Cheng, Hansen
    Ma, Wenhao
    Wang, Kun
    Chu, Han
    Bao, Guangchao
    Liao, Yu
    Yuan, Yawen
    Gou, Yixiong
    Dong, Liting
    Yang, Jian
    Cai, Haoyang
    BMC GENOMICS, 2023, 24 (01)
  • [50] Translator: A Transfer Learning Approach to Facilitate Single-Cell ATAC-Seq Data Analysis from Reference Dataset
    Xu, Siwei
    Skarica, Mario
    Hwang, Ahyeon
    Dai, Yi
    Lee, Cheyu
    Girgenti, Matthew J.
    Zhang, Jing
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (07) : 619 - 633