Consensus clustering of single-cell RNA-seq data by enhancing network affinity

被引:29
|
作者
Cui, Yaxuan [1 ]
Zhang, Shaoqiang [1 ]
Liang, Ying [1 ]
Wang, Xiangyun [1 ]
Ferraro, Thomas N. [2 ]
Chen, Yong [3 ]
机构
[1] Tianjin Normal Univ, Coll Comp & Informat Engn, Tianjin 300387, Peoples R China
[2] CMSRU, Dept Biomed Sci, Camden, NJ USA
[3] Rowan Univ, Dept Mol & Cellular Biosci, Camden, NJ 08028 USA
基金
美国国家科学基金会;
关键词
single-cell RNA-seq; clustering algorithm; bioinformatics; cell typing; GENE-EXPRESSION; HETEROGENEITY; EMBRYOS; STATES; ATLAS; FATE;
D O I
10.1093/bib/bbab236
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Elucidation of cell subpopulations at high resolution is a key and challenging goal of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) data analysis. Although unsupervised clustering methods have been proposed for de novo identification of cell populations, their performance and robustness suffer from the high variability, low capture efficiency and high dropout rates which are characteristic of scRNA-seq experiments. Here, we present a novel unsupervised method for Single-cell Clustering by Enhancing Network Affinity (SCENA), which mainly employed three strategies: selecting multiple gene sets, enhancing local affinity among cells and clustering of consensus matrices. Large-scale validations on 13 real scRNA-seq datasets show that SCENA has high accuracy in detecting cell populations and is robust against dropout noise. When we applied SCENA to large-scale scRNA-seq data of mouse brain cells, known cell types were successfully detected, and novel cell types of interneurons were identified with differential expression of gamma-aminobutyric acid receptor subunits and transporters. SCENA is equipped with CPU+GPU (Central Processing Units+Graphics Processing Units) heterogeneous parallel computing to achieve high running speed. The high performance and running speed of SCENA combine into a new and efficient platform for biological discoveries in clustering analysis of large and diverse scRNA-seq datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] An interpretable framework for clustering single-cell RNA-Seq datasets
    Jesse M. Zhang
    Jue Fan
    H. Christina Fan
    David Rosenfeld
    David N. Tse
    BMC Bioinformatics, 19
  • [32] scMAE: a masked autoencoder for single-cell RNA-seq clustering
    Fang, Zhaoyu
    Zheng, Ruiqing
    Li, Min
    BIOINFORMATICS, 2024, 40 (01)
  • [33] Single-cell RNA-seq clustering: datasets, models, and algorithms
    Peng, Lihong
    Tian, Xiongfei
    Tian, Geng
    Xu, Junlin
    Huang, Xin
    Weng, Yanbin
    Yang, Jialiang
    Zhou, Liqian
    RNA BIOLOGY, 2020, 17 (06) : 765 - 783
  • [34] Improving Single-Cell RNA-seq Clustering by Integrating Pathways
    Zhang, Chenxing
    Gao, Lin
    Wang, Bingbo
    Gao, Yong
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [35] An interpretable framework for clustering single-cell RNA-Seq datasets
    Zhang, Jesse M.
    Fan, Jue
    Fan, Christina
    Rosenfeld, David
    Tse, David N.
    BMC BIOINFORMATICS, 2018, 19
  • [36] Comparison of transformations for single-cell RNA-seq data
    Constantin Ahlmann-Eltze
    Wolfgang Huber
    Nature Methods, 2023, 20 : 665 - 672
  • [37] An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
    Sun, Xifang
    Sun, Shiquan
    Yang, Sheng
    CELLS, 2019, 8 (10)
  • [38] Comparison of transformations for single-cell RNA-seq data
    Ahlmann-Eltze, Constantin
    Huber, Wolfgang
    NATURE METHODS, 2023, 20 (05) : 665 - +
  • [39] TiC2D: Trajectory Inference From Single-Cell RNA-Seq Data Using Consensus Clustering
    Gan, Yanglan
    Li, Ning
    Guo, Cheng
    Zou, Guobing
    Guan, Jihong
    Zhou, Shuigeng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (04) : 2512 - 2522
  • [40] ccImpute: an accurate and scalable consensus clustering based algorithm to impute dropout events in the single-cell RNA-seq data
    Malec, Marcin
    Kurban, Hasan
    Dalkilic, Mehmet
    BMC BIOINFORMATICS, 2022, 23 (01)