Accurate Single-Cell Clustering through Ensemble Similarity Learning

被引:1
|
作者
Jeong, Hyundoo [1 ]
Shin, Sungtae [2 ]
Yeom, Hong-Gi [3 ]
机构
[1] Incheon Natl Univ, Dept Mechatron Engn, Incheon 22012, South Korea
[2] Dong A Univ, Dept Mech Engn, Busan 49315, South Korea
[3] Chosun Univ, Dept Elect Engn, Gwangju 61452, South Korea
基金
新加坡国家研究基金会;
关键词
single-cell RNA sequencing; zero-inflated noise reduction; ensemble similarity estimation; correspondence network; visualization and clustering; imputation; RNA-SEQ; IDENTIFICATION;
D O I
10.3390/genes12111670
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single-cell sequencing provides novel means to interpret the transcriptomic profiles of individual cells. To obtain in-depth analysis of single-cell sequencing, it requires effective computational methods to accurately predict single-cell clusters because single-cell sequencing techniques only provide the transcriptomic profiles of each cell. Although an accurate estimation of the cell-to-cell similarity is an essential first step to derive reliable single-cell clustering results, it is challenging to obtain the accurate similarity measurement because it highly depends on a selection of genes for similarity evaluations and the optimal set of genes for the accurate similarity estimation is typically unknown. Moreover, due to technical limitations, single-cell sequencing includes a larger number of artificial zeros, and the technical noise makes it difficult to develop effective single-cell clustering algorithms. Here, we describe a novel single-cell clustering algorithm that can accurately predict single-cell clusters in large-scale single-cell sequencing by effectively reducing the zero-inflated noise and accurately estimating the cell-to-cell similarities. First, we construct an ensemble similarity network based on different similarity estimates, and reduce the artificial noise using a random walk with restart framework. Finally, starting from a larger number small size but highly consistent clusters, we iteratively merge a pair of clusters with the maximum similarities until it reaches the predicted number of clusters. Extensive performance evaluation shows that the proposed single-cell clustering algorithm can yield the accurate single-cell clustering results and it can help deciphering the key messages underlying complex biological mechanisms.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
    Xiangjie Li
    Kui Wang
    Yafei Lyu
    Huize Pan
    Jingxiao Zhang
    Dwight Stambolian
    Katalin Susztak
    Muredach P. Reilly
    Gang Hu
    Mingyao Li
    Nature Communications, 11
  • [22] AMC: accurate mutation clustering from single-cell DNA sequencing data
    Yu, Zhenhua
    Du, Fang
    BIOINFORMATICS, 2022, 38 (06) : 1732 - 1734
  • [23] Clustering single-cell multi-omics data via graph regularized multi-view ensemble learning
    Chen, Fuqun
    Zou, Guanhua
    Wu, Yongxian
    Ou-Yang, Le
    BIOINFORMATICS, 2024, 40 (04)
  • [24] scEFSC: Accurate single-cell RNA-seq data analysis via ensemble consensus clustering based on multiple feature selections
    Bian, Chuang
    Wang, Xubin
    Su, Yanchi
    Wang, Yunhe
    Wong, Ka-chun
    Li, Xiangtao
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 2181 - 2197
  • [25] A robust and accurate single-cell data trajectory inference method using ensemble pseudotime
    Yifan Zhang
    Duc Tran
    Tin Nguyen
    Sergiu M. Dascalu
    Frederick C. Harris
    BMC Bioinformatics, 24
  • [26] A robust and accurate single-cell data trajectory inference method using ensemble pseudotime
    Zhang, Yifan
    Tran, Duc
    Nguyen, Tin
    Dascalu, Sergiu M.
    Harris, Frederick C.
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [27] Interpreting Image-based Profiles using Similarity Clustering and Single-Cell Visualization
    Garcia-Fossa, Fernanda
    Cruz, Mario Costa
    Haghighi, Marzieh
    de Jesus, Marcelo Bispo
    Singh, Shantanu
    Carpenter, Anne E.
    Cimini, Beth A.
    CURRENT PROTOCOLS, 2023, 3 (03):
  • [28] EnClaSC: a novel ensemble approach for accurate and robust cell-type classification of single-cell transcriptomes
    Chen, Xiaoyang
    Chen, Shengquan
    Jiang, Rui
    BMC BIOINFORMATICS, 2020, 21 (Suppl 13)
  • [29] EnClaSC: a novel ensemble approach for accurate and robust cell-type classification of single-cell transcriptomes
    Xiaoyang Chen
    Shengquan Chen
    Rui Jiang
    BMC Bioinformatics, 21
  • [30] scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering
    Wang, Yunhe
    Yu, Zhuohan
    Li, Shaochuan
    Bian, Chuang
    Liang, Yanchun
    Wong, Ka-Chun
    Li, Xiangtao
    BIOINFORMATICS, 2023, 39 (02)