Accurate Single-Cell Clustering through Ensemble Similarity Learning

被引:1
|
作者
Jeong, Hyundoo [1 ]
Shin, Sungtae [2 ]
Yeom, Hong-Gi [3 ]
机构
[1] Incheon Natl Univ, Dept Mechatron Engn, Incheon 22012, South Korea
[2] Dong A Univ, Dept Mech Engn, Busan 49315, South Korea
[3] Chosun Univ, Dept Elect Engn, Gwangju 61452, South Korea
基金
新加坡国家研究基金会;
关键词
single-cell RNA sequencing; zero-inflated noise reduction; ensemble similarity estimation; correspondence network; visualization and clustering; imputation; RNA-SEQ; IDENTIFICATION;
D O I
10.3390/genes12111670
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single-cell sequencing provides novel means to interpret the transcriptomic profiles of individual cells. To obtain in-depth analysis of single-cell sequencing, it requires effective computational methods to accurately predict single-cell clusters because single-cell sequencing techniques only provide the transcriptomic profiles of each cell. Although an accurate estimation of the cell-to-cell similarity is an essential first step to derive reliable single-cell clustering results, it is challenging to obtain the accurate similarity measurement because it highly depends on a selection of genes for similarity evaluations and the optimal set of genes for the accurate similarity estimation is typically unknown. Moreover, due to technical limitations, single-cell sequencing includes a larger number of artificial zeros, and the technical noise makes it difficult to develop effective single-cell clustering algorithms. Here, we describe a novel single-cell clustering algorithm that can accurately predict single-cell clusters in large-scale single-cell sequencing by effectively reducing the zero-inflated noise and accurately estimating the cell-to-cell similarities. First, we construct an ensemble similarity network based on different similarity estimates, and reduce the artificial noise using a random walk with restart framework. Finally, starting from a larger number small size but highly consistent clusters, we iteratively merge a pair of clusters with the maximum similarities until it reaches the predicted number of clusters. Extensive performance evaluation shows that the proposed single-cell clustering algorithm can yield the accurate single-cell clustering results and it can help deciphering the key messages underlying complex biological mechanisms.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Deep learning powered single-cell clustering framework with enhanced accuracy and stability
    Zhang, Yi
    Feng, Xi
    Wang, Yin
    Shi, Kai
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [42] A Fusion Learning Model Based on Deep Learning for Single-Cell RNA Sequencing Data Clustering
    Qiao, Tian-Jing
    Li, Feng
    Yuan, Sha-Sha
    Dai, Ling-Yun
    Wang, Juan
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2024, 31 (06) : 576 - 588
  • [43] TripletCell: a deep metric learning framework for accurate annotation of cell types at the single-cell level
    Liu, Yan
    Wei, Guo
    Li, Chen
    Shen, Long-Chen
    Gasser, Robin B.
    Song, Jiangning
    Chen, Dijun
    Yu, Dong-Jun
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [44] Enabling accurate single-cell genome amplification
    Darren J. Burgess
    Nature Reviews Genetics, 2016, 17 (9) : 503 - 503
  • [45] Ensemble Classification through Random Projections for Single-Cell RNA-Seq Data
    Vrahatis, Aristidis G.
    Tasoulis, Sotiris K.
    Georgakopoulos, Spiros V.
    Plagianakos, Vassilis P.
    INFORMATION, 2020, 11 (11) : 1 - 14
  • [46] EnTSSR: A Weighted Ensemble Learning Method to Impute Single-Cell RNA Sequencing Data
    Lu, Fan
    Lin, Yilong
    Yuan, Chongbin
    Zhang, Xiao-Fei
    Le Ou-Yang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2781 - 2787
  • [47] Accurate estimation of stroke risk with fuzzy clustering and ensemble learning methods
    Akyel, Anil
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 77
  • [48] Learning for single-cell assignment
    Duan, Bin
    Zhu, Chenyu
    Chuai, Guohui
    Tang, Chen
    Chen, Xiaohan
    Chen, Shaoqi
    Fu, Shaliu
    Li, Gaoyang
    Liu, Qi
    SCIENCE ADVANCES, 2020, 6 (44):
  • [49] CTEC: a cross-tabulation ensemble clustering approach for single-cell RNA sequencing data analysis
    Wang, Liang
    Hong, Chenyang
    Song, Jiangning
    Yao, Jianhua
    BIOINFORMATICS, 2024, 40 (04)
  • [50] ECBN: Ensemble Clustering based on Bayesian Network inference for Single-cell RNA-seq Data
    Zhang, Dexin
    Zhu, Yuan
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 5884 - 5888