Accurate Single-Cell Clustering through Ensemble Similarity Learning

Cited by: 1
Authors
Jeong, Hyundoo [1]
Shin, Sungtae [2]
Yeom, Hong-Gi [3]
Affiliations
[1] Incheon Natl Univ, Dept Mechatron Engn, Incheon 22012, South Korea
[2] Dong A Univ, Dept Mech Engn, Busan 49315, South Korea
[3] Chosun Univ, Dept Elect Engn, Gwangju 61452, South Korea
Funding
National Research Foundation of Singapore;
Keywords
single-cell RNA sequencing; zero-inflated noise reduction; ensemble similarity estimation; correspondence network; visualization and clustering; imputation; RNA-SEQ; IDENTIFICATION;
DOI
10.3390/genes12111670
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Single-cell sequencing provides a novel means of interpreting the transcriptomic profiles of individual cells. In-depth analysis of single-cell sequencing data requires effective computational methods that accurately predict single-cell clusters, because single-cell sequencing techniques provide only the transcriptomic profile of each cell. Although accurate estimation of cell-to-cell similarity is an essential first step toward reliable single-cell clustering, obtaining an accurate similarity measurement is challenging: it depends heavily on the genes selected for similarity evaluation, and the optimal gene set for accurate similarity estimation is typically unknown. Moreover, due to technical limitations, single-cell sequencing data include a large number of artificial zeros, and this technical noise makes it difficult to develop effective single-cell clustering algorithms. Here, we describe a novel single-cell clustering algorithm that accurately predicts single-cell clusters in large-scale single-cell sequencing data by effectively reducing the zero-inflated noise and accurately estimating cell-to-cell similarities. First, we construct an ensemble similarity network based on different similarity estimates and reduce the artificial noise using a random walk with restart framework. Finally, starting from a large number of small but highly consistent clusters, we iteratively merge the pair of clusters with the maximum similarity until the predicted number of clusters is reached. Extensive performance evaluation shows that the proposed single-cell clustering algorithm yields accurate clustering results and can help decipher the key messages underlying complex biological mechanisms.
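The three steps named in the abstract (ensemble similarity, random walk with restart, iterative cluster merging) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the use of Pearson correlation over random gene subsets as the ensemble, the restart probability of 0.5, and the choice to start from singleton clusters instead of small pre-built clusters are all assumptions made for illustration.

```python
import numpy as np

def ensemble_similarity(X, n_subsets=5, subset_frac=0.5, seed=0):
    """Average cell-to-cell Pearson correlation over random gene subsets.
    X is (cells x genes). The subset scheme is an assumed stand-in for
    the paper's ensemble of similarity estimates."""
    rng = np.random.default_rng(seed)
    n_cells, n_genes = X.shape
    k = max(2, int(subset_frac * n_genes))
    S = np.zeros((n_cells, n_cells))
    for _ in range(n_subsets):
        genes = rng.choice(n_genes, size=k, replace=False)
        with np.errstate(divide="ignore", invalid="ignore"):
            C = np.corrcoef(X[:, genes])
        S += np.nan_to_num(C)        # constant rows give NaN; treat as 0
    S /= n_subsets
    S = np.clip(S, 0.0, None)        # keep only positive similarities
    np.fill_diagonal(S, 0.0)         # no self-loops in the network
    return S

def random_walk_with_restart(S, restart=0.5):
    """Closed-form steady state of a random walk with restart.
    Column j holds the visiting probabilities for a walker that
    restarts at cell j."""
    col_sums = S.sum(axis=0, keepdims=True)
    col_sums[col_sums == 0] = 1.0    # avoid division by zero
    W = S / col_sums                 # column-normalized transition matrix
    n = S.shape[0]
    return restart * np.linalg.inv(np.eye(n) - (1.0 - restart) * W)

def merge_clusters(P, n_clusters):
    """Greedily merge the pair of clusters with the highest mean
    similarity until n_clusters remain (singleton start is assumed)."""
    P = (P + P.T) / 2.0              # symmetrize the smoothed scores
    clusters = [[i] for i in range(P.shape[0])]
    while len(clusters) > n_clusters:
        best, pair = -np.inf, None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                score = P[np.ix_(clusters[a], clusters[b])].mean()
                if score > best:
                    best, pair = score, (a, b)
        a, b = pair
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    labels = np.empty(P.shape[0], dtype=int)
    for c, members in enumerate(clusters):
        labels[members] = c
    return labels

if __name__ == "__main__":
    # Toy data: two groups of cells expressing disjoint gene blocks.
    rng = np.random.default_rng(1)
    X = np.zeros((6, 20))
    X[:3, :10] = rng.poisson(20, size=(3, 10))
    X[3:, 10:] = rng.poisson(20, size=(3, 10))
    P = random_walk_with_restart(ensemble_similarity(X))
    print(merge_clusters(P, n_clusters=2))
```

The pairwise search in `merge_clusters` is quadratic in the number of clusters at every step, so the sketch is only suitable for small examples. A large-scale method would need a more efficient merge strategy.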
Pages: 21
Related papers
  • [1] GRACE: Graph autoencoder based single-cell clustering through ensemble similarity learning
    Ha, Jun Seo
    Jeong, Hyundoo
    PLOS ONE, 2023, 18 (04)
  • [2] Effective single-cell clustering through ensemble feature selection and similarity measurements
    Jeong, Hyundoo
    Khunlertgit, Navadon
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 87
  • [3] Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
    Yu, Lijia
    Liu, Chunlei
    Yang, Jean Yee Hwa
    Yang, Pengyi
    BIOINFORMATICS, 2023, 39 (06)
  • [4] A Global Similarity Learning for Clustering of Single-Cell RNA-Seq Data
    Zhu, Xiaoshu
    Guo, Lilu
    Xu, Yunpei
    Li, Hong-Dong
    Liao, Xingyu
    Wu, Fang-Xiang
    Peng, Xiaoqing
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019 : 261 - 266
  • [5] Clustering single-cell RNA-seq data by rank constrained similarity learning
    Mei, Qinglin
    Li, Guojun
    Su, Zhengchang
    BIOINFORMATICS, 2021, 37 (19) : 3235 - 3242
  • [6] SAFE-clustering: Single-cell Aggregated (from Ensemble) clustering for single-cell RNA-seq data
    Yang, Yuchen
    Huh, Ruth
    Culpepper, Houston W.
    Lin, Yuan
    Love, Michael I.
    Li, Yun
    BIOINFORMATICS, 2019, 35 (08) : 1269 - 1277
  • [7] Contrastive Learning in Single-cell Multiomics Clustering
    Li, Bingjun
    Nabavi, Sheida
    14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023
  • [8] CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data
    Lin, Peijie
    Troup, Michael
    Ho, Joshua W. K.
    GENOME BIOLOGY, 2017, 18
  • [9] CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data
    Peijie Lin
    Michael Troup
    Joshua W. K. Ho
    Genome Biology, 18
  • [10] Developing ensemble clustering through similarity measures: A semi-supervised hierarchical clustering learning
    Wang, Dandan
    Li, Qi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (16)