Accurate Single-Cell Clustering through Ensemble Similarity Learning

被引:1
|
作者
Jeong, Hyundoo [1 ]
Shin, Sungtae [2 ]
Yeom, Hong-Gi [3 ]
机构
[1] Incheon Natl Univ, Dept Mechatron Engn, Incheon 22012, South Korea
[2] Dong A Univ, Dept Mech Engn, Busan 49315, South Korea
[3] Chosun Univ, Dept Elect Engn, Gwangju 61452, South Korea
基金
新加坡国家研究基金会;
关键词
single-cell RNA sequencing; zero-inflated noise reduction; ensemble similarity estimation; correspondence network; visualization and clustering; imputation; RNA-SEQ; IDENTIFICATION;
D O I
10.3390/genes12111670
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single-cell sequencing provides novel means to interpret the transcriptomic profiles of individual cells. To obtain in-depth analysis of single-cell sequencing, it requires effective computational methods to accurately predict single-cell clusters because single-cell sequencing techniques only provide the transcriptomic profiles of each cell. Although an accurate estimation of the cell-to-cell similarity is an essential first step to derive reliable single-cell clustering results, it is challenging to obtain the accurate similarity measurement because it highly depends on a selection of genes for similarity evaluations and the optimal set of genes for the accurate similarity estimation is typically unknown. Moreover, due to technical limitations, single-cell sequencing includes a larger number of artificial zeros, and the technical noise makes it difficult to develop effective single-cell clustering algorithms. Here, we describe a novel single-cell clustering algorithm that can accurately predict single-cell clusters in large-scale single-cell sequencing by effectively reducing the zero-inflated noise and accurately estimating the cell-to-cell similarities. First, we construct an ensemble similarity network based on different similarity estimates, and reduce the artificial noise using a random walk with restart framework. Finally, starting from a larger number small size but highly consistent clusters, we iteratively merge a pair of clusters with the maximum similarities until it reaches the predicted number of clusters. Extensive performance evaluation shows that the proposed single-cell clustering algorithm can yield the accurate single-cell clustering results and it can help deciphering the key messages underlying complex biological mechanisms.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [32] Secuer: Ultrafast, scalable and accurate clustering of single-cell RNA-seq data
    Wei, Nana
    Nie, Yating
    Liu, Lin
    Zheng, Xiaoqi
    Wu, Hua-Jun
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (12)
  • [33] Supervised clustering for single-cell analysis
    Jimmy Tsz Hang Lee
    Martin Hemberg
    Nature Methods, 2019, 16 : 965 - 966
  • [34] Supervised clustering for single-cell analysis
    Lee, Jimmy Tsz Hang
    Hemberg, Martin
    NATURE METHODS, 2019, 16 (10) : 965 - 966
  • [35] Ensemble learning for classifying single-cell data and projection across reference atlases
    Wang, Lin
    Catalan, Francisca
    Shamardani, Karin
    Babikir, Husam
    Diaz, Aaron
    BIOINFORMATICS, 2020, 36 (11) : 3585 - 3587
  • [36] Semisoft clustering of single-cell data
    Zhu, Lingxue
    Lei, Jing
    Klei, Lambertus
    Devlin, Bernie
    Roeder, Kathryn
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (02) : 466 - 471
  • [37] Single-Cell RNA Sequencing Data Clustering by Low-Rank Subspace Ensemble Framework
    Wang, ChuanYuan
    Gao, Ying-Lian
    Liu, Jin-Xing
    Kong, Xiong-Zhen
    Zheng, Chun-Hou
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 1154 - 1164
  • [38] GSE: Graph similarity enhancement algorithm for single-cell RNA-seq data clustering
    Bu, Shugui
    Guo, Lilu
    Li, Rongyuan
    Lu, Jianbo
    Zhu, Xiaoshu
    2019 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP 2019), 2019, : 406 - 410
  • [39] Density peaks based clustering for single-cell interpretation via multikernel learning
    Kausar, Samina
    Mehmood, Rashid
    Iqbal, Muhammad Shahid
    Bie, Rongfang
    Ali, Shujaat
    Shabir, Yasir
    2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 71 - 76
  • [40] An active learning approach for clustering single-cell RNA-seq data
    Lin, Xiang
    Liu, Haoran
    Wei, Zhi
    Roy, Senjuti Basu
    Gao, Nan
    LABORATORY INVESTIGATION, 2022, 102 (03) : 227 - 235