A cell marker-based clustering strategy(cmCluster) for precise cell type identification of scRNA-seq data

被引:0
|
作者
Yuwei Huang [1 ]
Huidan Chang [1 ]
Xiaoyi Chen [2 ]
Jiayue Meng [1 ]
Mengyao Han [1 ]
Tao Huang [1 ]
Liyun Yuan [1 ]
Guoqing Zhang [1 ]
机构
[1] CAS Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health,University of Chinese Academy of Sciences, Chinese Academy of Science
[2] Ningbo Institute of Life and Health Industry, University of Chinese Academy of
关键词
D O I
暂无
中图分类号
Q503 [生物化学技术];
学科分类号
摘要
Background: The precise and efficient analysis of single-cell transcriptome data provides powerful support for studying the diversity of cell functions at the single-cell level. The most important and challenging steps are cell clustering and recognition of cell populations. While the precision of clustering and annotation are considered separately in most current studies, it is worth attempting to develop an extensive and flexible strategy to balance clustering accuracy and biological explanation comprehensively.Methods: The cell marker-based clustering strategy(cm Cluster), which is a modified Louvain clustering method,aims to search the optimal clusters through genetic algorithm(GA) and grid search based on the cell type annotation results.Results: By applying cm Cluster on a set of single-cell transcriptome data, the results showed that it was beneficial for the recognition of cell populations and explanation of biological function even on the occasion of incomplete cell type information or multiple data resources. In addition, cm Cluster also produced clear boundaries and appropriate subtypes with potential marker genes. The relevant code is available in Git Hub website(huangyuwei301/cm Cluster).Conclusions: We speculate that cm Cluster provides researchers effective screening strategies to improve the accuracy of subsequent biological analysis, reduce artificial bias, and facilitate the comparison and analysis of multiple studies.
引用
收藏
页码:163 / 174
页数:12
相关论文
共 50 条
  • [41] A rank-based marker selection method for high throughput scRNA-seq data
    Alexander H. S. Vargo
    Anna C. Gilbert
    BMC Bioinformatics, 21
  • [42] TraSig: inferring cell-cell interactions from pseudotime ordering of scRNA-Seq data
    Dongshunyi Li
    Jeremy J. Velazquez
    Jun Ding
    Joshua Hislop
    Mo R. Ebrahimkhani
    Ziv Bar-Joseph
    Genome Biology, 23
  • [43] An Effective Biclustering-Based Framework for Identifying Cell Subpopulations From scRNA-seq Data
    Fang, Qiong
    Su, Dewei
    Ng, Wilfred
    Feng, Jianlin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2249 - 2260
  • [44] Cell type-specific interaction analysis using doublets in scRNA-seq
    Schiebout, Courtney
    Lust, Hannah
    Huang, Yina
    Frost, H. Robert
    BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [45] AGImpute: imputation of scRNA-seq data based on a hybrid GAN with dropouts identification
    Zhu, Xiaoshu
    Meng, Shuang
    Li, Gaoshi
    Wang, Jianxin
    Peng, Xiaoqing
    BIOINFORMATICS, 2024, 40 (02)
  • [46] A rank-based marker selection method for high throughput scRNA-seq data
    Vargo, Alexander H. S.
    Gilbert, Anna C.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [47] Evaluation of T Cell Receptor Construction Methods from scRNA-Seq Data
    Tian, Ruonan
    Yu, Zhejian
    Xue, Ziwei
    Wu, Jiaxin
    Wu, Lize
    Cai, Shuo
    Gao, Bing
    He, Bing
    Zhao, Yu
    Yao, Jianhua
    Lu, Linrong
    Liu, Wanlu
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2025, 22 (06):
  • [48] TraSig: inferring cell-cell interactions from pseudotime ordering of scRNA-Seq data
    Li, Dongshunyi
    Velazquez, Jeremy J.
    Ding, Jun
    Hislop, Joshua
    Ebrahimkhani, Mo R.
    Bar-Joseph, Ziv
    GENOME BIOLOGY, 2022, 23 (01)
  • [49] Discovering single-cell eQTLs from scRNA-seq data only
    Ma, Tianxing
    Li, Haochen
    Zhang, Xuegong
    GENE, 2022, 829
  • [50] SCExecute: custom cell barcode-stratified analyses of scRNA-seq data
    Edwards, Nathan
    Dillard, Christian
    Prashant, N. M.
    Hongyu, Liu
    Yang, Mia
    Ulianova, Evgenia
    Horvath, Anelia
    BIOINFORMATICS, 2023, 39 (01)