A cell marker-based clustering strategy(cmCluster) for precise cell type identification of scRNA-seq data

被引:0
|
作者
Yuwei Huang [1 ]
Huidan Chang [1 ]
Xiaoyi Chen [2 ]
Jiayue Meng [1 ]
Mengyao Han [1 ]
Tao Huang [1 ]
Liyun Yuan [1 ]
Guoqing Zhang [1 ]
机构
[1] CAS Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health,University of Chinese Academy of Sciences, Chinese Academy of Science
[2] Ningbo Institute of Life and Health Industry, University of Chinese Academy of
关键词
D O I
暂无
中图分类号
Q503 [生物化学技术];
学科分类号
摘要
Background: The precise and efficient analysis of single-cell transcriptome data provides powerful support for studying the diversity of cell functions at the single-cell level. The most important and challenging steps are cell clustering and recognition of cell populations. While the precision of clustering and annotation are considered separately in most current studies, it is worth attempting to develop an extensive and flexible strategy to balance clustering accuracy and biological explanation comprehensively.Methods: The cell marker-based clustering strategy(cm Cluster), which is a modified Louvain clustering method,aims to search the optimal clusters through genetic algorithm(GA) and grid search based on the cell type annotation results.Results: By applying cm Cluster on a set of single-cell transcriptome data, the results showed that it was beneficial for the recognition of cell populations and explanation of biological function even on the occasion of incomplete cell type information or multiple data resources. In addition, cm Cluster also produced clear boundaries and appropriate subtypes with potential marker genes. The relevant code is available in Git Hub website(huangyuwei301/cm Cluster).Conclusions: We speculate that cm Cluster provides researchers effective screening strategies to improve the accuracy of subsequent biological analysis, reduce artificial bias, and facilitate the comparison and analysis of multiple studies.
引用
收藏
页码:163 / 174
页数:12
相关论文
共 50 条
  • [1] A cell marker-based clustering strategy (cmCluster) for precise cell type identification of scRNA-seq data
    Huang, Yuwei
    Chang, Huidan
    Chen, Xiaoyi
    Meng, Jiayue
    Han, Mengyao
    Huang, Tao
    Yuan, Liyun
    Zhang, Guoqing
    QUANTITATIVE BIOLOGY, 2023, 11 (02) : 163 - 174
  • [2] Consensus Clustering Strategy for Cell Type Assignments of scRNA-seq Data
    Riva, Simone G.
    Myers, Brynelle
    Cazzaniga, Paolo
    Buffa, Francesca M.
    Tangherloni, Andrea
    2023 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, CIBCB, 2023, : 23 - 30
  • [3] Attention-based deep clustering method for scRNA-seq cell type identification
    Li, Shenghao
    Guo, Hui
    Zhang, Simai
    Li, Yizhou
    Li, Menglong
    PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (11)
  • [4] Thresholding CITE-Seq-based Antibody Expression Data to Improve Cell Type Identification in scRNA-Seq
    Khan, Amir
    Oliaeimotlagh, Mohammad
    Suthahar, Sujit Silas Armstrong
    Iqneibi, Shahad
    Nettersheim, Felix
    Ley, Klaus
    JOURNAL OF IMMUNOLOGY, 2023, 210 (01):
  • [5] Automated methods for cell type annotation on scRNA-seq data
    Pasquini, Giovanni
    Arias, Jesus Eduardo Rojo
    Schaefer, Patrick
    Busskamp, Volker
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 961 - 969
  • [6] A Deep Learning-Based Method Facilitates scRNA-seq Cell Type Identification
    Wang, Xin
    Li, Zhuo
    Hang, Jie
    Xu, Ren
    Meng, Lin
    NEURAL COMPUTING FOR ADVANCED APPLICATIONS, NCAA 2024, PT I, 2025, 2181 : 171 - 185
  • [7] scFED: Clustering Identifying Cell Types of scRNA-Seq Data Based on Feature Engineering Denoising
    Liu, Yang
    Li, Feng
    Shang, Junliang
    Liu, Jinxing
    Wang, Juan
    Ge, Daohui
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (04) : 590 - 601
  • [8] scFED: Clustering Identifying Cell Types of scRNA-Seq Data Based on Feature Engineering Denoising
    Yang Liu
    Feng Li
    Junliang Shang
    Jinxing Liu
    Juan Wang
    Daohui Ge
    Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 590 - 601
  • [9] scDeepInsight: a supervised cell-type identification method for scRNA-seq data with deep learning
    Jia, Shangru
    Lysenko, Artem
    Boroevich, Keith A.
    Sharma, Alok
    Tsunoda, Tatsuhiko
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [10] GNN-based embedding for clustering scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    BIOINFORMATICS, 2022, 38 (04) : 1037 - 1044