scRAA: the development of a robust and automatic annotation procedure for single-cell RNA sequencing data

Authors
Yan, Dongyan [1]
Sun, Zhe [1]
Fang, Jiyuan [1]
Cao, Shanshan [1]
Wang, Wenjie [2]
Chang, Xinyue [2]
Badirli, Sarkhan [2]
Fu, Haoda [2]
Liu, Yushi [1,3]
Affiliations
[1] Eli Lilly & Co, Global Stat Sci, Indianapolis, IN USA
[2] Eli Lilly & Co, Adv Analyt & Data Sci, Indianapolis, IN USA
[3] Eli Lilly & Co, Global Stat Sci, 893 Delaware St, Indianapolis, IN 46225 USA
Keywords
Batch effect correction; cell-type classification; ensembled method; SEQ
DOI
10.1080/10543406.2023.2208671
Chinese Library Classification number
R9 [Pharmacy]
Discipline classification code
1007
Abstract
A critical task in single-cell RNA sequencing (scRNA-Seq) data analysis is to identify cell types from heterogeneous tissues. While the majority of classification methods have demonstrated high performance in scRNA-Seq annotation problems, a robust and accurate solution is still needed to generate reliable outcomes for downstream analyses such as marker gene identification, differential gene expression analysis, and pathway analysis. Because it is hard to establish a universally good metric, no single classification method is best across all scenarios. In addition, reference and query data in cell classification usually come from different experimental batches, and failure to account for batch effects may lead to misleading conclusions. To overcome this bottleneck, we propose a robust ensemble approach to classify cells, combined with batch correction between reference and query data. We simulated four scenarios that range from simple to complex batch effects and account for varying cell-type proportions. We further tested our approach on both lung and pancreas data. Across the simulation scenarios and the real scRNA-Seq data, incorporating batch effect correction between reference and query together with the ensemble approach improved cell-type prediction accuracy while maintaining robustness.
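The abstract does not say which classifiers or which batch correction method scRAA uses, so the following is only a minimal sketch of the two general ideas it names, not the authors' procedure. It assumes per-gene mean centering within a batch as a stand-in for batch correction and a plain majority vote as the ensemble rule; the function names `center_batch` and `majority_vote` are hypothetical.

```python
from collections import Counter
from statistics import fmean


def center_batch(expr):
    """Subtract each gene's mean within one batch.

    expr is a list of cells, each a list of expression values per gene.
    This is an assumed, simplistic stand-in for batch correction.
    """
    n_genes = len(expr[0])
    means = [fmean(cell[g] for cell in expr) for g in range(n_genes)]
    return [[cell[g] - means[g] for g in range(n_genes)] for cell in expr]


def majority_vote(predictions):
    """Combine label lists from several classifiers into one label per cell.

    predictions holds one list of labels per classifier, all of equal length.
    Ties go to the tied label proposed by the earliest-listed classifier.
    """
    consensus = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        top = max(counts.values())
        consensus.append(next(lab for lab in labels if counts[lab] == top))
    return consensus
```

In use, reference and query matrices would each be passed through `center_batch` before training and prediction, and the label lists returned by the individual classifiers for the query cells would be passed to `majority_vote`.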
Pages: 14