scRAA: the development of a robust and automatic annotation procedure for single-cell RNA sequencing data

Authors
Yan, Dongyan [1]
Sun, Zhe [1]
Fang, Jiyuan [1]
Cao, Shanshan [1]
Wang, Wenjie [2]
Chang, Xinyue [2]
Badirli, Sarkhan [2]
Fu, Haoda [2]
Liu, Yushi [1,3]
Affiliations
[1] Eli Lilly & Co, Global Stat Sci, Indianapolis, IN USA
[2] Eli Lilly & Co, Adv Analyt & Data Sci, Indianapolis, IN USA
[3] Eli Lilly & Co, Global Stat Sci, 893 Delaware St, Indianapolis, IN 46225 USA
Keywords
Batch effect correction; cell-type classification; ensembled method; SEQ;
DOI
10.1080/10543406.2023.2208671
Chinese Library Classification number
R9 [Pharmacy];
Discipline classification code
1007;
Abstract
A critical task in single-cell RNA sequencing (scRNA-Seq) data analysis is to identify cell types in heterogeneous tissues. Although most classification methods have demonstrated high performance on scRNA-Seq annotation problems, a robust and accurate solution is still needed to generate reliable outcomes for downstream analyses such as marker gene identification, detection of differentially expressed genes, and pathway analysis. Because it is hard to establish a universally good metric, no single classification method performs well in every scenario. In addition, reference and query data in cell classification usually come from different experimental batches, and failing to account for batch effects may lead to misleading conclusions. To overcome this bottleneck, we propose a robust ensemble approach to classify cells, combined with a batch correction method applied between reference and query data. We simulated four scenarios that range from simple to complex batch effects and account for varying cell-type proportions, and we further tested our approach on both lung and pancreas data. We found improved prediction accuracy and robust performance across the simulation scenarios and the real data. Incorporating batch effect correction between reference and query, together with the ensemble approach, improves cell-type prediction accuracy while maintaining robustness.
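The two ideas named in the abstract (correcting batch effects between reference and query, then combining several classifiers) can be illustrated with a toy sketch. This is not the scRAA procedure: the record does not specify the base classifiers or the correction method, so everything below is an assumption made for illustration. Per-gene mean-centering stands in for batch correction, and a majority vote over three simple classifiers stands in for the ensemble. All function names (`center_batch`, `ensemble_annotate`, and so on) are hypothetical.

```python
from collections import Counter
from statistics import mean


def center_batch(cells):
    """Subtract each gene's batch mean (a crude stand-in for batch correction)."""
    n_genes = len(cells[0])
    gene_means = [mean(cell[g] for cell in cells) for g in range(n_genes)]
    return [[cell[g] - gene_means[g] for g in range(n_genes)] for cell in cells]


def euclidean(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def centroids(cells, labels):
    """Mean expression profile of each labelled cell type."""
    grouped = {}
    for cell, label in zip(cells, labels):
        grouped.setdefault(label, []).append(cell)
    return {label: [mean(col) for col in zip(*group)]
            for label, group in grouped.items()}


def nearest_centroid(cents, cell, dist):
    """Label of the centroid closest to the cell under the given distance."""
    return min(sorted(cents), key=lambda label: dist(cents[label], cell))


def nearest_neighbor(ref_cells, ref_labels, cell):
    """Label of the single closest reference cell (Euclidean distance)."""
    best = min(range(len(ref_cells)), key=lambda i: euclidean(ref_cells[i], cell))
    return ref_labels[best]


def ensemble_annotate(ref_cells, ref_labels, query_cells):
    """Center both batches, then label each query cell by majority vote."""
    ref = center_batch(ref_cells)
    query = center_batch(query_cells)
    cents = centroids(ref, ref_labels)
    predictions = []
    for cell in query:
        votes = [
            nearest_centroid(cents, cell, euclidean),
            nearest_centroid(cents, cell, manhattan),
            nearest_neighbor(ref, ref_labels, cell),
        ]
        # On a three-way tie, the first voter's label is kept.
        predictions.append(Counter(votes).most_common(1)[0][0])
    return predictions


# Toy data: two genes, two cell types; the query batch is shifted by +10.
ref_cells = [[1, 5], [2, 6], [5, 1], [6, 2]]
ref_labels = ["A", "A", "B", "B"]
query_cells = [[11, 15], [16, 12], [12, 16], [15, 11]]
print(ensemble_annotate(ref_cells, ref_labels, query_cells))  # ['A', 'B', 'A', 'B']
```

Mean-centering is only reasonable when reference and query have similar cell-type proportions. The abstract explicitly simulates varying proportions, which is one reason a real pipeline would likely need a more careful correction method than this sketch uses.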
Pages: 14
Related papers
50 in total
  • [21] Splatter: simulation of single-cell RNA sequencing data
    Luke Zappia
    Belinda Phipson
    Alicia Oshlack
    Genome Biology, 18
  • [22] Application of Single-Cell RNA Sequencing in Ovarian Development
    Gong, Xiaoqin
    Zhang, Yan
    Ai, Jihui
    Li, Kezhen
    BIOMOLECULES, 2023, 13 (01)
  • [23] Microfluidics Facilitates the Development of Single-Cell RNA Sequencing
    Pan, Yating
    Cao, Wenjian
    Mu, Ying
    Zhu, Qiangyuan
    BIOSENSORS-BASEL, 2022, 12 (07)
  • [24] Application of single-cell RNA sequencing in embryonic development
    Yu Shangguan
    Li, Chunhong
    Lin, Hua
    Ou, Minglin
    Tang, Donge
    Dai, Yong
    Yan, Qiang
    GENOMICS, 2020, 112 (06) : 4547 - 4551
  • [25] Single-cell Mayo Map (scMayoMap): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis
    Lu Yang
    Yan Er Ng
    Haipeng Sun
    Ying Li
    Lucas C. S. Chini
    Nathan K. LeBrasseur
    Jun Chen
    Xu Zhang
    BMC Biology, 21
  • [26] Single-cell Mayo Map (scMayoMap): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis
    Yang, Lu
    Ng, Yan Er
    Sun, Haipeng
    Li, Ying
    Chini, Lucas C. S.
    Lebrasseur, Nathan K.
    Chen, Jun
    Zhang, Xu
    BMC BIOLOGY, 2023, 21 (01)
  • [27] A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
    Chen, Minhui
    Dahl, Andy
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [28] scDA: Single cell discriminant analysis for single-cell RNA sequencing data
    Shi, Qianqian
    Li, Xinxing
    Peng, Qirui
    Zhang, Chuanchao
    Chen, Luonan
    Computational and Structural Biotechnology Journal, 2021, 19 : 3234 - 3244
  • [29] scDA: Single cell discriminant analysis for single-cell RNA sequencing data
    Shi, Qianqian
    Li, Xinxing
    Peng, Qirui
    Zhang, Chuanchao
    Chen, Luonan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 3234 - 3244
  • [30] scSwin: a supervised cell-type annotation method for single-cell RNA sequencing data using Swin Transformer
    Zhang, Shichen
    Xiang, Yiwen
    PROCEEDINGS OF 2024 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND INTELLIGENT COMPUTING, BIC 2024, 2024 : 479 - 484