scRAA: the development of a robust and automatic annotation procedure for single-cell RNA sequencing data

被引:0
|
作者
Yan, Dongyan [1 ]
Sun, Zhe [1 ]
Fang, Jiyuan [1 ]
Cao, Shanshan [1 ]
Wang, Wenjie [2 ]
Chang, Xinyue [2 ]
Badirli, Sarkhan [2 ]
Fu, Haoda [2 ]
Liu, Yushi [1 ,3 ]
机构
[1] Eli Lilly & Co, Global Stat Sci, Indianapolis, IN USA
[2] Eli Lilly & Co, Adv Analyt & Data Sci, Indianapolis, IN USA
[3] Eli Lilly & Co, Global Stat Sci, 893 Delaware St, Indianapolis, IN 46225 USA
关键词
Batch effect correction; cell-type classification; ensembled method; SEQ;
D O I
10.1080/10543406.2023.2208671
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
A critical task in single-cell RNA sequencing (scRNA-Seq) data analysis is to identify cell types from heterogeneous tissues. While the majority of classification methods demonstrated high performance in scRNA-Seq annotation problems, a robust and accurate solution is desired to generate reliable outcomes for downstream analyses, for instance, marker genes identification, differentially expressed genes, and pathway analysis. It is hard to establish a universally good metric. Thus, a universally good classification method for all kinds of scenarios does not exist. In addition, reference and query data in cell classification are usually from different experimental batches, and failure to consider batch effects may result in misleading conclusions. To overcome this bottleneck, we propose a robust ensemble approach to classify cells and utilize a batch correction method between reference and query data. We simulated four scenarios that comprise simple to complex batch effect and account for varying cell-type proportions. We further tested our approach on both lung and pancreas data. We found improved prediction accuracy and robust performance across simulation scenarios and real data. The incorporation of batch effect correction between reference and query, and the ensemble approach improve cell-type prediction accuracy while maintaining robustness. We demonstrated these through simulated and real scRNA-Seq data.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Single-Cell RNA Sequencing Approaches for Tracing T Cell Development
    Oh, Seungyoul
    Gray, Daniel H. D.
    Chong, Mark M. W.
    JOURNAL OF IMMUNOLOGY, 2021, 207 (02): : 363 - 370
  • [42] Single-cell RNA sequencing in cardiovascular development, disease and medicine
    David T. Paik
    Sangkyun Cho
    Lei Tian
    Howard Y. Chang
    Joseph C. Wu
    Nature Reviews Cardiology, 2020, 17 : 457 - 473
  • [43] Applications of single-cell RNA sequencing in drug discovery and development
    van de Sande, Bram
    Lee, Joon Sang
    Mutasa-Gottgens, Euphemia
    Naughton, Bart
    Bacon, Wendi
    Manning, Jonathan
    Wang, Yong
    Pollard, Jack
    Mendez, Melissa
    Hill, Jon
    Kumar, Namit
    Cao, Xiaohong
    Chen, Xiao
    Khaladkar, Mugdha
    Wen, Ji
    Leach, Andrew
    Ferran, Edgardo
    NATURE REVIEWS DRUG DISCOVERY, 2023, 22 (06) : 496 - 520
  • [44] Single-cell RNA sequencing in cardiovascular development, disease and medicine
    Paik, David T.
    Cho, Sangkyun
    Tian, Lei
    Chang, Howard Y.
    Wu, Joseph C.
    NATURE REVIEWS CARDIOLOGY, 2020, 17 (08) : 457 - 473
  • [45] Applications of single-cell RNA sequencing in drug discovery and development
    Bram Van de Sande
    Joon Sang Lee
    Euphemia Mutasa-Gottgens
    Bart Naughton
    Wendi Bacon
    Jonathan Manning
    Yong Wang
    Jack Pollard
    Melissa Mendez
    Jon Hill
    Namit Kumar
    Xiaohong Cao
    Xiao Chen
    Mugdha Khaladkar
    Ji Wen
    Andrew Leach
    Edgardo Ferran
    Nature Reviews Drug Discovery, 2023, 22 : 496 - 520
  • [46] Values of Single-Cell RNA Sequencing in Development of Cerebral Cortex
    Chang, Enqiang
    Ruan, Xiaoguo
    Zhu, Ruilou
    Wang, Yangyang
    Zhang, Jiaqiang
    SINGLE-CELL SEQUENCING AND METHYLATION: METHODS AND CLINICAL APPLICATIONS, 2020, 1255 : 231 - 247
  • [47] Single-cell RNA sequencing for the study of development, physiology and disease
    S. Steven Potter
    Nature Reviews Nephrology, 2018, 14 : 479 - 492
  • [48] Single-cell RNA sequencing for the study of development, physiology and disease
    Potter, S. Steven
    NATURE REVIEWS NEPHROLOGY, 2018, 14 (08) : 479 - 492
  • [49] Automated annotation of rare-cell types from single-cell RNA-sequencing data through synthetic oversampling
    Bej, Saptarshi
    Galow, Anne-Marie
    David, Robert
    Wolfien, Markus
    Wolkenhauer, Olaf
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [50] Single-cell RNA Sequencing in Immunology
    Cao, Yirui
    Qiu, Yue
    Tu, Guowei
    Yang, Cheng
    CURRENT GENOMICS, 2020, 21 (08) : 564 - 575