Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data

Authors
Wang, Xi [1, 2]
Lian, Qiwei [1, 2]
Dong, Haoyu [1 ]
Xu, Shuo [2 ]
Su, Yaru [3 ]
Wu, Xiaohui [1 ]
Affiliations
[1] Soochow Univ, Suzhou Med Coll, Pasteurien Coll, Suzhou 215000, Peoples R China
[2] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[3] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Single-cell ATAC-seq; Gene set scoring; Pathway analysis; Single-cell RNA-seq; Benchmark; CHROMATIN ACCESSIBILITY; RNA-SEQ; REVEALS
DOI
10.1093/gpbjnl/qzae014
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Gene set scoring (GSS) is routinely used in gene expression analysis of bulk or single-cell RNA sequencing (RNA-seq) data, where it helps decipher single-cell heterogeneity and cell type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating chromatin-based gene regulation in single cells, and genes or gene sets with dynamic regulatory potential can be regarded as cell type-specific markers, as in single-cell RNA-seq (scRNA-seq). However, few GSS tools are specifically designed for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. Here, we systematically benchmarked ten GSS tools: four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data was comparable to that on scRNA-seq data, suggesting that they are applicable to scATAC-seq. Then, we extensively evaluated the performance of the different GSS tools using up to ten scATAC-seq datasets. Moreover, we evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on GSS results. The results show that dropout imputation can significantly improve the performance of almost all GSS tools, whereas the impact of gene activity conversion methods or gene set collections depends more on the GSS tool or dataset. Finally, we provide practical guidelines for choosing appropriate preprocessing methods and GSS tools in different application scenarios.
Pages: 16