Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data

Authors
Wang, Xi [1, 2]
Lian, Qiwei [1, 2]
Dong, Haoyu [1 ]
Xu, Shuo [2 ]
Su, Yaru [3 ]
Wu, Xiaohui [1 ]
Affiliations
[1] Soochow Univ, Suzhou Med Coll, Pasteurien Coll, Suzhou 215000, Peoples R China
[2] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[3] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Single-cell ATAC-seq; Gene set scoring; Pathway analysis; Single-cell RNA-seq; Benchmark; CHROMATIN ACCESSIBILITY; RNA-SEQ; REVEALS
DOI
10.1093/gpbjnl/qzae014
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Gene set scoring (GSS) is routinely used in gene expression analysis of bulk or single-cell RNA sequencing (RNA-seq) data, where it helps to decipher single-cell heterogeneity and cell type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating single-cell chromatin-based gene regulation, and genes or gene sets with dynamic regulatory potentials can be regarded as cell type-specific markers, as in single-cell RNA-seq (scRNA-seq). However, few GSS tools are specifically designed for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. Here, we systematically benchmarked ten GSS tools, including four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data was comparable to that on scRNA-seq data, suggesting their applicability to scATAC-seq. Then, the performance of different GSS tools was extensively evaluated using up to ten scATAC-seq datasets. Moreover, we evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on the results of GSS. The results show that dropout imputation can significantly improve the performance of almost all GSS tools, whereas the impact of gene activity conversion methods or gene set collections depends more on the GSS tool or dataset. Finally, we provided practical guidelines for choosing appropriate preprocessing methods and GSS tools in different application scenarios.
Pages: 16