Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data

被引:0
|
作者
Wang, Xi [1 ,2 ]
Lian, Qiwei [1 ,2 ]
Dong, Haoyu [1 ]
Xu, Shuo [2 ]
Su, Yaru [3 ]
Wu, Xiaohui [1 ]
机构
[1] Soochow Univ, Suzhou Med Coll, Pasteurien Coll, Suzhou 215000, Peoples R China
[2] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[3] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Peoples R China
基金
中国国家自然科学基金;
关键词
Single-cell ATAC-seq; Gene set scoring; Pathway analysis; Single-cell RNA-seq; Benchmark; CHROMATIN ACCESSIBILITY; RNA-SEQ; REVEALS;
D O I
10.1093/gpbjnl/qzae014
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Gene set scoring (GSS) has been routinely conducted for gene expression analysis of bulk or single-cell RNA sequencing (RNA-seq) data, which helps to decipher single-cell heterogeneity and cell type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating single-cell chromatin-based gene regulation, and genes or gene sets with dynamic regulatory potentials can be regarded as cell type-specific markers as if in single-cell RNA-seq (scRNA-seq). However, there are few GSS tools specifically designed for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. Here, we systematically benchmarked ten GSS tools, including four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data was comparable to that on scRNA-seq, suggesting their applicability to scATAC-seq. Then, the performance of different GSS tools was extensively evaluated using up to ten scATAC-seq datasets. Moreover, we evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on the results of GSS. Results show that dropout imputation can significantly promote the performance of almost all GSS tools, while the impact of gene activity conversion methods or gene set collections on GSS performance is more dependent on GSS tools or datasets. Finally, we provided practical guidelines for choosing appropriate preprocessing methods and GSS tools in different application scenarios.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Integrative single-cell RNA-seq and ATAC-seq analysis of myogenic differentiation in pig
    Cai, Shufang
    Hu, Bin
    Wang, Xiaoyu
    Liu, Tongni
    Lin, Zhuhu
    Tong, Xian
    Xu, Rong
    Chen, Meilin
    Duo, Tianqi
    Zhu, Qi
    Liang, Ziyun
    Li, Enru
    Chen, Yaosheng
    Li, Jianhao
    Liu, Xiaohong
    Mo, Delin
    BMC BIOLOGY, 2023, 21 (01)
  • [42] A Cell Cycle-Aware Network for Data Integration and Label Transferring of Single-Cell RNA-Seq and ATAC-Seq
    Liu, Jiajia
    Ma, Jian
    Wen, Jianguo
    Zhou, Xiaobo
    ADVANCED SCIENCE, 2024,
  • [43] A deep generative model for multi-view profiling of single-cell RNA-seq and ATAC-seq data
    Li, Gaoyang
    Fu, Shaliu
    Wang, Shuguang
    Zhu, Chenyu
    Duan, Bin
    Tang, Chen
    Chen, Xiaohan
    Chuai, Guohui
    Wang, Ping
    Liu, Qi
    GENOME BIOLOGY, 2022, 23 (01)
  • [44] Discovering single nucleotide variants and indels from bulk and single-cell ATAC-seq
    Massarat, Arya R.
    Sen, Arko
    Jaureguy, Jeff
    Tyndale, Selene T.
    Fu, Yi
    Erikson, Galina
    McVicker, Graham
    NUCLEIC ACIDS RESEARCH, 2021, 49 (14) : 7986 - 7994
  • [45] Integrative single-cell RNA-seq and ATAC-seq analysis of myogenic differentiation in pig
    Shufang Cai
    Bin Hu
    Xiaoyu Wang
    Tongni Liu
    Zhuhu Lin
    Xian Tong
    Rong Xu
    Meilin Chen
    Tianqi Duo
    Qi Zhu
    Ziyun Liang
    Enru Li
    Yaosheng Chen
    Jianhao Li
    Xiaohong Liu
    Delin Mo
    BMC Biology, 21
  • [46] Integrative Single-Cell RNA-Seq and ATAC-Seq Analysis of Human Developmental Hematopoiesis
    Ranzoni, Anna Maria
    Tangherloni, Andrea
    Berest, Ivan
    Riva, Simone Giovanni
    Myers, Brynelle
    Strzelecka, Paulina M.
    Xu, Jiarui
    Panada, Elisa
    Mohorianu, Irina
    Zaugg, Judith B.
    Cvejic, Ana
    CELL STEM CELL, 2021, 28 (03) : 472 - +
  • [47] scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning
    Yingxin Lin
    Tung-Yu Wu
    Sheng Wan
    Jean Y. H. Yang
    Wing H. Wong
    Y. X. Rachel Wang
    Nature Biotechnology, 2022, 40 : 703 - 710
  • [48] Cellcano: supervised cell type identification for single cell ATAC-seq data
    Ma, Wenjing
    Lu, Jiaying
    Wu, Hao
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [49] A deep generative model for multi-view profiling of single-cell RNA-seq and ATAC-seq data
    Gaoyang Li
    Shaliu Fu
    Shuguang Wang
    Chenyu Zhu
    Bin Duan
    Chen Tang
    Xiaohan Chen
    Guohui Chuai
    Ping Wang
    Qi Liu
    Genome Biology, 23
  • [50] scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning
    Lin, Yingxin
    Wu, Tung-Yu
    Wan, Sheng
    Yang, Jean Y. H.
    Wong, Wing H.
    Wang, Y. X. Rachel
    NATURE BIOTECHNOLOGY, 2022, 40 (05) : 703 - +