DeepGSEA: explainable deep gene set enrichment analysis for single-cell transcriptomic data

被引:1
|
作者
Xiong, Guangzhi [1 ]
Leroy, Nathan J. [2 ]
Bekiranov, Stefan [3 ]
Sheffield, Nathan C. [2 ]
Zhang, Aidong [1 ]
机构
[1] Univ Virginia, Dept Comp Sci, 85 Engineers Way, Charlottesville, VA 22904 USA
[2] Univ Virginia, Ctr Publ Hlth Genom, Charlottesville, VA 22904 USA
[3] Univ Virginia, Dept Biochem & Mol Genet, Charlottesville, VA 22908 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
REVEALS; WHETHER; CA1;
D O I
10.1093/bioinformatics/btae434
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Gene set enrichment (GSE) analysis allows for an interpretation of gene expression through pre-defined gene set databases and is a critical step in understanding different phenotypes. With the rapid development of single-cell RNA sequencing (scRNA-seq) technology, GSE analysis can be performed on fine-grained gene expression data to gain a nuanced understanding of phenotypes of interest. However, with the cellular heterogeneity in single-cell gene profiles, current statistical GSE analysis methods sometimes fail to identify enriched gene sets. Meanwhile, deep learning has gained traction in applications like clustering and trajectory inference in single-cell studies due to its prowess in capturing complex data patterns. However, its use in GSE analysis remains limited, due to interpretability challenges.Results In this paper, we present DeepGSEA, an explainable deep gene set enrichment analysis approach which leverages the expressiveness of interpretable, prototype-based neural networks to provide an in-depth analysis of GSE. DeepGSEA learns the ability to capture GSE information through our designed classification tasks, and significance tests can be performed on each gene set, enabling the identification of enriched sets. The underlying distribution of a gene set learned by DeepGSEA can be explicitly visualized using the encoded cell and cellular prototype embeddings. We demonstrate the performance of DeepGSEA over commonly used GSE analysis methods by examining their sensitivity and specificity with four simulation studies. In addition, we test our model on three real scRNA-seq datasets and illustrate the interpretability of DeepGSEA by showing how its results can be explained.Availability and implementation https://github.com/Teddy-XiongGZ/DeepGSEA
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Global coordination level in single-cell transcriptomic data
    Guy Amit
    Dana Vaknin Ben Porath
    Orr Levy
    Omer Hamdi
    Amir Bashan
    Scientific Reports, 12
  • [22] Emergent statistical laws in single-cell transcriptomic data
    Lazzardi, Silvia
    Valle, Filippo
    Mazzolini, Andrea
    Scialdone, Antonio
    Caselle, Michele
    Osella, Matteo
    PHYSICAL REVIEW E, 2023, 107 (04)
  • [23] Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data
    Aditya Pratapa
    Amogh P. Jalihal
    Jeffrey N. Law
    Aditya Bharadwaj
    T. M. Murali
    Nature Methods, 2020, 17 : 147 - 154
  • [24] scEGG: an exogenous gene-guided clustering method for single-cell transcriptomic data
    Hu, Dayu
    Guan, Renxiang
    Liang, Ke
    Yu, Hao
    Quan, Hao
    Zhao, Yawei
    Liu, Xinwang
    He, Kunlun
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [25] Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data
    Pratapa, Aditya
    Jalihal, Amogh P.
    Law, Jeffrey N.
    Bharadwaj, Aditya
    Murali, T. M.
    NATURE METHODS, 2020, 17 (02) : 147 - +
  • [26] InterCellar enables interactive analysis and exploration of cell−cell communication in single-cell transcriptomic data
    Marta Interlandi
    Kornelius Kerl
    Martin Dugas
    Communications Biology, 5
  • [27] Semi-Supervised Deep Learning for Cell Type Identification From Single-Cell Transcriptomic Data
    Dong, Xishuang
    Chowdhury, Shanta
    Victor, Uboho
    Li, Xiangfang
    Qian, Lijun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1492 - 1505
  • [28] xSiGra: explainable model for single-cell spatial data elucidation
    Budhkar, Aishwarya
    Tang, Ziyang
    Liu, Xiang
    Zhang, Xuhong
    Su, Jing
    Song, Qianqian
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (05)
  • [29] Deep learning shapes single-cell data analysis COMMENT
    Ma, Qin
    Xu, Dong
    NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2022, 23 (05) : 303 - 304
  • [30] Deep learning training dynamics analysis for single-cell data
    Karin, Jonathan
    Mintz, Reshef
    NATURE COMPUTATIONAL SCIENCE, 2024, : 886 - 887