DeepGSEA: explainable deep gene set enrichment analysis for single-cell transcriptomic data

被引:1
|
作者
Xiong, Guangzhi [1 ]
Leroy, Nathan J. [2 ]
Bekiranov, Stefan [3 ]
Sheffield, Nathan C. [2 ]
Zhang, Aidong [1 ]
机构
[1] Univ Virginia, Dept Comp Sci, 85 Engineers Way, Charlottesville, VA 22904 USA
[2] Univ Virginia, Ctr Publ Hlth Genom, Charlottesville, VA 22904 USA
[3] Univ Virginia, Dept Biochem & Mol Genet, Charlottesville, VA 22908 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
REVEALS; WHETHER; CA1;
D O I
10.1093/bioinformatics/btae434
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Gene set enrichment (GSE) analysis allows for an interpretation of gene expression through pre-defined gene set databases and is a critical step in understanding different phenotypes. With the rapid development of single-cell RNA sequencing (scRNA-seq) technology, GSE analysis can be performed on fine-grained gene expression data to gain a nuanced understanding of phenotypes of interest. However, with the cellular heterogeneity in single-cell gene profiles, current statistical GSE analysis methods sometimes fail to identify enriched gene sets. Meanwhile, deep learning has gained traction in applications like clustering and trajectory inference in single-cell studies due to its prowess in capturing complex data patterns. However, its use in GSE analysis remains limited, due to interpretability challenges.Results In this paper, we present DeepGSEA, an explainable deep gene set enrichment analysis approach which leverages the expressiveness of interpretable, prototype-based neural networks to provide an in-depth analysis of GSE. DeepGSEA learns the ability to capture GSE information through our designed classification tasks, and significance tests can be performed on each gene set, enabling the identification of enriched sets. The underlying distribution of a gene set learned by DeepGSEA can be explicitly visualized using the encoded cell and cellular prototype embeddings. We demonstrate the performance of DeepGSEA over commonly used GSE analysis methods by examining their sensitivity and specificity with four simulation studies. In addition, we test our model on three real scRNA-seq datasets and illustrate the interpretability of DeepGSEA by showing how its results can be explained.Availability and implementation https://github.com/Teddy-XiongGZ/DeepGSEA
引用
收藏
页数:10
相关论文
共 50 条
  • [31] CIRI-Deep Enables Single-Cell and Spatial Transcriptomic Analysis of Circular RNAs with Deep Learning
    Zhou, Zihan
    Zhang, Jinyang
    Zheng, Xin
    Pan, Zhicheng
    Zhao, Fangqing
    Gao, Yuan
    ADVANCED SCIENCE, 2024, 11 (14)
  • [32] Assessing transcriptomic heterogeneity of single-cell RNASeq data by bulk-level gene expression data
    Tiong, Khong-Loon
    Luzhbin, Dmytro
    Yeang, Chen-Hsiang
    BMC BIOINFORMATICS, 2024, 25 (01):
  • [33] Using deep learning to quantify neuronal activation from single-cell and spatial transcriptomic data
    Ethan Bahl
    Snehajyoti Chatterjee
    Utsav Mukherjee
    Muhammad Elsadany
    Yann Vanrobaeys
    Li-Chun Lin
    Miriam McDonough
    Jon Resch
    K. Peter Giese
    Ted Abel
    Jacob J. Michaelson
    Nature Communications, 15
  • [34] Using deep learning to quantify neuronal activation from single-cell and spatial transcriptomic data
    Bahl, Ethan
    Chatterjee, Snehajyoti
    Mukherjee, Utsav
    Elsadany, Muhammad
    Vanrobaeys, Yann
    Lin, Li-Chun
    Mcdonough, Miriam
    Resch, Jon
    Giese, K. Peter
    Abel, Ted
    Michaelson, Jacob J.
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [35] Boosting single-cell gene regulatory network reconstruction via bulk-cell transcriptomic data
    Shu, Hantao
    Ding, Fan
    Zhou, Jingtian
    Xue, Yexiang
    Zhao, Dan
    Zeng, Jianyang
    Ma, Jianzhu
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [36] Network-based integrative analysis of single-cell transcriptomic and epigenomic data for cell types
    Wu, Wenming
    Zhang, Wensheng
    Ma, Xiaoke
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [37] Scrublet: Computational Identification of Cell Doublets in Single-Cell Transcriptomic Data
    Wolock, Samuel L.
    Lopez, Romain
    Klein, Allon M.
    CELL SYSTEMS, 2019, 8 (04) : 281 - +
  • [38] Differential variability analysis of single-cell gene expression data
    Liu, Jiayi
    Kreimer, Anat
    Li, Wei Vivian
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [39] Single-cell transcriptomic analysis of Alzheimer's disease
    Mathys, Hansruedi
    Davila-Velderrain, Jose
    Peng, Zhuyu
    Gao, Fan
    Mohammadi, Shahin
    Young, Jennie Z.
    Menon, Madhvi
    He, Liang
    Abdurrob, Fatema
    Jiang, Xueqiao
    Martorell, Anthony J.
    Ransohoff, Richard M.
    Hafler, Brian P.
    Bennett, David A.
    Kellis, Manolis
    Tsai, Li-Huei
    NATURE, 2019, 570 (7761) : 332 - +
  • [40] Single-cell transcriptomic analysis of oligodendrocyte lineage cells
    van Bruggen, David
    Agirre, Eneritz
    Castelo-Branco, Goncalo
    CURRENT OPINION IN NEUROBIOLOGY, 2017, 47 : 168 - 175