Distribution-Agnostic Deep Learning Enables Accurate Single-Cell Data Recovery and Transcriptional Regulation Interpretation

Cited by: 3
Authors
Su, Yanchi [1]
Yu, Zhuohan [1]
Yang, Yuning [2]
Wong, Ka-Chun [3]
Li, Xiangtao [1]
Affiliations
[1] Jilin Univ, Sch Artificial Intelligence, Changchun 130012, Peoples R China
[2] Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON M5S 3E1, Canada
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong 999077, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
imputation; optimal transport; single-cell RNA sequencing; DIFFERENTIATION; EXPRESSION; INDUCTION; DIVERSITY; DISTINCT; IMMUNE; OXYGEN; HEART; ATLAS; FATE;
DOI
10.1002/advs.202307280
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Single-cell RNA sequencing (scRNA-seq) is a robust method for studying gene expression at the single-cell level, but accurately quantifying genetic material is often hindered by limited mRNA capture, resulting in many missing expression values. Existing imputation methods rely on strict data assumptions, limiting their broader application, and lack reliable supervision, leading to biased signal recovery. To address these challenges, the authors developed Bis, a distribution-agnostic deep learning model for accurately recovering missing single-cell gene expression from multiple platforms. Bis is an optimal transport-based autoencoder model that can capture the intricate distribution of scRNA-seq data while addressing the characteristic sparsity by regularizing the cellular embedding space. Additionally, they propose a module using bulk RNA-seq data to guide reconstruction and ensure expression consistency. Experimental results show Bis outperforms other models across simulated and real datasets, showcasing superiority in various downstream analyses including batch effect removal, clustering, differential expression analysis, and trajectory inference. Moreover, Bis successfully restores gene expression levels in rare cell subsets in a tumor-matched peripheral blood dataset, revealing developmental characteristics of cytokine-induced natural killer cells within a head and neck squamous cell carcinoma microenvironment. The accurate measurement of genetic material encounters challenges due to limited intracellular mRNA capture, leading to many missing expression values. A distribution-agnostic deep learning model, informed by external cues from bulk RNA-seq data, is developed to address this issue. This model precisely reconstructs gene expression patterns, offering valuable insights into the developmental maturation mechanisms of cytokine-induced NK cells.
Pages: 27
Related papers
50 in total
  • [21] DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning
    Christof Angermueller
    Heather J. Lee
    Wolf Reik
    Oliver Stegle
    Genome Biology, 18
  • [22] Reliable Identification and Interpretation of Single-Cell Molecular Heterogeneity and Transcriptional Regulation using Dynamic Ensemble Pruning
    Fan, Yi
    Wang, Yunhe
    Wang, Fuzhou
    Huang, Lei
    Yang, Yuning
    Wong, Ka-c.
    Li, Xiangtao
    ADVANCED SCIENCE, 2023, 10 (22)
  • [23] Erratum to: DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning
    Christof Angermueller
    Heather J. Lee
    Wolf Reik
    Oliver Stegle
    Genome Biology, 18
  • [24] Batch alignment of single-cell transcriptomics data using deep metric learning
    Yu, Xiaokang
    Xu, Xinyi
    Zhang, Jingxiao
    Li, Xiangjie
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [26] Deep learning for inferring gene relationships from single-cell expression data
    Yuan, Ye
    Bar-Joseph, Ziv
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (52) : 27151 - 27158
  • [27] Application of Deep Learning on Single-cell RNA Sequencing Data Analysis: A Review
    Brendel, Matthew
    Su, Chang
    Bai, Zilong
    Zhang, Hao
    Elemento, Olivier
    Wang, Fei
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2022, 20 (05) : 814 - 835
  • [28] VSD genetic diagnosis exploiting single-cell expression data and deep learning
    von der Decken, Isabel
    Azimi, Hamid
    Lauber-Biason, Anna
    HORMONE RESEARCH IN PAEDIATRICS, 2022, 95 (SUPPL 2) : 561 - 562
  • [29] Ensemble deep learning of embeddings for clustering multimodal single-cell omics data
    Yu, Lijia
    Liu, Chunlei
    Yang, Jean Yee Hwa
    Yang, Pengyi
    BIOINFORMATICS, 2023, 39 (06)