Distribution-Agnostic Deep Learning Enables Accurate Single-Cell Data Recovery and Transcriptional Regulation Interpretation

被引:3
|
作者
Su, Yanchi [1 ]
Yu, Zhuohan [1 ]
Yang, Yuning [2 ]
Wong, Ka-Chun [3 ]
Li, Xiangtao [1 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun 130012, Peoples R China
[2] Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON M5S 3E1, Canada
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong 999077, Peoples R China
基金
中国国家自然科学基金;
关键词
imputation; optimal transport; single-cell RNA sequencing; DIFFERENTIATION; EXPRESSION; INDUCTION; DIVERSITY; DISTINCT; IMMUNE; OXYGEN; HEART; ATLAS; FATE;
D O I
10.1002/advs.202307280
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Single-cell RNA sequencing (scRNA-seq) is a robust method for studying gene expression at the single-cell level, but accurately quantifying genetic material is often hindered by limited mRNA capture, resulting in many missing expression values. Existing imputation methods rely on strict data assumptions, limiting their broader application, and lack reliable supervision, leading to biased signal recovery. To address these challenges, authors developed Bis, a distribution-agnostic deep learning model for accurately recovering missing sing-cell gene expression from multiple platforms. Bis is an optimal transport-based autoencoder model that can capture the intricate distribution of scRNA-seq data while addressing the characteristic sparsity by regularizing the cellular embedding space. Additionally, they propose a module using bulk RNA-seq data to guide reconstruction and ensure expression consistency. Experimental results show Bis outperforms other models across simulated and real datasets, showcasing superiority in various downstream analyses including batch effect removal, clustering, differential expression analysis, and trajectory inference. Moreover, Bis successfully restores gene expression levels in rare cell subsets in a tumor-matched peripheral blood dataset, revealing developmental characteristics of cytokine-induced natural killer cells within a head and neck squamous cell carcinoma microenvironment. The accurate measurement of genetic material encounters challenges due to limited intracellular mRNA capture, leading to many missing expression values. A distribution-agnostic deep learning model, informed by external cues from bulk RNA-seq data, is developed to address this issue. This model precisely reconstructs gene expression patterns, offering valuable insights into the developmental maturation mechanisms of cytokine-induced NK cells. image
引用
收藏
页数:27
相关论文
共 50 条
  • [41] Single-cell segmentation in bacterial biofilms with an optimized deep learning method enables tracking of cell lineages and measurements of growth rates
    Jelli, Eric
    Ohmura, Takuya
    Netter, Niklas
    Abt, Martin
    Jimenez-Siebert, Eva
    Neuhaus, Konstantin
    Rode, Daniel K. H.
    Nadell, Carey D.
    Drescher, Knut
    MOLECULAR MICROBIOLOGY, 2023, 119 (06) : 659 - 676
  • [42] Semi-Supervised Deep Learning for Cell Type Identification From Single-Cell Transcriptomic Data
    Dong, Xishuang
    Chowdhury, Shanta
    Victor, Uboho
    Li, Xiangfang
    Qian, Lijun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1492 - 1505
  • [43] Reusability report: Learning the transcriptional grammar in single-cell RNA-sequencing data using transformers
    Sumeer Ahmad Khan
    Alberto Maillo
    Vincenzo Lagani
    Robert Lehmann
    Narsis A. Kiani
    David Gomez-Cabrero
    Jesper Tegner
    Nature Machine Intelligence, 2023, 5 : 1437 - 1446
  • [44] Reusability report: Learning the transcriptional grammar in single-cell RNA-sequencing data using transformers
    Khan, Sumeer Ahmad
    Maillo, Alberto
    Lagani, Vincenzo
    Lehmann, Robert
    Kiani, Narsis A.
    Gomez-Cabrero, David
    Tegner, Jesper
    NATURE MACHINE INTELLIGENCE, 2023, 5 (12) : 1437 - 1446
  • [45] Using deep learning to quantify neuronal activation from single-cell and spatial transcriptomic data
    Bahl, Ethan
    Chatterjee, Snehajyoti
    Mukherjee, Utsav
    Elsadany, Muhammad
    Vanrobaeys, Yann
    Lin, Li-Chun
    Mcdonough, Miriam
    Resch, Jon
    Giese, K. Peter
    Abel, Ted
    Michaelson, Jacob J.
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [46] Using deep learning to quantify neuronal activation from single-cell and spatial transcriptomic data
    Ethan Bahl
    Snehajyoti Chatterjee
    Utsav Mukherjee
    Muhammad Elsadany
    Yann Vanrobaeys
    Li-Chun Lin
    Miriam McDonough
    Jon Resch
    K. Peter Giese
    Ted Abel
    Jacob J. Michaelson
    Nature Communications, 15
  • [47] Chromatin Folding Coordinate and Landscape Unraveled by Deep Learning Analysis of Single-Cell Imaging Data
    Xie, Wenjun
    Qi, Yifeng
    Zhang, Bin
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 548A - 548A
  • [48] Deep-learning-based isolation of perturbation-induced variations in single-cell data
    Weinberger, Ethan
    Lee, Su-In
    NATURE METHODS, 2023, 20 (09) : 1287 - 1288
  • [49] Interpretable deep learning of single-cell and epigenetic data reveals novel molecular insights in aging
    Li, Zhi-Peng
    Du, Zhaozhen
    Huang, De-Shuang
    Teschendorff, Andrew E.
    SCIENTIFIC REPORTS, 2025, 15 (01):