Distribution-Agnostic Deep Learning Enables Accurate Single-Cell Data Recovery and Transcriptional Regulation Interpretation

被引:3
|
作者
Su, Yanchi [1 ]
Yu, Zhuohan [1 ]
Yang, Yuning [2 ]
Wong, Ka-Chun [3 ]
Li, Xiangtao [1 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun 130012, Peoples R China
[2] Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON M5S 3E1, Canada
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong 999077, Peoples R China
基金
中国国家自然科学基金;
关键词
imputation; optimal transport; single-cell RNA sequencing; DIFFERENTIATION; EXPRESSION; INDUCTION; DIVERSITY; DISTINCT; IMMUNE; OXYGEN; HEART; ATLAS; FATE;
D O I
10.1002/advs.202307280
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Single-cell RNA sequencing (scRNA-seq) is a robust method for studying gene expression at the single-cell level, but accurately quantifying genetic material is often hindered by limited mRNA capture, resulting in many missing expression values. Existing imputation methods rely on strict data assumptions, limiting their broader application, and lack reliable supervision, leading to biased signal recovery. To address these challenges, authors developed Bis, a distribution-agnostic deep learning model for accurately recovering missing sing-cell gene expression from multiple platforms. Bis is an optimal transport-based autoencoder model that can capture the intricate distribution of scRNA-seq data while addressing the characteristic sparsity by regularizing the cellular embedding space. Additionally, they propose a module using bulk RNA-seq data to guide reconstruction and ensure expression consistency. Experimental results show Bis outperforms other models across simulated and real datasets, showcasing superiority in various downstream analyses including batch effect removal, clustering, differential expression analysis, and trajectory inference. Moreover, Bis successfully restores gene expression levels in rare cell subsets in a tumor-matched peripheral blood dataset, revealing developmental characteristics of cytokine-induced natural killer cells within a head and neck squamous cell carcinoma microenvironment. The accurate measurement of genetic material encounters challenges due to limited intracellular mRNA capture, leading to many missing expression values. A distribution-agnostic deep learning model, informed by external cues from bulk RNA-seq data, is developed to address this issue. This model precisely reconstructs gene expression patterns, offering valuable insights into the developmental maturation mechanisms of cytokine-induced NK cells. image
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Deep learning enables accurate alignment of single cell RNA-seq data
    Zhong, Yuanke
    Li, Jing
    Liu, Jie
    Zheng, Yan
    Shang, Xuequn
    Hu, Jialu
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 778 - 781
  • [2] Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
    Li, Xiangjie
    Wang, Kui
    Lyu, Yafei
    Pan, Huize
    Zhang, Jingxiao
    Stambolian, Dwight
    Susztak, Katalin
    Reilly, Muredach P.
    Hu, Gang
    Li, Mingyao
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [3] Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
    Xiangjie Li
    Kui Wang
    Yafei Lyu
    Huize Pan
    Jingxiao Zhang
    Dwight Stambolian
    Katalin Susztak
    Muredach P. Reilly
    Gang Hu
    Mingyao Li
    Nature Communications, 11
  • [4] Machine-learning-optimized Cas12a barcoding enables the recovery of single-cell lineages and transcriptional profiles
    Hughes, Nicholas W.
    Qu, Yuanhao
    Zhang, Jiaqi
    Tang, Weijing
    Pierce, Justin
    Wang, Chengkun
    Agrawal, Aditi
    Morri, Maurizio
    Neff, Norma
    Winslow, Monte M.
    Wang, Mengdi
    Cong, Le
    MOLECULAR CELL, 2022, 82 (16) : 3103 - +
  • [5] Deep learning shapes single-cell data analysis
    Qin Ma
    Dong Xu
    Nature Reviews Molecular Cell Biology, 2022, 23 : 303 - 304
  • [6] JS']JSNMF enables effective and accurate integrative analysis of single-cell multiomics data
    Ma, Yuanyuan
    Sun, Zexuan
    Zeng, Pengcheng
    Zhang, Wenyu
    Lin, Zhixiang
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)
  • [7] Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data
    Islam, Md Tauhidul
    Xing, Lei
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [8] Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data
    Md Tauhidul Islam
    Lei Xing
    Nature Communications, 14
  • [9] STEM enables mapping of single-cell and spatial transcriptomics data with transfer learning
    Hao, Minsheng
    Luo, Erpai
    Chen, Yixin
    Wu, Yanhong
    Li, Chen
    Chen, Sijie
    Gao, Haoxiang
    Bian, Haiyang
    Gu, Jin
    Wei, Lei
    Zhang, Xuegong
    COMMUNICATIONS BIOLOGY, 2024, 7 (01)
  • [10] STEM enables mapping of single-cell and spatial transcriptomics data with transfer learning
    Minsheng Hao
    Erpai Luo
    Yixin Chen
    Yanhong Wu
    Chen Li
    Sijie Chen
    Haoxiang Gao
    Haiyang Bian
    Jin Gu
    Lei Wei
    Xuegong Zhang
    Communications Biology, 7