Toward universal cell embeddings: integrating single-cell RNA-seq datasets across species with SATURN

被引:3
|
作者
Rosen, Yanay [1 ]
Brbic, Maria [2 ]
Roohani, Yusuf [3 ]
Swanson, Kyle [1 ]
Li, Ziang [4 ]
Leskovec, Jure [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Swiss Fed Inst Technol EPFL, Sch Comp & Commun Sci, Lausanne, Switzerland
[3] Stanford Univ, Dept Biomed Data Sci, Stanford, CA USA
[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
美国国家卫生研究院;
关键词
AQUEOUS-HUMOR; LANGUAGE; GLAUCOMA;
D O I
10.1038/s41592-024-02191-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, interspecies genomic differences limit the joint analysis of cross-species datasets to homologous genes. Here we present SATURN, a deep learning method for learning universal cell embeddings that encodes genes' biological properties using protein language models. By coupling protein embeddings from language models with RNA expression, SATURN integrates datasets profiled from different species regardless of their genomic similarity. SATURN can detect functionally related genes coexpressed across species, redefining differential expression for cross-species analysis. Applying SATURN to three species whole-organism atlases and frog and zebrafish embryogenesis datasets, we show that SATURN can effectively transfer annotations across species, even when they are evolutionarily remote. We also demonstrate that SATURN can be used to find potentially divergent gene functions between glaucoma-associated genes in humans and four other species. SATURN performs cross-species integration and analysis using both single-cell gene expression and protein representations generated by protein language models.
引用
收藏
页码:1492 / 1500
页数:29
相关论文
共 50 条
  • [41] Single-cell RNA-seq to decipher tumour architecture
    Cloney, Ross
    NATURE REVIEWS GENETICS, 2017, 18 (01) : 2 - 3
  • [42] Recent Developments in Single-Cell RNA-Seq of Microorganisms
    Zhang, Yi
    Gao, Jiaxin
    Huang, Yanyi
    Wang, Jianbin
    BIOPHYSICAL JOURNAL, 2018, 115 (02) : 173 - 180
  • [43] Quality control of single-cell RNA-seq by SinQC
    Jiang, Peng
    Thomson, James A.
    Stewart, Ron
    BIOINFORMATICS, 2016, 32 (16) : 2514 - 2516
  • [44] Single-cell RNA-seq to decipher tumour architecture
    Ross Cloney
    Nature Reviews Genetics, 2017, 18 : 2 - 3
  • [45] Embracing the dropouts in single-cell RNA-seq analysis
    Qiu, Peng
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [46] The Advances of Single-Cell RNA-Seq in Kidney Immunology
    Zeng, Honghui
    Yang, Xiaoqiang
    Luo, Siweier
    Zhou, Yiming
    FRONTIERS IN PHYSIOLOGY, 2021, 12
  • [47] Single-Cell RNA-Seq in Human Lung Cancer
    Kim, J.
    Xu, Z.
    Marignani, P.
    JOURNAL OF THORACIC ONCOLOGY, 2018, 13 (10) : S911 - S911
  • [48] Comparison of transformations for single-cell RNA-seq data
    Ahlmann-Eltze, Constantin
    Huber, Wolfgang
    NATURE METHODS, 2023, 20 (05) : 665 - +
  • [49] scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets
    Andreatta, Massimo
    Berenstein, Ariel J.
    Carmona, Santiago J.
    BIOINFORMATICS, 2022, 38 (09) : 2642 - 2644
  • [50] The contribution of cell cycle to heterogeneity in single-cell RNA-seq data
    McDavid, Andrew
    Finak, Greg
    Gottardo, Raphael
    NATURE BIOTECHNOLOGY, 2016, 34 (06) : 591 - 593