Cell type matching across species using protein embeddings and transfer learning

被引:1
|
作者
Biharie, Kirti [1 ,2 ,3 ]
Michielsen, Lieke [1 ,2 ,3 ]
Reinders, Marcel J. T. [1 ,2 ,3 ]
Mahfouz, Ahmed [1 ,2 ,3 ]
机构
[1] Delft Univ Technol, Delft Bioinformat Lab, NL-2628 XE Delft, Netherlands
[2] Leiden Univ, Dept Human Genet, Med Ctr, NL-2333 ZC Leiden, Netherlands
[3] Leiden Univ, Leiden Computat Biol Ctr, Med Ctr, NL-2333 ZC Leiden, Netherlands
关键词
D O I
10.1093/bioinformatics/btad248
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
MotivationKnowing the relation between cell types is crucial for translating experimental results from mice to humans. Establishing cell type matches, however, is hindered by the biological differences between the species. A substantial amount of evolutionary information between genes that could be used to align the species is discarded by most of the current methods since they only use one-to-one orthologous genes. Some methods try to retain the information by explicitly including the relation between genes, however, not without caveats.ResultsIn this work, we present a model to transfer and align cell types in cross-species analysis (TACTiCS). First, TACTiCS uses a natural language processing model to match genes using their protein sequences. Next, TACTiCS employs a neural network to classify cell types within a species. Afterward, TACTiCS uses transfer learning to propagate cell type labels between species. We applied TACTiCS on scRNA-seq data of the primary motor cortex of human, mouse, and marmoset. Our model can accurately match and align cell types on these datasets. Moreover, our model outperforms Seurat and the state-of-the-art method SAMap. Finally, we show that our gene matching method results in better cell type matches than BLAST in our model.
引用
收藏
页码:i404 / i412
页数:9
相关论文
共 50 条
  • [1] Cell type matching across species using protein embeddings and transfer learning
    Biharie, Kirti
    Michielsen, Lieke
    Reinders, Marcel J. T.
    Mahfouz, Ahmed
    BIOINFORMATICS, 2023, 39 : I404 - I412
  • [2] Transfer Learning via Relational Type Matching
    Kumaraswamy, Raksha
    Odom, Phillip
    Kersting, Kristian
    Leake, David
    Natarajan, Sriraam
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 811 - 816
  • [3] Improving the learning of chemical-protein interactions from literature using transfer learning and specialized word embeddings
    Corbett, P.
    Boyle, J.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
  • [4] Decomposing Cell Identity for Transfer Learning across Cellular Measurements, Platforms, Tissues, and Species
    Stein-O'Brien, Genevieve L.
    Clark, Brian S.
    Sherman, Thomas
    Zibetti, Cristina
    Hu, Qiwen
    Sealfon, Rachel
    Liu, Sheng
    Qian, Jiang
    Colantuoni, Carlo
    Blackshaw, Seth
    Goff, Loyal A.
    Fertig, Elana J.
    CELL SYSTEMS, 2019, 8 (05) : 395 - +
  • [5] Augmenting Semantic Lexicons Using Word Embeddings and Transfer Learning
    Alshaabi, Thayer
    Van Oort, Colin M. M.
    Fudolig, Mikaela Irene
    Arnold, Michael V. V.
    Danforth, Christopher M. M.
    Dodds, Peter Sheridan
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 4
  • [6] Mapping Across Relational Domains for Transfer Learning with Word Embeddings-Based Similarity
    Luca, Thais
    Paes, Aline
    Zaverucha, Gerson
    INDUCTIVE LOGIC PROGRAMMING (ILP 2021), 2022, 13191 : 167 - 182
  • [7] Active transfer learning of matching query results across multiple sources
    Xin, Jie
    Cui, Zhiming
    Zhao, Pengpeng
    He, Tianxu
    FRONTIERS OF COMPUTER SCIENCE, 2015, 9 (04) : 595 - 607
  • [8] Active transfer learning of matching query results across multiple sources
    Jie XIN
    Zhiming CUI
    Pengpeng ZHAO
    Tianxu HE
    Frontiers of Computer Science, 2015, 9 (04) : 595 - 607
  • [9] Active transfer learning of matching query results across multiple sources
    Jie Xin
    Zhiming Cui
    Pengpeng Zhao
    Tianxu He
    Frontiers of Computer Science, 2015, 9 : 595 - 607
  • [10] Transfer learning for atomistic simulations using GNNs and kernel mean embeddings
    Falk, John I.
    Bonati, Luigi
    Novelli, Pietro
    Parrinello, Michele
    Pontil, Massimiliano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,