SpaCCC: Large Language Model-Based Cell-Cell Communication Inference for Spatially Resolved Transcriptomic Data

被引:0
|
作者
Ji, Boya [1 ]
Wang, Xiaoqi [2 ]
Qiao, Debin [3 ,4 ]
Xu, Liwen [1 ]
Peng, Shaoliang [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710000, Peoples R China
[3] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[4] Zhengzhou Univ, Natl Supercomp Ctr Zhengzhou, Zhengzhou 450001, Peoples R China
来源
BIG DATA MINING AND ANALYTICS | 2024年 / 7卷 / 04期
基金
中国国家自然科学基金;
关键词
Accuracy; Large language models; Transcriptomics; Data visualization; Receivers; Spatial databases; Biology; Reliability; Spatial resolution; Signal resolution; Large Language Models (LLM); spatial transcriptome data; Cell-Cell Communications (CCCs); functional gene interaction networks; unified latent space;
D O I
10.26599/BDMA.2024.9020056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Drawing parallels between linguistic constructs and cellular biology, Large Language Models (LLMs) have achieved success in diverse downstream applications for single-cell data analysis. However, to date, it still lacks methods to take advantage of LLMs to infer Ligand-Receptor (LR)-mediated cell-cell communications for spatially resolved transcriptomic data. Here, we propose SpaCCC to facilitate the inference of spatially resolved cell-cell communications, which relies on our fine-tuned single-cell LLM and functional gene interaction network to embed ligand and receptor genes into a unified latent space. The LR pairs with a significant closer distance in latent space are taken to be more likely to interact with each other. After that, the molecular diffusion and permutation test strategies are respectively employed to calculate the communication strength and filter out communications with low specificities. The benchmarked performance of SpaCCC is evaluated on real single-cell spatial transcriptomic datasets with superiority over other methods. SpaCCC also infers known LR pairs concealed by existing aggregative methods and then identifies communication patterns for specific cell types and their signaling pathways. Furthermore, SpaCCC provides various cell-cell communication visualization results at both single-cell and cell type resolution. In summary, SpaCCC provides a sophisticated and practical tool allowing researchers to decipher spatially resolved cell-cell communications and related communication patterns and signaling pathways based on spatial transcriptome data. SpaCCC is free and publicly available at https://github.com/jiboyalab/SpaCCC.
引用
收藏
页码:1129 / 1147
页数:19
相关论文
共 50 条
  • [21] scConnect: a method for exploratory analysis of cell-cell communication based on single-cell RNA-sequencing data
    Jakobsson, Jon E. T.
    Spjuth, Ola
    Lagerstrom, Malin C.
    BIOINFORMATICS, 2021, 37 (20) : 3501 - 3508
  • [22] Characterizing spatial gene expression heterogeneity in spatially resolved single-cell transcriptomic data with nonuniform cellular densities
    Miller, Brendan F.
    Bambah-Mukku, Dhananjay
    Dulac, Catherine
    Zhuang, Xiaowei
    Fan, Jean
    GENOME RESEARCH, 2021, 31 (10) : 1843 - 1855
  • [23] Enhancing video temporal grounding with large language model-based data augmentation
    Tian, Yun
    Guo, Xiaobo
    Wang, Jinsong
    Li, Bin
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (05):
  • [24] Inference and analysis of cell-cell communication of non-myeloid circulating cells in late sepsis based on single-cell RNA-seq
    Tao, Yanyan
    Li, Miaomiao
    Liu, Cheng
    IET SYSTEMS BIOLOGY, 2024, 18 (06) : 218 - 226
  • [25] Adaptive layer splitting for wireless large language model inference in edge computing: a model-based reinforcement learning approach
    Chen, Yuxuan
    Li, Rongpeng
    Yu, Xiaoxue
    Zhao, Zhifeng
    Zhang, Honggang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2025, 26 (02) : 278 - 292
  • [26] Inference and multiscale model of epithelial-to-mesenchymal transition via single-cell transcriptomic data
    Sha, Yutong
    Wang, Shuxiong
    Zhou, Peijie
    Nie, Qing
    NUCLEIC ACIDS RESEARCH, 2020, 48 (17) : 9505 - 9520
  • [27] Big Data Technology Trends in Transportation Leveraging a Large Language Model-Based System
    Shahraki, Hamed Shahrokhi
    Babazadeh, Abbas
    INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2025,
  • [28] Model-based inference of a dual role for HOPS in regulating guard cell vacuole fusion
    Hodgens, Charles
    Flaherty, D. T.
    Pullen, Anne-Marie
    Khan, Imran
    English, Nolan J.
    Gillan, Lydia
    Rojas-Pierce, Marcela
    Akpa, Belinda S.
    IN SILICO PLANTS, 2024, 6 (02):
  • [29] A Bayesian noisy logic model for inference of transcription factor activity from single cell and bulk transcriptomic data
    Arriojas, Argenis
    Patalano, Susan
    Macoska, Jill
    Zarringhalam, Kourosh
    NAR GENOMICS AND BIOINFORMATICS, 2023, 5 (04)
  • [30] Large Language Model-Based Critical Care Big Data Deployment and Extraction: Descriptive Analysis
    Yang, Zhongbao
    Xu, Shan-Shan
    Liu, Xiaozhu
    Xu, Ningyuan
    Chen, Yuqing
    Wang, Shuya
    Miao, Ming-Yue
    Hou, Mengxue
    Liu, Shuai
    Zhou, Yi-Min
    Zhou, Jian-Xin
    Zhang, Linlin
    JMIR MEDICAL INFORMATICS, 2025, 13