scGCL: an imputation method for scRNA-seq data based on graph contrastive learning

Cited by: 17
Authors
Xiong, Zehao [1]
Luo, Jiawei [1]
Shi, Wanwan [1]
Liu, Ying [1]
Xu, Zhongyuan [1]
Wang, Bo [1]
Affiliation
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410083, Peoples R China
Keywords
CELL; ATLAS; CORTEX
DOI
10.1093/bioinformatics/btad098
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Single-cell RNA-sequencing (scRNA-seq) is widely used to reveal cellular heterogeneity, complex disease mechanisms and cell differentiation processes. Due to high sparsity and complex gene expression patterns, scRNA-seq data present a large number of dropout events, affecting downstream tasks such as cell clustering and pseudo-time analysis. Restoring the expression levels of genes is essential for reducing technical noise and facilitating downstream analysis. However, existing scRNA-seq data imputation methods ignore the topological structure information of scRNA-seq data and cannot comprehensively utilize the relationships between cells.
Results: Here, we propose a single-cell Graph Contrastive Learning method for scRNA-seq data imputation, named scGCL, which integrates graph contrastive learning and Zero-inflated Negative Binomial (ZINB) distribution to estimate dropout values. scGCL summarizes global and local semantic information through contrastive learning and selects positive samples to enhance the representation of target nodes. To capture the global probability distribution, scGCL introduces an autoencoder based on the ZINB distribution, which reconstructs the scRNA-seq data based on the prior distribution. Through extensive experiments, we verify that scGCL outperforms existing state-of-the-art imputation methods in clustering performance and gene imputation on 14 scRNA-seq datasets. Further, we find that scGCL can enhance the expression patterns of specific genes in Alzheimer's disease datasets.
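Note: the abstract names the ZINB distribution but does not give its form. As a rough illustration only, the sketch below writes out the standard ZINB probability mass function in the common mean/dispersion parameterization, plus the negative log-likelihood that a ZINB-based autoencoder would typically minimize. The function names and the symbols mu (mean), theta (dispersion) and pi (dropout probability) are assumptions chosen for this example and are not taken from scGCL.

```python
import math

def nb_log_pmf(x, mu, theta):
    """Log-probability of count x under a negative binomial with
    mean mu > 0 and dispersion theta > 0 (assumed parameterization)."""
    return (math.lgamma(x + theta) - math.lgamma(theta) - math.lgamma(x + 1)
            + theta * math.log(theta / (theta + mu))
            + x * math.log(mu / (theta + mu)))

def zinb_pmf(x, mu, theta, pi):
    """Zero-inflated NB: with probability pi the count is a forced zero
    (a dropout); otherwise it is drawn from the NB above."""
    nb = math.exp(nb_log_pmf(x, mu, theta))
    if x == 0:
        return pi + (1.0 - pi) * nb
    return (1.0 - pi) * nb

def zinb_nll(counts, mu, theta, pi):
    """Negative log-likelihood of a list of counts under shared
    (mu, theta, pi); an autoencoder would predict these per gene and cell."""
    return -sum(math.log(zinb_pmf(x, mu, theta, pi)) for x in counts)

if __name__ == "__main__":
    observed = [0, 0, 3, 0, 1, 7, 0, 2]  # made-up counts for illustration
    print(zinb_nll(observed, mu=2.0, theta=1.5, pi=0.3))
```

In this parameterization a larger pi places more mass on zero, which is how the model separates dropout zeros from zeros that reflect genuinely low expression.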
Pages: 8