scGCL: an imputation method for scRNA-seq data based on graph contrastive learning

被引:17
|
作者
Xiong, Zehao [1 ]
Luo, Jiawei [1 ]
Shi, Wanwan [1 ]
Liu, Ying [1 ]
Xu, Zhongyuan [1 ]
Wang, Bo [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410083, Peoples R China
关键词
CELL; ATLAS; CORTEX;
D O I
10.1093/bioinformatics/btad098
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Single-cell RNA-sequencing (scRNA-seq) is widely used to reveal cellular heterogeneity, complex disease mechanisms and cell differentiation processes. Due to high sparsity and complex gene expression patterns, scRNA-seq data present a large number of dropout events, affecting downstream tasks such as cell clustering and pseudo-time analysis. Restoring the expression levels of genes is essential for reducing technical noise and facilitating downstream analysis. However, existing scRNA-seq data imputation methods ignore the topological structure information of scRNA-seq data and cannot comprehensively utilize the relationships between cells.Results: Here, we propose a single-cell Graph Contrastive Learning method for scRNA-seq data imputation, named scGCL, which integrates graph contrastive learning and Zero-inflated Negative Binomial (ZINB) distribution to estimate dropout values. scGCL summarizes global and local semantic information through contrastive learning and selects positive samples to enhance the representation of target nodes. To capture the global probability distribution, scGCL introduces an autoencoder based on the ZINB distribution, which reconstructs the scRNA-seq data based on the prior distribution. Through extensive experiments, we verify that scGCL outperforms existing state-of-the-art imputation methods in clustering performance and gene imputation on 14 scRNA-seq datasets. Further, we find that scGCL can enhance the expression patterns of specific genes in Alzheimer's disease datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Matrix factorization for biomedical link prediction and scRNA-seq data imputation: an empirical survey
    Ou-Yang, Le
    Lu, Fan
    Zhang, Zi-Chao
    Wu, Min
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [32] scNCL: transferring labels from scRNA-seq to scATAC-seq data with neighborhood contrastive regularization
    Yan, Xuhua
    Zheng, Ruiqing
    Chen, Jinmiao
    Li, Min
    BIOINFORMATICS, 2023, 39 (08)
  • [33] scRNA-seq data analysis method to improve analysis performance
    Lu, Junru
    Sheng, Yuqi
    Qian, Weiheng
    Pan, Min
    Zhao, Xiangwei
    Ge, Qinyu
    IET NANOBIOTECHNOLOGY, 2023, 17 (03) : 246 - 256
  • [34] MLG: multilayer graph clustering for multi-condition scRNA-seq data
    Lu, Shan
    Conn, Daniel J.
    Chen, Shuyang
    Johnson, Kirby D.
    Bresnick, Emery H.
    Keles, Sunduz
    NUCLEIC ACIDS RESEARCH, 2021, 49 (22) : E127
  • [35] A Deep Learning-Based Method Facilitates scRNA-seq Cell Type Identification
    Wang, Xin
    Li, Zhuo
    Hang, Jie
    Xu, Ren
    Meng, Lin
    NEURAL COMPUTING FOR ADVANCED APPLICATIONS, NCAA 2024, PT I, 2025, 2181 : 171 - 185
  • [36] GNN-based embedding for clustering scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    BIOINFORMATICS, 2022, 38 (04) : 1037 - 1044
  • [37] scCRT: a contrastive-based dimensionality reduction model for scRNA-seq trajectory inference
    Shi, Yuchen
    Wan, Jian
    Zhang, Xin
    Liang, Tingting
    Yin, Yuyu
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [38] Computational approaches for interpreting scRNA-seq data
    Rostom, Raghd
    Svensson, Valentine
    Teichmann, Sarah A.
    Kar, Gozde
    FEBS LETTERS, 2017, 591 (15) : 2213 - 2225
  • [39] scWMC: weighted matrix completion-based imputation of scRNA-seq data via prior subspace information
    Su, Yanchi
    Wang, Fuzhou
    Zhang, Shixiong
    Liang, Yanchun
    Wong, Ka-Chun
    Li, Xiangtao
    BIOINFORMATICS, 2022, 38 (19) : 4537 - 4545
  • [40] scMRA: a robust deep learning method to annotate scRNA-seq data with multiple reference datasets
    Yuan, Musu
    Chen, Liang
    Deng, Minghua
    BIOINFORMATICS, 2022, 38 (03) : 738 - 745