Network embedding-based representation learning for single cell RNA-seq data

被引:48
|
作者
Li, Xiangyu [1 ]
Chen, Weizheng [3 ]
Chen, Yang [1 ]
Zhang, Xuegong [1 ]
Gu, Jin [1 ]
Zhang, Michael Q. [1 ,2 ]
机构
[1] Tsinghua Univ, Ctr Synthet & Syst Biol, Dept Automat, MOE Key Lab Bioinformat,TNLIST Bioinformat Div, Beijing 100084, Peoples R China
[2] Univ Texas Dallas, Dept Biol Sci, Ctr Syst Biol, 800 West Campbell Rd,RL11, Richardson, TX 75080 USA
[3] Peking Univ, Dept Comp Sci, Inst Network Comp & Informat Syst, Beijing 100871, Peoples R China
关键词
EXPRESSION ANALYSIS; GENE; TRANSCRIPTOME;
D O I
10.1093/nar/gkx750
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single cell RNA-seq (scRNA-seq) techniques can reveal valuable insights of cell-to-cell heterogeneities. Projection of high-dimensional data into a low-dimensional subspace is a powerful strategy in general for mining such big data. However, scRNA-seq suffers from higher noise and lower coverage than traditional bulk RNA-seq, hence bringing in new-computational difficulties. One major challenge is how to deal with the frequent drop-out events. The events, usually caused by the stochastic burst effect in gene transcription and the technical failure of RNA transcript capture, often render traditional dimension reduction methods work inefficiently. To overcome this problem, we have developed a novel Single Cell Representation Learning (SCRL) method based on network embedding. This method can efficiently implement data-driven non-linear projection and incorporate prior biological knowledge (such as pathway information) to learn more meaningful low-dimensional representations for both cells and genes. Benchmark results show that SCRL outperforms other dimensional reduction methods on several recent scRNA-seq datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] GE-Impute: graph embedding-based imputation for single-cell RNA-seq data
    Wu, Xiaobin
    Zhou, Yuan
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [2] scLINE: A multi-network integration framework based on network embedding for representation of single-cell RNA-seq data
    Li, Huoyou
    Xiao, Xuesong
    Wu, Xiaohui
    Ye, Lishan
    Ji, Guoli
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 122
  • [3] scLINE: A multi-network integration framework based on network embedding for representation of single-cell RNA-seq data
    Li, Huoyou
    Xiao, Xuesong
    Wu, Xiaohui
    Ye, Lishan
    Ji, Guoli
    Journal of Biomedical Informatics, 2021, 122
  • [4] A network enhancement-based method for clustering of single cell RNA-seq data
    Zhu, Xiaoshu
    Guo, Lilu
    Li, Rongyuan
    Xu, Yunpei
    Wu, Fang-Xiang
    Peng, Xiaoqing
    Li, Hong-Dong
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2020, 24 (04) : 306 - 325
  • [5] Model-based deep embedding for constrained clustering analysis of single cell RNA-seq data
    Tian, Tian
    Zhang, Jie
    Lin, Xiang
    Wei, Zhi
    Hakonarson, Hakon
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [6] Model-based deep embedding for constrained clustering analysis of single cell RNA-seq data
    Tian Tian
    Jie Zhang
    Xiang Lin
    Zhi Wei
    Hakon Hakonarson
    Nature Communications, 12
  • [7] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [8] MulCNN: An efficient and accurate deep learning method based on gene embedding for cell type identification in single-cell RNA-seq data
    Jiao, Linfang
    Ren, Yongqi
    Wang, Lulu
    Gao, Changnan
    Wang, Shuang
    Song, Tao
    FRONTIERS IN GENETICS, 2023, 14
  • [9] Single-cell RNA-seq data analysis based on directed graph neural network
    Feng, Xiang
    Zhang, Hongqi
    Lin, Hao
    Long, Haixia
    METHODS, 2023, 211 : 48 - 60
  • [10] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    GENOME BIOLOGY, 2019, 20 (1)