GNN-based embedding for clustering scRNA-seq data

被引:29
|
作者
Ciortan, Madalina [1 ]
Defrance, Matthieu [1 ]
机构
[1] Univ Libre Bruxelles, Interuniv Inst Bioinformat Brussels, Brussels, Belgium
关键词
CELL; ATLAS;
D O I
10.1093/bioinformatics/btab787
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the high sparsity of scRNA-seq data, accentuated by dropout events generates challenges that have motivated the development of numerous dedicated clustering methods. Nevertheless, there is still no consensus on the best performing method. Results: graph-sc is a new method leveraging a graph autoencoder network to create embeddings for scRNA-seq cell data. While this work analyzes the performance of clustering the embeddings with various clustering algorithms, other downstream tasks can also be performed. A broad experimental study has been performed on both simulated and scRNA-seq datasets. The results indicate that although there is no consistently best method across all the analyzed datasets, graph-sc compares favorably to competing techniques across all types of datasets. Furthermore, the proposed method is stable across consecutive runs, robust to input down-sampling, generally insensitive to changes in the network architecture or training parameters and more computationally efficient than other competing methods based on neural networks. Modeling the data as a graph provides increased flexibility to define custom features characterizing the genes, the cells and their interactions. Moreover, external data (e.g. gene network) can easily be integrated into the graph and used seamlessly under the same optimization task.
引用
收藏
页码:1037 / 1044
页数:8
相关论文
共 50 条
  • [31] Boosting scRNA-seq data clustering by cluster-aware feature weighting
    Li, Rui-Yi
    Guan, Jihong
    Zhou, Shuigeng
    [J]. BMC BIOINFORMATICS, 2021, 22 (SUPPL 6)
  • [32] Computational approaches for interpreting scRNA-seq data
    Rostom, Raghd
    Svensson, Valentine
    Teichmann, Sarah A.
    Kar, Gozde
    [J]. FEBS LETTERS, 2017, 591 (15) : 2213 - 2225
  • [33] Cerebro: interactive visualization of scRNA-seq data
    Hillje, Roman
    Pelicci, Pier Giuseppe
    Luzi, Lucilla
    [J]. BIOINFORMATICS, 2020, 36 (07) : 2311 - 2313
  • [34] Network-Based Structural Learning Nonnegative Matrix Factorization Algorithm for Clustering of scRNA-Seq Data
    Wu, Wenming
    Ma, Xiaoke
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 566 - 575
  • [35] JLONMFSC: Clustering scRNA-seq data based on joint learning of non-negative matrix factorization and subspace clustering
    Lan, Wei
    Liu, Mingyang
    Chen, Jianwei
    Ye, Jin
    Zheng, Ruiqing
    Zhu, Xiaoshu
    Peng, Wei
    [J]. METHODS, 2024, 222 : 1 - 9
  • [36] A subspace clustering method for satisfying stoimetric constraints in scRNA-seq
    Huang, Angela
    Kim, Junhyong
    [J]. 2021 IEEE 21ST INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (IEEE BIBE 2021), 2021,
  • [37] AAE-SC: A scRNA-Seq Clustering Framework Based on Adversarial Autoencoder
    Wu, Yulun
    Guo, Yanming
    Xiao, Yandong
    Lao, Songyang
    [J]. IEEE ACCESS, 2020, 8 : 178962 - 178975
  • [38] Robust Graph Regularized NMF with Dissimilarity and Similarity Constraints for ScRNA-seq Data Clustering
    Shu, Zhenqiu
    Long, Qinghan
    Zhang, Luping
    Yu, Zhengtao
    Wu, Xiao-Jun
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (23) : 6271 - 6286
  • [39] Supervised capacity preserving mapping: a clustering guided visualization method for scRNA-seq data
    Zhai, Zhiqian
    Lei, Yu L.
    Wang, Rongrong
    Xie, Yuying
    [J]. BIOINFORMATICS, 2022, 38 (09) : 2496 - 2503
  • [40] Clustering Deviation Index (CDI): a robust and accurate internal measure for evaluating scRNA-seq data clustering
    Fang, Jiyuan
    Chan, Cliburn
    Owzar, Kouros
    Wang, Liuyang
    Qin, Diyuan
    Li, Qi-Jing
    Xie, Jichun
    [J]. GENOME BIOLOGY, 2022, 23 (01)