GNN-based embedding for clustering scRNA-seq data

被引:29
|
作者
Ciortan, Madalina [1 ]
Defrance, Matthieu [1 ]
机构
[1] Univ Libre Bruxelles, Interuniv Inst Bioinformat Brussels, Brussels, Belgium
关键词
CELL; ATLAS;
D O I
10.1093/bioinformatics/btab787
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the high sparsity of scRNA-seq data, accentuated by dropout events generates challenges that have motivated the development of numerous dedicated clustering methods. Nevertheless, there is still no consensus on the best performing method. Results: graph-sc is a new method leveraging a graph autoencoder network to create embeddings for scRNA-seq cell data. While this work analyzes the performance of clustering the embeddings with various clustering algorithms, other downstream tasks can also be performed. A broad experimental study has been performed on both simulated and scRNA-seq datasets. The results indicate that although there is no consistently best method across all the analyzed datasets, graph-sc compares favorably to competing techniques across all types of datasets. Furthermore, the proposed method is stable across consecutive runs, robust to input down-sampling, generally insensitive to changes in the network architecture or training parameters and more computationally efficient than other competing methods based on neural networks. Modeling the data as a graph provides increased flexibility to define custom features characterizing the genes, the cells and their interactions. Moreover, external data (e.g. gene network) can easily be integrated into the graph and used seamlessly under the same optimization task.
引用
收藏
页码:1037 / 1044
页数:8
相关论文
共 50 条
  • [1] scCNC: a method based on capsule network for clustering scRNA-seq data
    Wang, Hai-Yun
    Zhao, Jian-Ping
    Zheng, Chun-Hou
    Su, Yan-Sen
    [J]. BIOINFORMATICS, 2022, 38 (15) : 3703 - 3709
  • [2] Contrastive self-supervised clustering of scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [3] scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data
    Wang, HaiYun
    Zhao, JianPing
    Zheng, ChunHou
    Su, YanSen
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (12)
  • [4] Contrastive self-supervised clustering of scRNA-seq data
    Madalina Ciortan
    Matthieu Defrance
    [J]. BMC Bioinformatics, 22
  • [5] A clustering method for small scRNA-seq data based on subspace and weighted distance
    Ning, Zilan
    Dai, Zhijun
    Zhang, Hongyan
    Chen, Yuan
    Yuan, Zheming
    [J]. PEERJ, 2023, 11 : 28 - 28
  • [6] Deep embedded clustering with multiple objectives on scRNA-seq data
    Li, Xiangtao
    Zhang, Shixiong
    Wong, Ka-Chun
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [7] Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data
    Gan, Yanglan
    Chen, Yuhan
    Xu, Guangwei
    Guo, Wenjing
    Zou, Guobing
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [8] scTPC: a novel semisupervised deep clustering model for scRNA-seq data
    Qiu, Yushan
    Yang, Lingfei
    Jiang, Hao
    Zou, Quan
    [J]. BIOINFORMATICS, 2024, 40 (05)
  • [9] A Streamlined scRNA-Seq Data Analysis Framework Based on Improved Sparse Subspace Clustering
    Zhuang, Jujuan
    Cui, Lingyu
    Qu, Tianqi
    Ren, Changjing
    Xu, Junlin
    Li, Tianbao
    Tian, Geng
    Yang, Jialiang
    [J]. IEEE ACCESS, 2021, 9 : 9719 - 9727
  • [10] A framework for scRNA-seq data clustering based on multi-view feature integration
    Li, Feng
    Liu, Yang
    Liu, Jinxing
    Ge, Daohui
    Shang, Junliang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89