Single-cell RNA-seq data clustering by deep information fusion

Cited by: 3
Authors
Ren, Liangrui [2 ]
Wang, Jun [3 ]
Li, Wei [4 ]
Guo, Maozu [5 ]
Yu, Guoxian [1 ,2 ]
Affiliations
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Shandong Univ, Sch Software, Jinan, Peoples R China
[3] Shandong Univ, Joint SDU NTU Ctr Artificial Intelligence Res C FA, Jinan, Peoples R China
[4] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China
[5] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
single-cell RNA-seq clustering; graph convolution network; deep auto-encoder; ZINB; transcriptomics; VISUALIZATION; COMPLEX;
DOI
10.1093/bfgp/elad017
Chinese Library Classification
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Determining cell types from single-cell transcriptomics data is fundamental for downstream analysis. However, cell clustering and data imputation still face computational challenges due to the high dropout rate, sparsity and dimensionality of single-cell data. Although some deep learning-based solutions have been proposed to handle these challenges, they still cannot leverage gene attribute information and cell topology in a sensible way to obtain a consistent clustering. In this paper, we present scDeepFC, a deep information fusion-based method for single-cell clustering and data imputation. Specifically, scDeepFC uses a deep auto-encoder (DAE) network and a deep graph convolution network to embed high-dimensional gene attribute information and high-order cell-cell topological information into different low-dimensional representations, and then fuses them via a deep information fusion network to generate a more comprehensive and accurate consensus representation. In addition, scDeepFC integrates the zero-inflated negative binomial (ZINB) model into the DAE to model dropout events. By jointly optimizing the ZINB loss and the cell graph reconstruction loss, scDeepFC generates a salient embedding representation for clustering cells and imputing missing data. Extensive experiments on real single-cell datasets show that scDeepFC outperforms other popular single-cell analysis methods, and that both gene attribute and cell topology information improve cell clustering.
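The abstract names a ZINB loss but does not give its formula. As a minimal sketch, assuming the commonly used parameterization (mean `mu`, dispersion `theta`, dropout probability `pi`), the negative log-likelihood could be computed as below. The function names are hypothetical and this is not taken from the scDeepFC implementation; in an auto-encoder, `mu`, `theta` and `pi` would normally be per-entry decoder outputs rather than the shared scalars used here for brevity.

```python
import math


def nb_log_pmf(x, mu, theta):
    """Log-probability of count x under a negative binomial
    with mean mu > 0 and dispersion theta > 0 (assumed parameterization)."""
    return (
        math.lgamma(x + theta) - math.lgamma(theta) - math.lgamma(x + 1)
        + theta * math.log(theta / (theta + mu))
        + x * math.log(mu / (theta + mu))
    )


def zinb_log_pmf(x, mu, theta, pi):
    """Log-probability of count x under a zero-inflated negative binomial.
    pi in [0, 1) is the probability of an extra (dropout) zero."""
    nb = nb_log_pmf(x, mu, theta)
    if x == 0:
        # A zero is either a dropout or a genuine NB zero.
        return math.log(pi + (1.0 - pi) * math.exp(nb))
    # A non-zero count can only come from the NB component.
    return math.log(1.0 - pi) + nb


def zinb_nll(counts, mu, theta, pi):
    """Mean negative log-likelihood over a list of counts,
    using shared scalar parameters (a simplification)."""
    return -sum(zinb_log_pmf(x, mu, theta, pi) for x in counts) / len(counts)
```

For example, `zinb_nll([0, 0, 1, 4], mu=2.0, theta=1.5, pi=0.3)` scores a small count vector; setting `pi=0.0` reduces the model to a plain negative binomial.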
Pages: 128-137
Page count: 10
Related papers
50 in total
  • [21] scGMAI: a Gaussian mixture model for clustering single-cell RNA-Seq data based on deep autoencoder
    Yu, Bin
    Chen, Chen
    Qi, Ren
    Zheng, Ruiqing
    Skillman-Lawrence, Patrick J.
    Wang, Xiaolin
    Ma, Anjun
    Gu, Haiming
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [22] scGAC: a graph attentional architecture for clustering single-cell RNA-seq data
    Cheng, Yi
    Ma, Xiuli
    BIOINFORMATICS, 2022, 38 (08) : 2187 - 2193
  • [23] Clustering and visualization of single-cell RNA-seq data using path metrics
    Manousidaki, Andriana
    Little, Anna
    Xie, Yuying
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (05)
  • [24] Single-cell RNA-seq data clustering: A survey with performance comparison study
    Li, Ruiyi
    Guan, Jihong
    Zhou, Shuigeng
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2020, 18 (04)
  • [25] Consensus clustering of single-cell RNA-seq data by enhancing network affinity
    Cui, Yaxuan
    Zhang, Shaoqiang
    Liang, Ying
    Wang, Xiangyun
    Ferraro, Thomas N.
    Chen, Yong
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [26] scMUG: deep clustering analysis of single-cell RNA-seq data on multiple gene functional modules
    Liang, De-Min
    Du, Pu-Feng
    BRIEFINGS IN BIOINFORMATICS, 2025, 26 (02)
  • [27] SC3: consensus clustering of single-cell RNA-seq data
    Kiselev, Vladimir Yu
    Kirschner, Kristina
    Schaub, Michael T.
    Andrews, Tallulah
    Yiu, Andrew
    Chandra, Tamir
    Natarajan, Kedar N.
    Reik, Wolf
    Barahona, Mauricio
    Green, Anthony R.
    Hemberg, Martin
    NATURE METHODS, 2017, 14 (05) : 483 - +
  • [28] A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data
    Srinivasan, Suhas
    Leshchyk, Anastasia
    Johnson, Nathan T.
    Korkin, Dmitry
    RNA, 2020, 26 (10) : 1303 - 1319
  • [29] Toward Convex Manifolds: A Geometric Perspective for Deep Graph Clustering of Single-cell RNA-seq Data
    Mrabah, Nairouz
    Amar, Mohamed Mahmoud
    Bouguessa, Mohamed
    Diallo, Abdoulaye Banire
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4855 - 4863
  • [30] Clustering single-cell RNA-seq data by rank constrained similarity learning
    Mei, Qinglin
    Li, Guojun
    Su, Zhengchang
    BIOINFORMATICS, 2021, 37 (19) : 3235 - 3242