scTPC: a novel semisupervised deep clustering model for scRNA-seq data

被引:0
|
作者
Qiu, Yushan [1 ]
Yang, Lingfei [1 ]
Jiang, Hao [2 ]
Zou, Quan [3 ]
机构
[1] Shenzhen Univ, Sch Math Sci, Shenzhen 518000, Guangdong, Peoples R China
[2] Renmin Univ China, Sch Math, 59 Zhongguancun St, Beijing 100872, Peoples R China
[3] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 610056, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1093/bioinformatics/btae293
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Continuous advancements in single-cell RNA sequencing (scRNA-seq) technology have enabled researchers to further explore the study of cell heterogeneity, trajectory inference, identification of rare cell types, and neurology. Accurate scRNA-seq data clustering is crucial in single-cell sequencing data analysis. However, the high dimensionality, sparsity, and presence of "false" zero values in the data can pose challenges to clustering. Furthermore, current unsupervised clustering algorithms have not effectively leveraged prior biological knowledge, making cell clustering even more challenging.Results This study investigates a semisupervised clustering model called scTPC, which integrates the triplet constraint, pairwise constraint, and cross-entropy constraint based on deep learning. Specifically, the model begins by pretraining a denoising autoencoder based on a zero-inflated negative binomial distribution. Deep clustering is then performed in the learned latent feature space using triplet constraints and pairwise constraints generated from partial labeled cells. Finally, to address imbalanced cell-type datasets, a weighted cross-entropy loss is introduced to optimize the model. A series of experimental results on 10 real scRNA-seq datasets and five simulated datasets demonstrate that scTPC achieves accurate clustering with a well-designed framework.Availability and implementation scTPC is a Python-based algorithm, and the code is available from https://github.com/LF-Yang/Code or https://zenodo.org/records/10951780.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data
    Wang, HaiYun
    Zhao, JianPing
    Zheng, ChunHou
    Su, YanSen
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (12)
  • [2] Deep embedded clustering with multiple objectives on scRNA-seq data
    Li, Xiangtao
    Zhang, Shixiong
    Wong, Ka-Chun
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [3] scSDSC: Self-supervised Deep Subspace Clustering for scRNA-seq Data
    Yang, Bo
    Wang, Hai-Yun
    Zhao, Jian-Ping
    Zheng, Chun-Hou
    [J]. CURRENT BIOINFORMATICS, 2024,
  • [4] scSFCL:Deep clustering of scRNA-seq data with subspace feature confidence learning
    Meng, Xiaokun
    Zhang, Yuanyuan
    Xu, Xiaoyu
    Zhang, Kaihao
    Feng, Baoming
    [J]. Computational Biology and Chemistry, 2025, 114
  • [5] Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data
    Gan, Yanglan
    Chen, Yuhan
    Xu, Guangwei
    Guo, Wenjing
    Zou, Guobing
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [6] Contrastive self-supervised clustering of scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [7] GNN-based embedding for clustering scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    [J]. BIOINFORMATICS, 2022, 38 (04) : 1037 - 1044
  • [8] Contrastive self-supervised clustering of scRNA-seq data
    Madalina Ciortan
    Matthieu Defrance
    [J]. BMC Bioinformatics, 22
  • [9] scVIC: deep generative modeling of heterogeneity for scRNA-seq data
    Xiong, Jiankang
    Gong, Fuzhou
    Ma, Liang
    Wan, Lin
    [J]. BIOINFORMATICS ADVANCES, 2024, 4 (01):
  • [10] Dual-GCN-based deep clustering with triplet contrast for ScRNA-seq data analysis?
    Wang, Linjie
    Li, Wei
    Xie, Weidong
    Wang, Rui
    Yu, Kun
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2023, 106