A semi-supervised non-negative matrix factorization model for scRNA-seq data analysis

被引:0
|
作者
Lan, Junjie [1 ,2 ]
Zhuo, Xiaoling [1 ,2 ]
Ye, Siman [1 ,2 ]
Deng, Jin [1 ,2 ]
机构
[1] South China Agr Univ, Coll Math & Informat, Guangzhou 510642, Peoples R China
[2] Pazhou Lab, Guangzhou 510330, Peoples R China
关键词
scRNA-seq; Non-negative matrix factorization; Dimensionality reduction; Clustering; Omics; GENE-EXPRESSION; SINGLE;
D O I
10.1016/j.asoc.2025.112982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Single-cell RNA sequencing (scRNA-seq) technology enables the measurement of cellular gene expression at the single-cell level, thus facilitating cell clustering at the gene level. Despite numerous dimensionality reduction methods developed for scRNA-seq data, many are limited to analyzing individual gene expression matrices and struggle to address false positives and false zero expression entries effectively. Moreover, existing methods often underutilize prior knowledge of similarity and dissimilarity between multi-omics data, leading to the loss of intercellular correlations and shared structural information, thus hindering desired dimensionality reduction outcomes. To address these limitations, a novel model termed joint non-negative matrix factorization with similarity and dissimilarity constraints (SDJNMF) was proposed to tailor for scRNA-seq data clustering. The model leverages prior knowledge of similarity and dissimilarity across multiple gene expression matrices, facilitating joint non-negative matrix factorization to extract common features from multi-omics data. By preserving shared structural and cellular relevance information, SDJNMF enhances the clustering of similar cells while effectively separating dissimilar ones. Furthermore, the SDJNMF model incorporates sparse Singular Value Decomposition during initialization to mitigate noise and redundancy and ensure robust dimensionality reduction. The experimental results demonstrate that the SDJNMF model exhibits superior performance on the 10 datasets, not only outperforming the other 14 algorithms in terms of clustering accuracy on the 9 datasets, but also enhancing the ARI of SDJNMF by an average of 0.0687 in comparison to the second-best algorithm on each dataset. In the visual representation, the model is able to efficiently and accurately cluster similar cells and effectively discriminate different classes of cells from each other. Additionally, the SDJNMF model was applied to identify informative genes and conduct enrichment analysis, validating that genes identified by SDJNMF significantly influence biological processes. Overall, the SDJNMF offers innovative tools for cell cluster identification and advances biological research. The source code of SDJNMF is available online at https://github.com/Jindsmu/SDJNMF.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Constrained Projective Non-negative Matrix Factorization for Semi-supervised Multi-label Learning
    Zhang, Xiang
    Guan, Naiyang
    Luo, Zhigang
    Yang, Xuejun
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 588 - 593
  • [32] Semi-Supervised Symmetric Non-Negative Matrix Factorization With Low-Rank Tensor Representation
    Jia, Yuheng
    Li, Jia-Nan
    Wu, Wenhui
    Wang, Ran
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1534 - 1547
  • [33] NMFE-SSCC: Non-negative matrix factorization ensemble for semi-supervised collective classification
    Wu, Qingyao
    Tan, Mingkui
    Li, Xutao
    Min, Huaqing
    Sun, Ning
    KNOWLEDGE-BASED SYSTEMS, 2015, 89 : 160 - 172
  • [34] Non-negative Independent Factor Analysis disentangles discrete and continuous sources of variation in scRNA-seq data
    Mao, Weiguang
    Pouyan, Maziyar Baran
    Kostka, Dennis
    Chikina, Maria
    BIOINFORMATICS, 2022, 38 (10) : 2749 - 2756
  • [35] Semi-supervised multi-view clustering by label relaxation based non-negative matrix factorization
    Yang, Zuyuan
    Zhang, Huimin
    Liang, Naiyao
    Li, Zhenni
    Sun, Weijun
    VISUAL COMPUTER, 2023, 39 (04): : 1409 - 1422
  • [36] Semi-supervised multi-view clustering by label relaxation based non-negative matrix factorization
    Zuyuan Yang
    Huimin Zhang
    Naiyao Liang
    Zhenni Li
    Weijun Sun
    The Visual Computer, 2023, 39 : 1409 - 1422
  • [37] Constrained Propagation Self-Adaptived Semi-Supervised Non-Negative Matrix Factorization Clustering Algorithm
    Zhu, Tuoji
    Lin, Haoshen
    Zhao, Weihao
    Wang, Jing
    Yang, Xiaojun
    Computer Engineering and Applications, 2024, 60 (13) : 81 - 91
  • [38] A STATISTICAL APPROACH TO SEMI-SUPERVISED SPEECH ENHANCEMENT WITH LOW-ORDER NON-NEGATIVE MATRIX FACTORIZATION
    Mohammed, Shoaib
    Tashev, Ivan
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 546 - 550
  • [39] Correntropy Supervised Non-negative Matrix Factorization
    Zhang, Wenju
    Guan, Naiyang
    Tao, Dacheng
    Mao, Bin
    Huang, Xuhui
    Luo, Zhigang
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [40] A Unified Non-Negative Matrix Factorization Framework for Semi Supervised Learning on Graphs
    Mitra, Anasua
    Vijayan, Priyesh
    Parthasarathy, Srinivasan
    Ravindran, Balaraman
    PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM), 2020, : 487 - 495