Graph-Regularized Non-Negative Matrix Factorization for Single-Cell Clustering in scRNA-Seq Data

被引:0
|
作者
Jiang, Hanjing [1 ]
Wang, Mei-Neng [2 ,3 ]
Huang, Yu-An [4 ]
Huang, Yabing [5 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab Image Informat Proc & Intelligent Control, Minist China, Wuhan, Peoples R China
[2] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
[3] Yichun Univ, Sch Math & Comp Sci, Yichun 336000, Peoples R China
[4] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[5] Wuhan Univ, Renmin Hosp, Dept Pathol, Wuhan 430060, Peoples R China
基金
中国国家自然科学基金;
关键词
Kernel; Laplace equations; Bioinformatics; Clustering algorithms; Sparse matrices; Data mining; Unsupervised learning; scRNA-seq; non-negative matrix factorization; clustering; gene marker; HETEROGENEITY;
D O I
10.1109/JBHI.2024.3400050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The advent of single-cell RNA sequencing (scRNA-seq) has brought forth fresh perspectives on intricate biological processes, revealing the nuances and divergences present among distinct cells. Accurate single-cell analysis is a crucial prerequisite for in-depth investigation into the underlying mechanisms of heterogeneity. Due to various technical noises, like the impact of dropout values, scRNA-seq data remains challenging to interpret. In this work, we propose an unsupervised learning framework for scRNA-seq data analysis (aka Sc-GNNMF). Based on the non-negativity and sparsity of scRNA-seq data, we propose employing graph-regularized non-negative matrix factorization (GNNMF) algorithm for the analysis of scRNA-seq data, which involves estimating cell-cell sparse similarity and gene-gene sparse similarity through Laplacian kernels and p-nearest neighbor graphs (p-NNG). By assuming intrinsic geometric local invariance, we use a weighted p-nearest known neighbors (p-NKN) to optimize the scRNA-seq data. The optimized scRNA-seq data then participates in the matrix decomposition process, promoting the closeness of cells with similar types in cell-gene data space and determining a more suitable embedding space for clustering. Sc-GNNMF demonstrates superior performance compared to other methods and maintains satisfactory compatibility and robustness, as evidenced by experiments on 11 real scRNA-seq datasets. Furthermore, Sc-GNNMF yields excellent results in clustering tasks, extracting useful gene markers, and pseudo-temporal analysis.
引用
收藏
页码:4986 / 4994
页数:9
相关论文
共 50 条
  • [41] NMFLRR: Clustering scRNA-Seq Data by Integrating Nonnegative Matrix Factorization With Low Rank Representation
    Zhang, Wei
    Xue, Xiaoli
    Zheng, Xiaoying
    Fan, Zizhu
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (03) : 1394 - 1405
  • [42] A novel regularized asymmetric non-negative matrix factorization for text clustering
    Aghdam, Mehdi Hosseinzadeh
    Zanjani, Mohammad Daryaie
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (06)
  • [43] Graph regularized projective non-negative matrix factorization for face recognition
    Yu, Z. (yuzz@jlu.edu.cn), 2013, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [44] Robust automated graph regularized discriminative non-negative matrix factorization
    Xianzhong Long
    Jian Xiong
    Lei Chen
    Multimedia Tools and Applications, 2021, 80 : 14867 - 14886
  • [45] PROJECTIVE NON-NEGATIVE MATRIX FACTORIZATION FOR UNSUPERVISED GRAPH CLUSTERING
    Bampis, Christos G.
    Maragos, Petros
    Bovik, Alan C.
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1254 - 1258
  • [46] Dual graph-regularized Constrained Nonnegative Matrix Factorization for Image Clustering
    Sun, Jing
    Cai, Xibiao
    Sun, Fuming
    Hong, Richang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (05): : 2607 - 2627
  • [47] Network-Based Structural Learning Nonnegative Matrix Factorization Algorithm for Clustering of scRNA-Seq Data
    Wu, Wenming
    Ma, Xiaoke
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 566 - 575
  • [48] SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data
    Qiu, Yushan
    Yan, Chang
    Zhao, Pu
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [49] Multi-View Clustering With Graph Learning for scRNA-Seq Data
    Wu, Wenming
    Zhang, Wensheng
    Hou, Weimin
    Ma, Xiaoke
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (06) : 3535 - 3546
  • [50] Non-Negative Low-Rank Representation With Similarity Correction for Cell Type Identification in scRNA-Seq Data
    Liu, Jing-Xing
    Zhang, Dai-Jun
    Zhao, Jing-Xiu
    Zheng, Chun-Hou
    Gao, Ying-Lian
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (06) : 3737 - 3747