SC-JNMF: single-cell clustering integrating multiple quantification methods based on joint non-negative matrix factorization

Cited by: 7
Authors
Shiga, Mikio [1]
Seno, Shigeto [1]
Onizuka, Makoto [1]
Matsuda, Hideo [1]
Affiliations
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Osaka, Japan
Source
PEERJ | 2021 / Vol. 9
Keywords
Single-cell; RNA-seq; Non-negative matrix factorization; Clustering; DISCOVERY; MODULES;
DOI
10.7717/peerj.12087
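Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code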
07 ; 0710 ; 09 ;
Abstract
Single-cell RNA-sequencing is a rapidly evolving technology that enables us to understand biological processes at unprecedented resolution. Single-cell expression analysis requires a complex data processing pipeline, which is divided into two main parts: the quantification part, which converts the sequence information into gene-cell matrix data, and the analysis part, which analyzes the matrix data using statistics and/or machine learning techniques. In the analysis part, unsupervised cell clustering plays an important role in identifying cell types and discovering cell diversity and subpopulations. Identified cell clusters are also used for subsequent analysis, such as finding differentially expressed genes and inferring cell trajectories. However, single-cell clustering using gene expression profiles shows different results depending on the quantification method, because clustering results are greatly affected by the quantification method used in the upstream process. In other words, even if the original RNA-sequence data is the same, gene expression profiles processed by different quantification methods will produce different clusters. In this article, we propose a robust and highly accurate clustering method based on joint non-negative matrix factorization (joint-NMF) that utilizes the information from multiple gene expression profiles quantified using different methods from the same RNA-sequence data. Our joint-NMF can extract common factors among multiple gene expression profiles by applying each NMF under the constraint that one of the factorized matrices is shared among multiple NMFs. The joint-NMF determines more robust and accurate cell clustering results by leveraging multiple quantification methods compared to conventional clustering methods, which use only a single gene expression profile. Additionally, we showed the usefulness of discovering marker genes with the extracted features using our method.
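The shared-factor constraint described in the abstract can be illustrated with a minimal sketch. It is not the authors' SC-JNMF implementation: it assumes a plain sum-of-squared-errors objective, standard multiplicative updates, and no normalization or regularization. The function names `joint_nmf` and `objective`, the parameters `rank`, `n_iter`, and `eps`, and the toy data are all hypothetical. Each gene-by-cell matrix X_k gets its own gene factor W_k, while a single cell factor H is shared across all of them.

```python
import numpy as np

def joint_nmf(matrices, rank, n_iter=200, eps=1e-10, seed=0):
    """Factor each X_k (genes x cells) as W_k @ H with one shared H (assumed objective)."""
    rng = np.random.default_rng(seed)
    n_cells = matrices[0].shape[1]
    # One gene-factor matrix per quantification method, one shared cell-factor matrix.
    W = [rng.random((X.shape[0], rank)) for X in matrices]
    H = rng.random((rank, n_cells))
    for _ in range(n_iter):
        # Update each W_k with H fixed (standard multiplicative NMF rule).
        for k, X in enumerate(matrices):
            W[k] *= (X @ H.T) / (W[k] @ (H @ H.T) + eps)
        # Update the shared H using contributions summed over all matrices.
        numer = sum(Wk.T @ X for Wk, X in zip(W, matrices))
        denom = sum(Wk.T @ Wk for Wk in W) @ H + eps
        H *= numer / denom
    return W, H

def objective(matrices, W, H):
    """Sum of squared reconstruction errors over all input matrices."""
    return sum(np.linalg.norm(X - Wk @ H) ** 2 for X, Wk in zip(matrices, W))

# Toy data: two "quantifications" of the same 40 cells (hypothetical values).
rng = np.random.default_rng(1)
true_H = np.kron(np.eye(2), np.ones((1, 20)))  # two groups of 20 cells
X1 = rng.random((30, 2)) @ true_H + 0.01 * rng.random((30, 40))
X2 = rng.random((25, 2)) @ true_H + 0.01 * rng.random((25, 40))

W, H = joint_nmf([X1, X2], rank=2)
labels = H.argmax(axis=0)  # assign each cell to its dominant shared factor
```

Taking the largest entry in each column of H is one simple way to turn the shared factor into cluster labels; other choices, such as running a clustering algorithm on the columns of H, are equally plausible under these assumptions.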
Pages: 18
Related papers
50 in total
  • [31] NON-NEGATIVE MATRIX FACTORIZATION BASED UNCERTAINTY QUANTIFICATION METHOD FOR COMPLEX NETWORKED SYSTEMS
    Mukherjee, Arpan
    Rai, Rahul
    Singla, Puneet
    Singh, Tarunraj
    Patra, Abani
    INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2015, VOL 2A, 2016,
  • [32] Rain Removal Using Single Image based on Non-negative Matrix Factorization
    Liu, Pin-Hsian
    Lin, Chih-Yang
    Yeh, Chia-Hung
    Kang, Li-Wei
    Lo, Kyle Shih-Huang
    Hwang, Tai-Hwei
    Kuo, Chia-Chen
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1137 - 1146
  • [33] JSNMFuP: a unsupervised method for the integrative analysis of single-cell multi-omics data based on non-negative matrix factorization
    Zhang, Bai
    Nan, Mengdi
    Wang, Liugen
    Wu, Hanwen
    Chen, Xiang
    Shi, Yongle
    Ma, Yibing
    Gao, Jie
    BMC GENOMICS, 2025, 26 (01):
  • [35] CoGAPS 3: Bayesian non-negative matrix factorization for single-cell analysis with asynchronous updates and sparse data structures
    Sherman, Thomas D.
    Gao, Tiger
    Fertig, Elana J.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [36] Document Clustering Based on Non-Negative Matrix Factorization and Affinity Propagation Using Preference Estimation
    Chen, Jiawei
    Li, Fei
    Wu, Xiaofan
    Zhang, Qinqin
    INDUSTRIAL ENGINEERING, MACHINE DESIGN AND AUTOMATION (IEMDA 2014) & COMPUTER SCIENCE AND APPLICATION (CCSA 2014), 2015, : 380 - 385
  • [38] Multi-modal Multi-view Clustering based on Non-negative Matrix Factorization
    Khalafaoui, Yasser
    Grozavu, Nistor
    Matei, Basarab
    Goix, Laurent-Walter
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1386 - 1391
  • [39] A Latent Semantic Approach to XML Clustering by Content and Structure based on Non-negative Matrix Factorization
    Costa, Gianni
    Ortale, Riccardo
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 179 - 184
  • [40] Clustering Algorithm for Unsupervised Monaural Musical Sound Separation Based on Non-negative Matrix Factorization
    Park, Sang Ha
    Lee, Seokjin
    Sung, Koeng-Mo
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (04) : 818 - 823