scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization

被引:0
|
作者
Qian, Yuqing [1 ,2 ]
Zou, Quan [1 ,2 ]
Zhao, Mengyuan [3 ]
Liu, Yi [1 ,2 ]
Guo, Fei [4 ]
Ding, Yijie [2 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Peoples R China
[3] Shenzhen Inst Adv Technol, Chinese Acad Sci, Shenzhen, Peoples R China
[4] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
EXPRESSION;
D O I
10.1371/journal.pcbi.1012339
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell RNA sequencing (scRNA-seq) has emerged as a powerful tool in genomics research, enabling the analysis of gene expression at the individual cell level. However, scRNA-seq data often suffer from a high rate of dropouts, where certain genes fail to be detected in specific cells due to technical limitations. This missing data can introduce biases and hinder downstream analysis. To overcome this challenge, the development of effective imputation methods has become crucial in the field of scRNA-seq data analysis. Here, we propose an imputation method based on robust and non-negative matrix factorization (scRNMF). Instead of other matrix factorization algorithms, scRNMF integrates two loss functions: L2 loss and C-loss. The L2 loss function is highly sensitive to outliers, which can introduce substantial errors. We utilize the C-loss function when dealing with zero values in the raw data. The primary advantage of the C-loss function is that it imposes a smaller punishment for larger errors, which results in more robust factorization when handling outliers. Various datasets of different sizes and zero rates are used to evaluate the performance of scRNMF against other state-of-the-art methods. Our method demonstrates its power and stability as a tool for imputation of scRNA-seq data. It is still difficult to analyze scRNA-seq data because a significant portion of expressed genes have zeros. Gene expression levels can be restored through the imputation of scRNA-seq data, facilitating downstream analysis. To overcome this challenge, we propose an imputation method based on robust and non-negative matrix factorization (scRNMF). Instead of other matrix factorization algorithms, scRNMF integrates two loss functions: L2 loss and C-loss. Through the use of several simulated and real datasets, we perform an comprehensively evaluation of scRNMF against existing methods. scRNMF can enhance various aspects of downstream analysis, including gene expression data recovery, cell clustering analysis, gene differential expression analysis, and cellular trajectory reconstruction. The results of our study demonstrate that scRNMF is a powerful tool that can improve the accuracy of single-cell data analysis.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Imputation for Single-cell RNA-seq Data with Non-negative Matrix Factorization and Transfer Learning
    Zhu, Jiadi
    Yang, Youlong
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2023, 21 (06)
  • [2] Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization
    Zhu, Xun
    Ching, Travers
    Pan, Xinghua
    Weissman, Sherman M.
    Garmire, Lana
    [J]. PEERJ, 2017, 5
  • [3] SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data
    Qiu, Yushan
    Yan, Chang
    Zhao, Pu
    Zou, Quan
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [4] An accurate and robust imputation method scImpute for single-cell RNA-seq data
    Wei Vivian Li
    Jingyi Jessica Li
    [J]. Nature Communications, 9
  • [5] An accurate and robust imputation method scImpute for single-cell RNA-seq data
    Li, Wei Vivian
    Li, Jingyi Jessica
    [J]. NATURE COMMUNICATIONS, 2018, 9
  • [6] Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
    Jia, Zhilong
    Zhang, Xiang
    Guan, Naiyang
    Bo, Xiaochen
    Barnes, Michael R.
    Luo, Zhigang
    [J]. PLOS ONE, 2015, 10 (09):
  • [7] deepMc: Deep Matrix Completion for Imputation of Single-Cell RNA-seq Data
    Mongia, Aanchal
    Sengupta, Debarka
    Majumdar, Angshul
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (07) : 1011 - 1019
  • [8] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    [J]. GENOME BIOLOGY, 2019, 20 (1)
  • [9] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Tao Peng
    Qin Zhu
    Penghang Yin
    Kai Tan
    [J]. Genome Biology, 20
  • [10] Evaluating imputation methods for single-cell RNA-seq data
    Yi Cheng
    Xiuli Ma
    Lang Yuan
    Zhaoguo Sun
    Pingzhang Wang
    [J]. BMC Bioinformatics, 24