Mclmpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data

被引:49
|
作者
Mongia, Aanchal [1 ]
Sengupta, Debarka [1 ,2 ]
Majumdar, Angshul [3 ]
机构
[1] Indraprastha Inst Informat Technol Delhi, Dept Comp Sci & Engn, New Delhi, India
[2] Indraprastha Inst Informat Technol Delhi, Ctr Computat Biol, New Delhi, India
[3] Indraprastha Inst Informat Technol Delhi, Dept Elect & Commun Engn, New Delhi, India
关键词
scRNA-seq; dropouts; imputation; matrix completion; Nuclear norm minization; HETEROGENEITY; EMBRYOS;
D O I
10.3389/fgene.2019.00009
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Motivation: Single-cell RNA sequencing has been proved to be revolutionary for its potential of zooming into complex biological systems. Genome-wide expression analysis at single-cell resolution provides a window into dynamics of cellular phenotypes. This facilitates the characterization of transcriptional heterogeneity in normal and diseased tissues under various conditions. It also sheds light on the development or emergence of specific cell populations and phenotypes. However, owing to the paucity of input RNA, a typical single cell RNA sequencing data features a high number of dropout events where transcripts fail to get amplified. Results: We introduce mclmpute, a low-rank matrix completion based technique to impute dropouts in single cell expression data. On a number of real datasets, application of mclmpute yields significant improvements in the separation of true zeros from dropouts, cell-clustering, differential expression analysis, cell type separability, the performance of dimensionality reduction techniques for cell visualization, and gene distribution.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data
    Peijie Lin
    Michael Troup
    Joshua W. K. Ho
    Genome Biology, 18
  • [32] Imputation method for single-cell RNA-seq data using neural topic model
    Qi, Yueyang
    Han, Shuangkai
    Tang, Lin
    Liu, Lin
    GIGASCIENCE, 2023, 12
  • [33] CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data
    Lin, Peijie
    Troup, Michael
    Ho, Joshua W. K.
    GENOME BIOLOGY, 2017, 18
  • [34] A deep matrix factorization based approach for single-cell RNA-seq data clustering
    Liang, Zhenlan
    Zheng, Ruiqing
    Chen, Siqi
    Yan, Xuhua
    Li, Min
    METHODS, 2022, 205 : 114 - 122
  • [35] CMF-Impute: an accurate imputation tool for single-cell RNA-seq data
    Xu, Junlin
    Cai, Lijun
    Liao, Bo
    Zhu, Wen
    Yang, JiaLiang
    BIOINFORMATICS, 2020, 36 (10) : 3139 - 3147
  • [36] CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data
    Shi, Yuchen
    Wan, Jian
    Zhang, Xin
    Yin, Yuyu
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [37] An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
    Sun, Xifang
    Sun, Shiquan
    Yang, Sheng
    CELLS, 2019, 8 (10)
  • [38] scINRB: single-cell gene expression imputation with network regularization and bulk RNA-seq data
    Kang, Yue
    Zhang, Hongyu
    Guan, Jinting
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [39] Collaborative Structure-Preserved Missing Data Imputation for Single-Cell RNA-Seq Clustering
    Gao, Hang
    Shen, Wenjun
    Li, Rui
    Liu, Cheng
    Wu, Si
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (05) : 1480 - 1491
  • [40] TsImpute: an accurate two-step imputation method for single-cell RNA-seq data
    Zheng, Weihua
    Min, Wenwen
    Wang, Shunfang
    BIOINFORMATICS, 2023, 39 (12)