A Non-Negative Matrix Factorization-Based Framework for the Analysis of Multi-Class Time-Series Single-Cell RNA-Seq Data

被引:3
|
作者
Jung, Inuk [1 ]
Choi, Joungmin [2 ]
Chae, Heejoon [2 ]
机构
[1] Kyungpook Natl Univ, Dept Comp Sci & Engn, Daegu 41566, South Korea
[2] Sookmyung Womens Univ, Dept Comp Sci, Seoul 04310, South Korea
关键词
Matrix decomposition; Gene expression; Sociology; Statistics; Stem cells; Tools; Mice; multi-class; single-cell; time-series; DIFFERENTIATION; NETWORKS;
D O I
10.1109/ACCESS.2020.2977106
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The development of single-cell RNA sequencing (scRNA-seq) has enabled gene expression to be quantified at single-cell resolution. Such advancement is expected to solve important issues that bulk RNA sequencing could not fully answer, such as inferring cell population heterogeneity, genetic variability of cells, detecting rare cell types, accurately predicting cell states and their localization. However, analyzing such large scale data, especially when they are sampled at multiple time points, brings new challenges in data mining informative genes, compared to single snapshot samples. It becomes even more complicated when gene expression patterns are to be mined from time-series scRNA-seq datasets generated from multiple conditions, which will constitute a data with gene, condition and time dimensions. Here, we focused on detecting gene expression patterns that well capture the underlying biological differences between time-series scRNA-seq datasets of three different types of stem cells. The gene expression profile of 2,128 time-series scRNA-seq samples from long-term hematopoietic stem cells (LT-HSC) and two of its progenitor cell types were analyzed using our framework. We have successfully detected condition specific feature genes that were able to achieve 90.03 & x0025; classification accuracy between the three cell types. Investigating the genes and clusters detected by our framework, we found that cell cycle related genes showed significantly high variance between the three cell types. Such results and transcriptomic characters detected from our analysis were consistent with the original study. Collectively, the framework was able to successfully detect biological meaningful gene sets and expression patterns from multi-condition time-series scRNA-seq samples.
引用
收藏
页码:42342 / 42348
页数:7
相关论文
共 50 条
  • [31] Computational analysis of alternative polyadenylation from standard RNA-seq and single-cell RNA-seq data
    Gao, Yipeng
    Li, Wei
    MRNA 3' END PROCESSING AND METABOLISM, 2021, 655 : 225 - 243
  • [32] One Cell At a Time (OCAT): a unified framework to integrate and analyze single-cell RNA-seq data
    Wang, Chloe X.
    Zhang, Lin
    Wang, Bo
    GENOME BIOLOGY, 2022, 23 (01)
  • [33] One Cell At a Time (OCAT): a unified framework to integrate and analyze single-cell RNA-seq data
    Chloe X. Wang
    Lin Zhang
    Bo Wang
    Genome Biology, 23
  • [34] Erratum to: ‘NMFP: a non-negative matrix factorization based preselection method to increase accuracy of identifying mRNA isoforms from RNA-seq data’
    Yuting Ye
    Jingyi Jessica Li
    BMC Genomics, 17
  • [35] Adding a time resolution to single-cell RNA-seq data with DynaSCOPE™
    Seker, M.
    Gargouri, B.
    Zhu, W.
    Fang, N.
    ACTA PHYSIOLOGICA, 2022, 236 : 981 - 983
  • [36] Locally Manifold Non-negative Matrix Factorization Based on Centroid for scRNA-seq Data Analysis
    Wang, Chuan-Yuan
    Gao, Ying-Lian
    Jiao, Cui-Na
    Liu, Jin-Xing
    Zheng, Chun-Hou
    Kong, Xiang-Zhen
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 121 - 125
  • [37] Convex Hull Convolutive Non-negative Matrix Factorization for Uncovering Temporal Patterns in Multivariate Time-Series Data
    Vaz, Colin
    Toutios, Asterios
    Narayanan, Shrikanth
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 963 - 967
  • [38] CoGAPS 3: Bayesian non-negative matrix factorization for single-cell analysis with asynchronous updates and sparse data structures
    Thomas D. Sherman
    Tiger Gao
    Elana J. Fertig
    BMC Bioinformatics, 21
  • [39] deepMc: Deep Matrix Completion for Imputation of Single-Cell RNA-seq Data
    Mongia, Aanchal
    Sengupta, Debarka
    Majumdar, Angshul
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (07) : 1011 - 1019
  • [40] CoGAPS 3: Bayesian non-negative matrix factorization for single-cell analysis with asynchronous updates and sparse data structures
    Sherman, Thomas D.
    Gao, Tiger
    Fertig, Elana J.
    BMC BIOINFORMATICS, 2020, 21 (01)