A Non-Negative Matrix Factorization-Based Framework for the Analysis of Multi-Class Time-Series Single-Cell RNA-Seq Data

被引:3
|
作者
Jung, Inuk [1 ]
Choi, Joungmin [2 ]
Chae, Heejoon [2 ]
机构
[1] Kyungpook Natl Univ, Dept Comp Sci & Engn, Daegu 41566, South Korea
[2] Sookmyung Womens Univ, Dept Comp Sci, Seoul 04310, South Korea
关键词
Matrix decomposition; Gene expression; Sociology; Statistics; Stem cells; Tools; Mice; multi-class; single-cell; time-series; DIFFERENTIATION; NETWORKS;
D O I
10.1109/ACCESS.2020.2977106
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The development of single-cell RNA sequencing (scRNA-seq) has enabled gene expression to be quantified at single-cell resolution. Such advancement is expected to solve important issues that bulk RNA sequencing could not fully answer, such as inferring cell population heterogeneity, genetic variability of cells, detecting rare cell types, accurately predicting cell states and their localization. However, analyzing such large scale data, especially when they are sampled at multiple time points, brings new challenges in data mining informative genes, compared to single snapshot samples. It becomes even more complicated when gene expression patterns are to be mined from time-series scRNA-seq datasets generated from multiple conditions, which will constitute a data with gene, condition and time dimensions. Here, we focused on detecting gene expression patterns that well capture the underlying biological differences between time-series scRNA-seq datasets of three different types of stem cells. The gene expression profile of 2,128 time-series scRNA-seq samples from long-term hematopoietic stem cells (LT-HSC) and two of its progenitor cell types were analyzed using our framework. We have successfully detected condition specific feature genes that were able to achieve 90.03 & x0025; classification accuracy between the three cell types. Investigating the genes and clusters detected by our framework, we found that cell cycle related genes showed significantly high variance between the three cell types. Such results and transcriptomic characters detected from our analysis were consistent with the original study. Collectively, the framework was able to successfully detect biological meaningful gene sets and expression patterns from multi-condition time-series scRNA-seq samples.
引用
收藏
页码:42342 / 42348
页数:7
相关论文
共 50 条
  • [21] CTISL: a dynamic stacking multi-class classification approach for identifying cell types from single-cell RNA-seq data
    Wang, Xiao
    Chai, Ziyi
    Li, Shaohua
    Liu, Yan
    Li, Chen
    Jiang, Yu
    Liu, Quanzhong
    BIOINFORMATICS, 2024, 40 (02)
  • [22] A kernel non-negative matrix factorization framework for single cell clustering
    Jiang, Hao
    Yi, Ming
    Zhang, Shihua
    APPLIED MATHEMATICAL MODELLING, 2021, 90 : 875 - 888
  • [23] SMURF: embedding single-cell RNA-seq data with matrix factorization preserving self-consistency
    Pu, Juhua
    Wang, Bingchen
    Liu, Xingwu
    Chen, Lingxi
    Li, Shuai Cheng
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (02)
  • [24] NMFP: a non-negative matrix factorization based preselection method to increase accuracy of identifying mRNA isoforms from RNA-seq data
    Yuting Ye
    Jingyi Jessica Li
    BMC Genomics, 17
  • [25] A framework for intelligent Twitter data analysis with non-negative matrix factorization
    Casalino, Gabriella
    Castiello, Ciro
    Del Buono, Nicoletta
    Mencar, Corrado
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2018, 14 (03) : 334 - 356
  • [26] RENGE infers gene regulatory networks using time-series single-cell RNA-seq data with CRISPR perturbations
    Masato Ishikawa
    Seiichi Sugino
    Yoshie Masuda
    Yusuke Tarumoto
    Yusuke Seto
    Nobuko Taniyama
    Fumi Wagai
    Yuhei Yamauchi
    Yasuhiro Kojima
    Hisanori Kiryu
    Kosuke Yusa
    Mototsugu Eiraku
    Atsushi Mochizuki
    Communications Biology, 6
  • [27] RENGE infers gene regulatory networks using time-series single-cell RNA-seq data with CRISPR perturbations
    Ishikawa, Masato
    Sugino, Seiichi
    Masuda, Yoshie
    Tarumoto, Yusuke
    Seto, Yusuke
    Taniyama, Nobuko
    Wagai, Fumi
    Yamauchi, Yuhei
    Kojima, Yasuhiro
    Kiryu, Hisanori
    Yusa, Kosuke
    Eiraku, Mototsugu
    Mochizuki, Atsushi
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [28] EXPLICIT BEAT STRUCTURE MODELING FOR NON-NEGATIVE MATRIX FACTORIZATION-BASED MULTIPITCH ANALYSIS
    Ochiai, Kazuki
    Kameoka, Hirokazu
    Sagayama, Shigeki
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 133 - 136
  • [29] Analysis of Single-Cell RNA-seq Data by Clustering Approaches
    Zhu, Xiaoshu
    Li, Hong-Dong
    Guo, Lilu
    Wu, Fang-Xiang
    Wang, Jianxin
    CURRENT BIOINFORMATICS, 2019, 14 (04) : 314 - 322
  • [30] SCell: integrated analysis of single-cell RNA-seq data
    Diaz, Aaron
    Liu, Siyuan J.
    Sandoval, Carmen
    Pollen, Alex
    Nowakowski, Tom J.
    Lim, Daniel A.
    Kriegstein, Arnold
    BIOINFORMATICS, 2016, 32 (14) : 2219 - 2220