An NMF-based approach to discover overlooked differentially expressed gene regions from single-cell RNA-seq data

被引:3
|
作者
Matsumoto, Hirotaka [1 ,2 ]
Hayashi, Tetsutaro [2 ]
Ozaki, Haruka [3 ,4 ]
Tsuyuzaki, Koki [2 ]
Umeda, Mana [2 ]
Iida, Tsuyoshi [5 ]
Nakamura, Masaya [5 ]
Okano, Hideyuki [6 ]
Nikaido, Itoshi [2 ,7 ]
机构
[1] RIKEN, Med Image Anal Team, Ctr Adv Intelligence Project, Chuo Ku, Nihonbashi 1 Chome Mitsui Bldg 15F, Tokyo 1030027, Japan
[2] RIKEN, Lab Bioinformat Res, Ctr Biosyst Dynam Res, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
[3] Univ Tsukuba, Ctr Artificial Intelligence Res, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058577, Japan
[4] Univ Tsukuba, Fac Med, Bioinformat Lab, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058577, Japan
[5] Keio Univ, Dept Orthopaed Surg, Sch Med, Shinjuku Ku, 35 Sinanomachi, Tokyo 1608582, Japan
[6] Keio Univ, Dept Physiol, Sch Med, Shinjuku Ku, 35 Sinanomachi, Tokyo 1608582, Japan
[7] Univ Tsukuba, Sch Integrat & Global Majors SIGMA, Masters Doctoral Program Life Sci Innovat T LSI, Bioinformat Course, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
基金
日本科学技术振兴机构;
关键词
ALTERNATIVE POLYADENYLATION; QUANTIFICATION; DIVERSITY; BIAS;
D O I
10.1093/nargab/lqz020
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single-cell RNA sequencing has enabled researchers to quantify the transcriptomes of individual cells, infer cell types and investigate differential expression among cell types, which will lead to a better understanding of the regulatory mechanisms of cell states. Transcript diversity caused by phenomena such as aberrant splicing events have been revealed, and differential expression of previously unannotated transcripts might be overlooked by annotation-based analyses. Accordingly, we have developed an approach to discover overlooked differentially expressed (DE) gene regions that complements annotation-based methods. Our algorithm decomposes mapped count data matrix for a gene region using non-negative matrix factorization, quantifies the differential expression level based on the decomposed matrix, and compares the differential expression level based on annotation-based approach to discover previously unannotated DE transcripts. We performed single-cell RNA sequencing for human neural stem cells and applied our algorithm to the dataset. We also applied our algorithm to two public single-cell RNA sequencing datasets correspond to mouse ES and primitive endoderm cells, and human preimplantation embryos. As a result, we discovered several intriguing DE transcripts, including a transcript related to the modulation of neural stem/progenitor cell differentiation.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Tumor genetic analysis from single-cell RNA-seq data
    Nawy, Tal
    NATURE METHODS, 2018, 15 (07) : 571 - 571
  • [32] Robust identification of differentially expressed genes from RNA-seq data
    Shahjaman, Md
    Mollah, Md Manir Hossain
    Rahman, Md Rezanur
    Islam, S. M. Shahinul
    Mollah, Md Nurul Haque
    GENOMICS, 2020, 112 (02) : 2000 - 2010
  • [33] Tumor genetic analysis from single-cell RNA-seq data
    Tal Nawy
    Nature Methods, 2018, 15 : 571 - 571
  • [34] Comparison of Gene Selection Methods for Clustering Single-cell RNA-seq Data
    Zhu, Xiaoshu
    Wang, Jianxin
    Li, Rongruan
    Peng, Xiaoqing
    CURRENT BIOINFORMATICS, 2023, 18 (01) : 1 - 11
  • [35] FastCount: A Fast Gene Count Software for Single-cell RNA-seq Data
    Liu, Jinpeng
    Liu, Xinan
    Yu, Ye
    Wang, Chi
    Liu, Jinze
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [36] Gene Selection for Single-cell RNA-seq Data Based on Information Gain and Genetic Algorithm
    Zhang, Jie
    Feng, Junhong
    2018 14TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2018, : 57 - 61
  • [37] A probabilistic gene expression barcode for annotation of cell types from single-cell RNA-seq data
    Grabski, Isabella N.
    Irizarry, Rafael A.
    BIOSTATISTICS, 2022, 23 (04) : 1150 - 1164
  • [38] Evaluation of tools for highly variable gene discovery from single-cell RNA-seq data
    Yip, Shun H.
    Sham, Pak Chung
    Wang, Junwen
    BRIEFINGS IN BIOINFORMATICS, 2019, 20 (04) : 1583 - 1589
  • [39] Clustering single-cell RNA-seq data with a model-based deep learning approach
    Tian, Tian
    Wan, Ji
    Song, Qi
    Wei, Zhi
    NATURE MACHINE INTELLIGENCE, 2019, 1 (04) : 191 - 198
  • [40] Clustering single-cell RNA-seq data with a model-based deep learning approach
    Tian Tian
    Ji Wan
    Qi Song
    Zhi Wei
    Nature Machine Intelligence, 2019, 1 : 191 - 198