An NMF-based approach to discover overlooked differentially expressed gene regions from single-cell RNA-seq data

被引:3
|
作者
Matsumoto, Hirotaka [1 ,2 ]
Hayashi, Tetsutaro [2 ]
Ozaki, Haruka [3 ,4 ]
Tsuyuzaki, Koki [2 ]
Umeda, Mana [2 ]
Iida, Tsuyoshi [5 ]
Nakamura, Masaya [5 ]
Okano, Hideyuki [6 ]
Nikaido, Itoshi [2 ,7 ]
机构
[1] RIKEN, Med Image Anal Team, Ctr Adv Intelligence Project, Chuo Ku, Nihonbashi 1 Chome Mitsui Bldg 15F, Tokyo 1030027, Japan
[2] RIKEN, Lab Bioinformat Res, Ctr Biosyst Dynam Res, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
[3] Univ Tsukuba, Ctr Artificial Intelligence Res, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058577, Japan
[4] Univ Tsukuba, Fac Med, Bioinformat Lab, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058577, Japan
[5] Keio Univ, Dept Orthopaed Surg, Sch Med, Shinjuku Ku, 35 Sinanomachi, Tokyo 1608582, Japan
[6] Keio Univ, Dept Physiol, Sch Med, Shinjuku Ku, 35 Sinanomachi, Tokyo 1608582, Japan
[7] Univ Tsukuba, Sch Integrat & Global Majors SIGMA, Masters Doctoral Program Life Sci Innovat T LSI, Bioinformat Course, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
基金
日本科学技术振兴机构;
关键词
ALTERNATIVE POLYADENYLATION; QUANTIFICATION; DIVERSITY; BIAS;
D O I
10.1093/nargab/lqz020
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single-cell RNA sequencing has enabled researchers to quantify the transcriptomes of individual cells, infer cell types and investigate differential expression among cell types, which will lead to a better understanding of the regulatory mechanisms of cell states. Transcript diversity caused by phenomena such as aberrant splicing events have been revealed, and differential expression of previously unannotated transcripts might be overlooked by annotation-based analyses. Accordingly, we have developed an approach to discover overlooked differentially expressed (DE) gene regions that complements annotation-based methods. Our algorithm decomposes mapped count data matrix for a gene region using non-negative matrix factorization, quantifies the differential expression level based on the decomposed matrix, and compares the differential expression level based on annotation-based approach to discover previously unannotated DE transcripts. We performed single-cell RNA sequencing for human neural stem cells and applied our algorithm to the dataset. We also applied our algorithm to two public single-cell RNA sequencing datasets correspond to mouse ES and primitive endoderm cells, and human preimplantation embryos. As a result, we discovered several intriguing DE transcripts, including a transcript related to the modulation of neural stem/progenitor cell differentiation.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A Markov random field model-based approach for differentially expressed gene detection from single-cell RNA-seq data
    Zhu, Biqing
    Li, Hongyu
    Zhang, Le
    Chandra, Sreeganga S.
    Zhao, Hongyu
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [2] MarcoPolo: a method to discover differentially expressed genes in single-cell RNA-seq data without depending on prior clustering
    Kim, Chanwoo
    Lee, Hanbin
    Jeong, Juhee
    Jung, Keehoon
    Han, Buhm
    NUCLEIC ACIDS RESEARCH, 2022, 50 (12)
  • [3] Robustness of single-cell RNA-seq for identifying differentially expressed genes
    Yong Liu
    Jing Huang
    Rajan Pandey
    Pengyuan Liu
    Bhavika Therani
    Qiongzi Qiu
    Sridhar Rao
    Aron M. Geurts
    Allen W. Cowley
    Andrew S. Greene
    Mingyu Liang
    BMC Genomics, 24
  • [4] Robustness of single-cell RNA-seq for identifying differentially expressed genes
    Liu, Yong
    Huang, Jing
    Pandey, Rajan
    Liu, Pengyuan
    Therani, Bhavika
    Qiu, Qiongzi
    Rao, Sridhar
    Geurts, Aron M.
    Cowley Jr, Allen W.
    Greene, Andrew S.
    Liang, Mingyu
    BMC GENOMICS, 2023, 24 (01)
  • [5] Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
    Thanh Minh Nguyen
    Jacob John Jeevan
    Nuo Xu
    Jake Y.Chen
    Genomics,Proteomics & Bioinformatics, 2021, (03) : 493 - 503
  • [6] Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
    Thanh Minh Nguyen
    Jeevan, Jacob John
    Xu, Nuo
    Chen, Jake Y.
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2021, 19 (03) : 493 - 503
  • [7] scENT for Revealing Gene Clusters From Single-Cell RNA-Seq Data
    Rao, Fan
    Chen, Minghan
    Yang, Defu
    Morrell, Bess
    Song, Qianqian
    Zhu, Wentao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (03) : 2266 - 2277
  • [8] Phylogenetic inference from single-cell RNA-seq data
    Xuan Liu
    Jason I. Griffiths
    Isaac Bishara
    Jiayi Liu
    Andrea H. Bild
    Jeffrey T. Chang
    Scientific Reports, 13
  • [9] Phylogenetic inference from single-cell RNA-seq data
    Liu, Xuan
    Griffiths, Jason I.
    Bishara, Isaac
    Liu, Jiayi
    Bild, Andrea H.
    Chang, Jeffrey T.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [10] An active learning approach for clustering single-cell RNA-seq data
    Lin, Xiang
    Liu, Haoran
    Wei, Zhi
    Roy, Senjuti Basu
    Gao, Nan
    LABORATORY INVESTIGATION, 2022, 102 (03) : 227 - 235