M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data

Cited by: 8
Authors
Zhang, Yu [1 ,2 ]
Wan, Changlin [2 ,3 ]
Wang, Pengcheng [4 ]
Chang, Wennan [2 ,3 ]
Huo, Yan [2 ,5 ]
Chen, Jian [6 ]
Ma, Qin [7 ]
Cao, Sha [2 ,8 ]
Zhang, Chi [2 ,3 ,9 ]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, MOE Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[2] Indiana Univ Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
[3] Purdue Univ, Dept Elect Comp Engn, W Lafayette, IN 47907 USA
[4] Indiana Univ Purdue Univ, Dept Comp Sci, Indianapolis, IN 46202 USA
[5] China Med Univ, Sch Fundamental Sci, Shenyang 110122, Peoples R China
[6] Tongji Univ, Shanghai Pulm Hosp, Sch Med, Shanghai 200082, Peoples R China
[7] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[8] Indiana Univ Sch Med, Dept Biostat, Indianapolis, IN 46202 USA
[9] Dept Med & Mol Genet, Indianapolis, IN 46202 USA
Funding
National Natural Science Foundation of China;
Keywords
Single cell RNA-seq; Multimodality; Differential gene expression analysis; Drop-seq; Left truncated mixture Gaussian;
DOI
10.1186/s12859-019-3243-1
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Various statistical models have been developed to model single-cell RNA-seq expression profiles, capture their multimodality, and conduct differential gene expression tests. However, for expression data generated by different experimental designs and platforms, there is currently no means of determining the most appropriate statistical model. Results: We developed an R package, Multi-Modal Model Selection (M3S), for gene-wise selection of the most appropriate multi-modality statistical model and for downstream analysis of single-cell or large-scale bulk tissue transcriptomic data. M3S features (1) gene-wise selection of the most parsimonious model, among the 11 most commonly used ones, that best fits the expression distribution of the gene, (2) parameter estimation for the selected model, and (3) a differential gene expression test based on the selected model. Conclusion: A comprehensive evaluation suggested that M3S can accurately capture multimodality in simulated and real single-cell data. The open-source package is available through GitHub at https://github.com/zy26/M3S.
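The gene-wise selection described in the abstract can be illustrated with a small sketch. It is not the M3S implementation (M3S is an R package that compares 11 models). It is a minimal Python example that compares only two candidates, a single Gaussian and a two-component Gaussian mixture, and picks the one with the lower BIC. The use of BIC as the parsimony criterion, the EM fitting routine, and all function names here are assumptions made for illustration.

```python
import math
from statistics import NormalDist


def normal_pdf(x, mu, var):
    """Density of N(mu, var) at x."""
    return math.exp(-(x - mu) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)


def fit_one_gaussian(xs):
    """Maximum-likelihood fit of one Gaussian; returns (log-likelihood, n_params)."""
    n = len(xs)
    mu = sum(xs) / n
    var = max(sum((x - mu) ** 2 for x in xs) / n, 1e-6)
    loglik = sum(math.log(normal_pdf(x, mu, var)) for x in xs)
    return loglik, 2  # mean, variance


def fit_two_gaussians(xs, iters=200):
    """EM fit of a two-component Gaussian mixture; returns (log-likelihood, n_params)."""
    xs = sorted(xs)
    n = len(xs)
    low, high = xs[: n // 2], xs[n // 2:]
    # Initialise the means from the lower and upper halves of the data.
    mus = [sum(low) / len(low), sum(high) / len(high)]
    mean_all = sum(xs) / n
    var_all = max(sum((x - mean_all) ** 2 for x in xs) / n, 1e-6)
    variances = [var_all, var_all]
    weights = [0.5, 0.5]
    for _ in range(iters):
        # E-step: responsibility of each component for each observation.
        resp = []
        for x in xs:
            p = [weights[k] * normal_pdf(x, mus[k], variances[k]) for k in (0, 1)]
            total = (p[0] + p[1]) or 1e-300
            resp.append((p[0] / total, p[1] / total))
        # M-step: update weights, means, and variances (with a variance floor).
        for k in (0, 1):
            nk = max(sum(r[k] for r in resp), 1e-12)
            weights[k] = nk / n
            mus[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            variances[k] = max(
                sum(r[k] * (x - mus[k]) ** 2 for r, x in zip(resp, xs)) / nk, 1e-6
            )
    loglik = sum(
        math.log(
            max(sum(weights[k] * normal_pdf(x, mus[k], variances[k]) for k in (0, 1)), 1e-300)
        )
        for x in xs
    )
    return loglik, 5  # two means, two variances, one free weight


def bic(loglik, n_params, n_obs):
    """Bayesian information criterion; lower is better."""
    return n_params * math.log(n_obs) - 2 * loglik


def select_model(xs):
    """Return the name of the candidate model with the lowest BIC for one gene."""
    candidates = {
        "one Gaussian": fit_one_gaussian(xs),
        "two-Gaussian mixture": fit_two_gaussians(xs),
    }
    return min(candidates, key=lambda name: bic(*candidates[name], len(xs)))


# Deterministic toy "expression" vectors built from normal quantiles (hypothetical data).
nd = NormalDist()
unimodal = [nd.inv_cdf((i + 0.5) / 200) for i in range(200)]
base = [nd.inv_cdf((i + 0.5) / 100) for i in range(100)]
bimodal = base + [x + 6 for x in base]

print(select_model(unimodal))
print(select_model(bimodal))
```

In a gene-wise setting, `select_model` would be called once per gene on that gene's expression vector across cells. The BIC penalty is what makes the choice parsimonious: the mixture is preferred only when its better fit outweighs its three extra parameters.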
Pages: 5