High-throughput sequencing (HTS) technologies have been instrumental in investigating biological questions at the bulk and single-cell levels. Comparative analysis of two HTS data sets often relies on testing the statistical significance for the difference of two negative binomial distributions (DOTNB). Although negative binomial distributions are well studied, the theoretical results for DOTNB remain largely unexplored. Here, we derive basic analytical results for DOTNB and examine its asymptotic properties. As a state-of-the-art application of DOTNB, we introduce DEGage, a computational method for detecting differentially expressed genes (DEGs) in scRNA-seq data. DEGage calculates the mean of the sample-wise differences of gene expression levels as the test statistic and determines significant differential expression by computing the P-value with DOTNB. Extensive validation using simulated and real scRNA-seq data sets demonstrates that DEGage outperforms five popular DEG analysis tools: DEGseq2, DEsingle, edgeR, Monocle3, and scDD. DEGage is robust against high dropout levels and exhibits superior sensitivity when applied to balanced and imbalanced data sets, even with small sample sizes. We utilize DEGage to analyze prostate cancer scRNA-seq data sets and identify marker genes for 17 cell types. Furthermore, we apply DEGage to scRNA-seq data sets of mouse neurons with and without fear memory and reveal eight potential memory-related genes overlooked in previous analyses. The theoretical results and supporting software for DOTNB can be widely applied to comparative analyses of dispersed count data in HTS and broad research questions.
机构:
Jiangsu Elect Power Informat Technol Co Ltd, Res & Dev Dept, Nanjing, Peoples R ChinaJiangsu Elect Power Informat Technol Co Ltd, Res & Dev Dept, Nanjing, Peoples R China
PengWu
JingTan
论文数: 0引用数: 0
h-index: 0
机构:
Jiangsu Elect Power Co Ltd, Technol Informat Dept, Nanjing, Peoples R ChinaJiangsu Elect Power Informat Technol Co Ltd, Res & Dev Dept, Nanjing, Peoples R China
JingTan
2018 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD),
2018,
: 77
-
82
机构:
Univ Tokyo, Inst Solid State Phys, Kashiwa, Chiba 2778581, JapanUniv Tokyo, Inst Solid State Phys, Kashiwa, Chiba 2778581, Japan
Motoyama, Yuichi
Yoshimi, Kazuyoshi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Tokyo, Inst Solid State Phys, Kashiwa, Chiba 2778581, JapanUniv Tokyo, Inst Solid State Phys, Kashiwa, Chiba 2778581, Japan
Yoshimi, Kazuyoshi
Mochizuki, Izumi
论文数: 0引用数: 0
h-index: 0
机构:
High Energy Accelerator Res Org KEK, Inst Mat Struct Sci, Slow Positron Facil, Oho 1-1, Tsukuba, Ibaraki 3050801, JapanUniv Tokyo, Inst Solid State Phys, Kashiwa, Chiba 2778581, Japan
Mochizuki, Izumi
Iwamoto, Harumichi
论文数: 0引用数: 0
h-index: 0
机构:
Tottori Univ, Fac Engn, Dept Mech & Phys Engn, Tottori, Tottori 6808552, JapanUniv Tokyo, Inst Solid State Phys, Kashiwa, Chiba 2778581, Japan
Iwamoto, Harumichi
Ichinose, Hayato
论文数: 0引用数: 0
h-index: 0
机构:
Tottori Univ, Grad Sch Sustainabil Sci, Dept Engn, Tottori, Tottori 6808552, JapanUniv Tokyo, Inst Solid State Phys, Kashiwa, Chiba 2778581, Japan