Classification of Single-Cell Gene Expression Trajectories from Incomplete and Noisy Data

Cited by: 6
Authors
Karbalayghareh, Alireza [1]
Braga-Neto, Ulisses [1]
Dougherty, Edward R. [1]
Affiliations
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
Funding
U.S. National Science Foundation
Keywords
Gene regulatory network; probabilistic Boolean network; trajectory classification; Bayes classifier; expectation maximization; hidden Markov model; partially observed Boolean dynamical system; single-cell gene expression trajectory
DOI
10.1109/TCBB.2017.2763946
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
This paper studies classification of gene-expression trajectories coming from two classes, healthy and mutated (cancerous), using Boolean networks with perturbation (BNps) to model the dynamics of each class at the state level. Each class has its own BNp, which is partially known based on gene pathways. We employ a Gaussian model at the observation level to describe the expression values of the genes given the hidden binary states at each time point. We use expectation maximization (EM) to learn the BNps and the unknown model parameters, derive closed-form updates for the parameters, and propose a learning algorithm. After learning, a plug-in Bayes classifier is used to classify unlabeled trajectories, which can have missing data. Measuring gene expressions at different times yields trajectories only when measurements come from a single cell. In multiple-cell scenarios, the expression values are averages over many cells with possibly different states. Via the central-limit theorem, we propose another model for expression data in multiple-cell scenarios. Simulations demonstrate that single-cell trajectory data can outperform multiple-cell average expression data in terms of classification error, especially in high-noise situations. We also consider data generated via a mammalian cell-cycle network, both in its wild-type form and with a common mutation affecting p27.
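
The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two three-gene networks, the perturbation probability `p`, the Gaussian parameters `mu0`, `mu1`, `sigma`, the uniform initial state distribution, and all function names are assumptions made for illustration. The EM learning step is omitted, so the class parameters are treated as already known, and missing measurements are encoded as NaN.

```python
import itertools
import numpy as np

def bnp_transition_matrix(update_fns, p):
    """Transition matrix of a Boolean network with perturbation (BNp):
    after the Boolean update, each gene flips independently with probability p."""
    n = len(update_fns)
    states = np.array(list(itertools.product([0, 1], repeat=n)))
    A = np.zeros((2 ** n, 2 ** n))
    for i, x in enumerate(states):
        fx = np.array([f(x) for f in update_fns])
        for j, y in enumerate(states):
            flips = int(np.sum(fx != y))
            A[i, j] = p ** flips * (1 - p) ** (n - flips)
    return states, A

def simulate(states, A, mu0, mu1, sigma, T, rng):
    """Draw one noisy trajectory of length T from the hidden Boolean chain."""
    idx = rng.integers(len(states))  # uniform initial state (assumption)
    Y = []
    for _ in range(T):
        x = states[idx]
        Y.append(np.where(x == 1, mu1, mu0) + sigma * rng.normal(size=x.size))
        idx = rng.choice(len(states), p=A[idx])
    return np.array(Y)

def log_likelihood(Y, states, A, mu0, mu1, sigma):
    """Scaled forward algorithm; NaN entries of Y are skipped as missing."""
    means = np.where(states == 1, mu1, mu0)            # (n_states, n_genes)
    alpha = np.full(len(states), 1.0 / len(states))    # uniform prior (assumption)
    total = 0.0
    for k, y in enumerate(Y):
        if k > 0:
            alpha = alpha @ A                          # prediction step
        seen = ~np.isnan(y)
        resid = (y[seen] - means[:, seen]) / sigma
        log_b = (-0.5 * np.sum(resid ** 2, axis=1)
                 - seen.sum() * np.log(sigma * np.sqrt(2 * np.pi)))
        m = log_b.max()
        alpha = alpha * np.exp(log_b - m)              # update step
        c = alpha.sum()
        total += np.log(c) + m
        alpha /= c
    return total

def classify(Y, models, priors):
    """Plug-in Bayes rule: pick the class with the largest log posterior score."""
    scores = [np.log(pi) + log_likelihood(Y, *m) for m, pi in zip(models, priors)]
    return int(np.argmax(scores))

# Hypothetical toy networks (not the mammalian cell-cycle network of the paper).
healthy = [lambda x: int(not x[2]), lambda x: int(x[0]), lambda x: int(x[0] and x[1])]
mutated = [lambda x: int(not x[2]), lambda x: int(x[0]), lambda x: 0]  # gene 3 silenced

rng = np.random.default_rng(0)
states, A_h = bnp_transition_matrix(healthy, p=0.05)
_, A_m = bnp_transition_matrix(mutated, p=0.05)
models = [(states, A_h, 0.0, 1.0, 0.3), (states, A_m, 0.0, 1.0, 0.3)]

Y = simulate(*models[1], T=20, rng=rng)
Y[rng.random(Y.shape) < 0.2] = np.nan   # drop about 20% of the measurements
print(classify(Y, models, priors=[0.5, 0.5]))
```

Skipping NaN entries in the emission term is one simple way to handle incomplete trajectories: a time point with no observed genes contributes a factor of one, so the forward recursion reduces to a pure prediction step there.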
Pages: 193-207
Number of pages: 15