Markov Chain Ontology Analysis (MCOA)

Cited by: 8
Authors
Frost, H. Robert [1]
McCray, Alexa T. [1]
Affiliations
[1] Harvard Univ, Sch Med, Ctr Biomed Informat, Boston, MA 02115 USA
Source
BMC BIOINFORMATICS | 2012 / Volume 13
Keywords
GENE SET ANALYSIS; ENRICHMENT ANALYSIS; TERM ENRICHMENT; MODEL; TOOL
DOI
10.1186/1471-2105-13-23
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Biomedical ontologies have become an increasingly critical lens through which researchers analyze the genomic, clinical and bibliographic data that fuel scientific research. Of particular relevance are methods, such as enrichment analysis, that quantify the importance of ontology classes relative to a collection of domain data. Current analytical techniques, however, remain limited in their ability to handle many important types of structural complexity encountered in real biological systems, including class overlaps, continuously valued data, inter-instance relationships, non-hierarchical relationships between classes, semantic distance and sparse data.
Results: In this paper, we describe a methodology called Markov Chain Ontology Analysis (MCOA) and illustrate its use through an MCOA-based enrichment analysis application based on a generative model of gene activation. MCOA models the classes in an ontology, the instances from an associated data set and all directional inter-class, class-to-instance and inter-instance relationships as a single finite ergodic Markov chain. The adjusted transition probability matrix for this Markov chain enables the calculation of eigenvector values that quantify the importance of each ontology class relative to other classes and the associated data set members. On both controlled Gene Ontology (GO) data sets created with Escherichia coli, Drosophila melanogaster and Homo sapiens annotations and real gene expression data extracted from the Gene Expression Omnibus (GEO), the MCOA enrichment analysis approach provides the best performance among comparable state-of-the-art methods.
Conclusion: A methodology based on Markov chain models and network analytic metrics can help detect the relevant signal within large, highly interdependent and noisy data sets and, for applications such as enrichment analysis, has been shown to generate superior performance on both real and simulated data relative to existing state-of-the-art approaches.
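The core idea in the abstract, scoring ontology classes by an eigenvector of an adjusted transition matrix, can be illustrated with a small sketch. This is not the authors' implementation. The toy graph, the edge directions (instance to class, class to parent), the node names, and the PageRank-style teleport adjustment used here to make the chain ergodic are all assumptions chosen for illustration, since the abstract does not specify how the matrix is adjusted.

```python
def stationary_distribution(nodes, edges, damping=0.85, tol=1e-12, max_iter=10_000):
    """Approximate the stationary distribution of a Markov chain by power iteration.

    nodes: list of node names (ontology classes and data instances together).
    edges: list of (source, target, weight) directed edges.
    damping: hypothetical teleport parameter; (1 - damping) of the probability
             mass jumps uniformly, which guarantees an ergodic chain.
    """
    n = len(nodes)
    index = {name: i for i, name in enumerate(nodes)}

    # weights[i][j] holds the total weight of directed edges i -> j.
    weights = [[0.0] * n for _ in range(n)]
    for src, dst, w in edges:
        weights[index[src]][index[dst]] += w

    # Build the adjusted row-stochastic transition matrix.
    transition = []
    for row in weights:
        total = sum(row)
        if total > 0:
            base = [w / total for w in row]
        else:
            base = [1.0 / n] * n  # a node with no outgoing edges jumps uniformly
        transition.append([damping * p + (1.0 - damping) / n for p in base])

    # Power iteration: repeatedly apply pi <- pi * P until pi stops changing.
    pi = [1.0 / n] * n
    for _ in range(max_iter):
        nxt = [sum(pi[i] * transition[i][j] for i in range(n)) for j in range(n)]
        change = sum(abs(a - b) for a, b in zip(nxt, pi))
        pi = nxt
        if change < tol:
            break
    return dict(zip(nodes, pi))


# Toy data (entirely hypothetical): three genes annotated to two classes
# that share a common parent class.
nodes = ["root", "class_A", "class_B", "gene_1", "gene_2", "gene_3"]
edges = [
    ("gene_1", "class_A", 1.0),
    ("gene_2", "class_A", 1.0),
    ("gene_3", "class_B", 1.0),
    ("class_A", "root", 1.0),
    ("class_B", "root", 1.0),
]

scores = stationary_distribution(nodes, edges)
for name in sorted(scores, key=scores.get, reverse=True):
    print(f"{name}: {scores[name]:.4f}")
```

In this toy chain, class_A receives probability mass from two instances and class_B from one, so class_A ends up with the larger stationary score. Continuously valued data could be represented through the edge weights, though how MCOA itself does this is not stated in the abstract.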
Pages: 20