Markov chain Monte Carlo for active module identification problem

被引:4
|
作者
Alexeev, Nikita [1 ]
Isomurodov, Javlon [1 ,2 ]
Sukhov, Vladimir [1 ,2 ]
Korotkevich, Gennady [1 ]
Sergushichev, Alexey [1 ]
机构
[1] ITMO Univ, Comp Technol Lab, St Petersburg, Russia
[2] JetBrains Res, St Petersburg, Russia
关键词
Protein-protein interaction; Active module; Markov chain Monte Carlo; Gene expression; Soft classification; FUNCTIONAL MODULES; NETWORK;
D O I
10.1186/s12859-020-03572-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundIntegrative network methods are commonly used for interpretation of high-throughput experimental biological data: transcriptomics, proteomics, metabolomics and others. One of the common approaches is finding a connected subnetwork of a global interaction network that best encompasses significant individual changes in the data and represents a so-called active module. Usually methods implementing this approach find a single subnetwork and thus solve a hard classification problem for vertices. This subnetwork inherently contains erroneous vertices, while no instrument is provided to estimate the confidence level of any particular vertex inclusion. To address this issue, in the current study we consider the active module problem as a soft classification problem.ResultsWe propose a method to estimate probabilities of each vertex to belong to the active module based on Markov chain Monte Carlo (MCMC) subnetwork sampling. As an example of the performance of our method on real data, we run it on two gene expression datasets. For the first many-replicate expression dataset we show that the proposed approach is consistent with an existing resampling-based method. On the second dataset the jackknife resampling method is inapplicable due to the small number of biological replicates, but the MCMC method can be run and shows high classification performance.ConclusionsThe proposed method allows to estimate the probability that an individual vertex belongs to the active module as well as the false discovery rate (FDR) for a given set of vertices. Given the estimated probabilities, it becomes possible to provide a connected subgraph in a consistent manner for any given FDR level: no vertex can disappear when the FDR level is relaxed. We show, on both simulated and real datasets, that the proposed method has good computational performance and high classification accuracy.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Markov chain Monte Carlo for active module identification problem
    Nikita Alexeev
    Javlon Isomurodov
    Vladimir Sukhov
    Gennady Korotkevich
    Alexey Sergushichev
    [J]. BMC Bioinformatics, 21
  • [2] ACCELERATING MARKOV CHAIN MONTE CARLO WITH ACTIVE SUBSPACES
    Constantine, Paul G.
    Kent, Carson
    Bui-Thanh, Tan
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2016, 38 (05): : A2779 - A2805
  • [3] Markov Chain Monte Carlo
    Henry, Ronnie
    [J]. EMERGING INFECTIOUS DISEASES, 2019, 25 (12) : 2298 - 2298
  • [4] A Markov chain Monte Carlo method for the groundwater inverse problem
    Lu, ZM
    Higdon, D
    Zhang, DX
    [J]. Computational Methods in Water Resources, Vols 1 and 2, 2004, 55 : 1273 - 1283
  • [5] A Bayesian Markov chain Monte Carlo solution of the bilinear problem
    Ochs, AF
    Stoyanova, RS
    Brown, TR
    Rooney, WD
    Springer, CS
    [J]. BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2001, 567 : 274 - 284
  • [6] Markov chain Monte Carlo in statistical mechanics: The problem of accuracy
    Mignani, S
    Rosa, R
    [J]. TECHNOMETRICS, 2001, 43 (03) : 347 - 355
  • [7] Markov Chain Monte Carlo methods in human skeletal identification
    Konigsberg, Lyle W.
    Frankenberg, Susan R.
    [J]. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 2018, 165 : 144 - 144
  • [8] Markov chain monte carlo defect identification in nde images
    Dogandzic, Aleksandar
    Zhang, Benhong
    [J]. REVIEW OF PROGRESS IN QUANTITATIVE NONDESTRUCTIVE EVALUATION, VOLS 26A AND 26B, 2007, 894 : 709 - +
  • [9] System identification using evolutionary Markov chain Monte Carlo
    Zhang, B.-T.
    Cho, D.-Y.
    [J]. 2001, Elsevier (47)
  • [10] System identification using evolutionary Markov chain Monte Carlo
    Zhang, BT
    Cho, DY
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2001, 47 (07) : 587 - 599