De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet

被引:3
|
作者
Winkler, Sebastian [1 ,2 ]
Winkler, Ivana [2 ,3 ,4 ]
Figaschewski, Mirjam [1 ]
Tiede, Thorsten [1 ]
Nordheim, Alfred [3 ,5 ]
Kohlbacher, Oliver [1 ,6 ,7 ]
机构
[1] Univ Tubingen, Dept Comp Sci, Appl Bioinformat, Tubingen, Germany
[2] Int Max Planck Res Sch IMPRS Mol Organism, Tubingen, Germany
[3] Univ Tubingen, Interfac Inst Cell Biol IFIZ, Tubingen, Germany
[4] German Canc Res Ctr, German Canc Consortium DKTK, Heidelberg, Germany
[5] Leibniz Inst Aging FLI, Jena, Germany
[6] Univ Tubingen, Inst Bioinfor Mat & Med Informat, Tubingen, Germany
[7] Univ Hosp Tuebingen, Translat Bioinformat, Tubingen, Germany
关键词
Biomolecular networks; Fractional integer programming; De-novo subnetwork enrichment; Functional enrichment; Omics data; IDENTIFYING FUNCTIONAL MODULES; SPLEEN TYROSINE KINASE; HEPATOCELLULAR-CARCINOMA; TRANSCRIPTIONAL REGULATION; THERAPEUTIC TARGET; GENE-EXPRESSION; PATHWAY; NETWORK; CATENIN; ALGORITHM;
D O I
10.1186/s12859-022-04670-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background With a growing amount of (multi-)omics data being available, the extraction of knowledge from these datasets is still a difficult problem. Classical enrichment-style analyses require predefined pathways or gene sets that are tested for significant deregulation to assess whether the pathway is functionally involved in the biological process under study. De novo identification of these pathways can reduce the bias inherent in predefined pathways or gene sets. At the same time, the definition and efficient identification of these pathways de novo from large biological networks is a challenging problem. Results We present a novel algorithm, DeRegNet, for the identification of maximally deregulated subnetworks on directed graphs based on deregulation scores derived from (multi-)omics data. DeRegNet can be interpreted as maximum likelihood estimation given a certain probabilistic model for de-novo subgraph identification. We use fractional integer programming to solve the resulting combinatorial optimization problem. We can show that the approach outperforms related algorithms on simulated data with known ground truths. On a publicly available liver cancer dataset we can show that DeRegNet can identify biologically meaningful subgraphs suitable for patient stratification. DeRegNet can also be used to find explicitly multi-omics subgraphs which we demonstrate by presenting subgraphs with consistent methylation-transcription patterns. DeRegNet is freely available as open-source software. Conclusion The proposed algorithmic framework and its available implementation can serve as a valuable heuristic hypothesis generation tool contextualizing omics data within biomolecular networks.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] Identification of colon cancer subtypes based on multi-omics data-construction of methylation markers for immunotherapy
    Xu, Benjie
    Lian, Jie
    Pang, Xiangyi
    Gu, Yue
    Zhu, Jiahao
    Zhang, Yan
    Lu, Haibo
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [32] Construction of a prognostic model and identification of key genes in liver hepatocellular carcinoma based on multi-omics data
    Kun Tang
    Mingjiang Liu
    Cuisheng Zhang
    Scientific Reports, 15 (1)
  • [33] Making multi-omics data accessible to researchers
    Ana Conesa
    Stephan Beck
    Scientific Data, 6
  • [34] A cloud solution for multi-omics data integration
    Tordini, Fabio
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 559 - 566
  • [35] Making multi-omics data accessible to researchers
    Conesa, Ana
    Beck, Stephan
    SCIENTIFIC DATA, 2019, 6 (1)
  • [36] Integrative clustering methods for multi-omics data
    Zhang, Xiaoyu
    Zhou, Zhenwei
    Xu, Hanfei
    Liu, Ching-Ti
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (03)
  • [37] Towards multi-omics synthetic data integration
    Selvarajoo, Kumar
    Maurer-Stroh, Sebastian
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [38] Machine learning for the analysis of multi-omics data
    Sun, Yanni
    METHODS, 2021, 189 : 1 - 2
  • [39] Integrating multi-omics data for crop improvement
    Scossa, Federico
    Alseekh, Saleh
    Fernie, Alisdair R.
    JOURNAL OF PLANT PHYSIOLOGY, 2021, 257
  • [40] Representation Learning for the Clustering of Multi-Omics Data
    Viaud, Gautier
    Mayilvahanan, Prasanna
    Cournede, Paul-Henry
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 135 - 145