Dense Subgraphs with Restrictions and Applications to Gene Annotation Graphs

被引:0
|
作者
Saha, Barna [1 ]
Hoch, Allison [2 ]
Khuller, Samir [3 ]
Raschid, Louiqa [4 ]
Zhang, Xiao-Ning [5 ,6 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[3] Univ Maryland, Dept Comp Sci & UMIACS, College Pk, MD 20742 USA
[4] Univ Maryland, UMIACS, Robert H. Smith Sch Business, College Pk, MD 20742 USA
[5] St Bonaventure Univ, Dept Biol, St Bonaventure, NY 14778 USA
[6] Univ Maryland, Dept Cell Biol & Mol Genet, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
FUNCTIONAL MODULES; PROTEIN; ONTOLOGY;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In this paper, we focus on finding complex annotation patterns representing novel and interesting hypotheses from gene annotation data. We define a generalization of the densest subgraph problem by adding an additional distance restriction (defined by a separate metric) to the nodes of the subgraph. We show that while this generalization makes the problem NP-hard for arbitrary metrics, when the metric comes from the distance metric of a tree, or an interval graph, the problem can be solved optimally in polynomial time. We also show that the densest subgraph problem with a specified subset of vertices that have to be included in the solution can be solved optimally in polynomial time. In addition, we consider other extensions when not just one solution needs to be found, but we wish to list all subgraphs of almost maximum density as well. We apply this method to a dataset of genes and their annotations obtained from The Arabidopsis Information Resource (TAIR). A user evaluation confirms that the patterns found in the distance restricted densest subgraph for a dataset of photomorphogenesis genes are indeed validated in the literature; a control dataset validates that these are not random patterns. Interestingly, the complex annotation patterns potentially lead to new and as yet unknown hypotheses. We perform experiments to determine the properties of the dense subgraphs, as we vary parameters, including the number of genes and the distance.
引用
收藏
页码:456 / +
页数:3
相关论文
共 50 条
  • [21] Dense Induced Bipartite Subgraphs in Triangle-Free Graphs
    Matthew Kwan
    Shoham Letzter
    Benny Sudakov
    Tuan Tran
    Combinatorica, 2020, 40 : 283 - 305
  • [22] Mining Highly Reliable Dense Subgraphs from Uncertain Graphs
    Lu Yihong
    Huang Ruizhi
    Huang Decai
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (06): : 2986 - 2999
  • [23] Complete r-partite subgraphs of dense r-graphs
    Nikiforov, Vladimir
    DISCRETE MATHEMATICS, 2009, 309 (13) : 4326 - 4331
  • [24] Spanning 3-colourable subgraphs of small bandwidth in dense graphs
    Boettcher, Julia
    Schacht, Mathias
    Taraz, Anusch
    JOURNAL OF COMBINATORIAL THEORY SERIES B, 2008, 98 (04) : 752 - 777
  • [25] COMPLETE SUBGRAPHS OF BIPARTITE GRAPHS AND APPLICATIONS TO TRACE LANGUAGES
    GUAIANA, G
    RESTIVO, A
    SALEMI, S
    RAIRO-INFORMATIQUE THEORIQUE ET APPLICATIONS-THEORETICAL INFORMATICS AND APPLICATIONS, 1990, 24 (04): : 409 - 417
  • [26] Small dense subgraphs of polarity graphs and the extremal number for the 4-cycle
    Tait, Michael
    Timmons, Craig
    AUSTRALASIAN JOURNAL OF COMBINATORICS, 2015, 63 : 107 - 114
  • [27] Many disjoint dense subgraphs versus large k-connected subgraphs in large graphs with given edge density
    Boehme, Thomas
    Kostochka, Alexandr
    DISCRETE MATHEMATICS, 2009, 309 (04) : 997 - 1000
  • [28] On Finding Dense Subgraphs
    Khuller, Samir
    Saha, Barna
    AUTOMATA, LANGUAGES AND PROGRAMMING, PT I, 2009, 5555 : 597 - 608
  • [29] Planted Dense Subgraphs in Dense Random Graphs Can Be Recovered using Graph-based Machine Learning
    Levinas, Itay
    Louzoun, Yoram
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 75 : 541 - 568
  • [30] Planted Dense Subgraphs in Dense Random Graphs Can Be Recovered using Graph-based Machine Learning
    Levinas I.
    Louzoun Y.
    Journal of Artificial Intelligence Research, 2022, 75 : 541 - 568