Subjective interestingness of subgraph patterns

Cited by: 24
Authors
van Leeuwen, Matthijs [1, 2]
De Bie, Tijl [3, 4]
Spyropoulou, Eirini [3]
Mesnage, Cedric [3]
Affiliations
[1] Katholieke Univ Leuven, Dept Comp Sci, Machine Learning, Leuven, Belgium
[2] Leiden Univ, Leiden Inst Adv Comp Sci, Leiden, Netherlands
[3] Univ Bristol, Intelligent Syst Lab, Bristol, Avon, England
[4] Univ Ghent, Data Sci Lab, Ghent, Belgium
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK; European Research Council (ERC)
Keywords
Dense subgraph patterns; Community detection; Subjective interestingness; Maximum entropy; Discovery
DOI
10.1007/s10994-015-5539-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The utility of a dense subgraph in gaining a better understanding of a graph has been formalised in numerous ways, each striking a different balance between approximating actual interestingness and computational efficiency. A difficulty in making this trade-off is that, while computational cost of an algorithm is relatively well-defined, a pattern's interestingness is fundamentally subjective. This means that this latter aspect is often treated only informally or neglected, and instead some form of density is used as a proxy. We resolve this difficulty by formalising what makes a dense subgraph pattern interesting to a given user. Unsurprisingly, the resulting measure is dependent on the prior beliefs of the user about the graph. For concreteness, in this paper we consider two cases: one case where the user only has a belief about the overall density of the graph, and another case where the user has prior beliefs about the degrees of the vertices. Furthermore, we illustrate how the resulting interestingness measure is different from previous proposals. We also propose effective exact and approximate algorithms for mining the most interesting dense subgraph according to the proposed measure. Usefully, the proposed interestingness measure and approach lend themselves well to iterative dense subgraph discovery. Contrary to most existing approaches, our method naturally allows subsequently found patterns to be overlapping. The empirical evaluation highlights the properties of the new interestingness measure given different prior belief sets, and our approach's ability to find interesting subgraphs that other methods are unable to find.
Pages: 41-75
Number of pages: 35
Related papers
50 in total
  • [11] Interestingness measure for mining sequential patterns in sports
    Hrovat, Goran
    Fister, Iztok, Jr.
    Yermak, Katsiaryna
    Stiglic, Gregor
    Fister, Iztok
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 29 (05) : 1981 - 1994
  • [12] Bayesian approaches to ranking sequential patterns interestingness
    Vellaisamy, Kuralmani
    Li, Jinyan
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 241 - 250
  • [13] Pruning Based Interestingness of Mined Classification Patterns
    Al-Hegami, Ahmed
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2009, 6 (04) : 336 - 343
  • [14] Maximum entropy models and subjective interestingness: an application to tiles in binary databases
    Tijl De Bie
    Data Mining and Knowledge Discovery, 2011, 23 : 407 - 446
  • [15] Online summarization of dynamic graphs using subjective interestingness for sequential data
    Kapoor, Sarang
    Saxena, Dhish Kumar
    van Leeuwen, Matthijs
    DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (01) : 88 - 126
  • [18] Mining Unexpected Patterns by Decision Trees with Interestingness Measures
    Chiang, Rui-Dong
    Chang, Ming-Yang
    Keh, Huan-Chao
    Chan, Chien-Hui
    2013 FIRST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION (AIMS 2013), 2013 : 117 - 122
  • [19] Measuring interestingness of discovered skewed patterns in data cubes
    Kumar, Navin
    Gangopadhyay, Aryya
    Bapna, Sanjay
    Karabatis, George
    Chen, Zhiyuan
    DECISION SUPPORT SYSTEMS, 2008, 46 (01) : 429 - 439
  • [20] Searching Social Networks for Subgraph Patterns
    Ogaard, Kirk
    Kase, Sue
    Roy, Heather
    Nagi, Rakesh
    Sambhoos, Kedar
    Sudit, Moises
    SENSORS, AND COMMAND, CONTROL, COMMUNICATIONS, AND INTELLIGENCE (C3I) TECHNOLOGIES FOR HOMELAND SECURITY AND HOMELAND DEFENSE XII, 2013, 8711