Effectively and efficiently mining frequent patterns from dense graph streams on disk

被引:29
|
作者
Braun, Peter [2 ]
Cameron, Juan J. [2 ]
Cuzzocrea, Alfredo [1 ]
Jiang, Fan [2 ]
Leung, Carson K. [2 ]
机构
[1] ICAR CNR, Via P Bucci 41C, I-87036 Arcavacata Di Rende, CS, Italy
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Data mining; frequent pattern mining; graph streams; knowledge-based and intelligent information & engineering systems; knowledge discovery; limited memory; stream mining;
D O I
10.1016/j.procs.2014.08.114
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on dense graph streams, which can be generated in various applications ranging from sensor networks to social networks, from bio-informatics to chemical informatics. We also investigate the problem of effectively and efficiently mining frequent patterns from such streaming data, in the targeted case of dealing with limited memory environments so that disk support is required. This setting occurs frequently (e.g., in mobile applications/systems) and is gaining momentum even in advanced computational settings where social networks are the main representative. Inspired by this problem, we propose (i) a specialized data structure called DSMatrix, which captures important data from dense graph streams onto the disk directly and (ii) stream mining algorithms that make use of such structure in order to mine frequent patterns effectively and efficiently. Experimental results clearly confirm the benefits of our approach. (C) 2014 The Authors. Published by Elsevier B.V.
引用
收藏
页码:338 / 347
页数:10
相关论文
共 50 条
  • [1] Can frequent itemset mining be efficiently and effectively used for learning from graph data?
    Karunaratne, Thashmee
    Bostrom, Henrik
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 409 - 414
  • [2] EFFICIENTLY MINING FREQUENT PATTERNS IN RECENT MUSIC QUERY STREAMS
    Li, Hua-Fu
    Hsiao, Ming-Ho
    Chen, Hsuan-Sheng
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1269 - +
  • [3] Efficiently mining frequent patterns from dense datasets using a cluster of computers
    Sucahyo, YG
    Gopalan, RP
    Rudra, A
    [J]. AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 233 - 244
  • [4] EFFICIENTLY MINING RECENT FREQUENT PATTERNS OVER ONLINE TRANSACTIONAL DATA STREAMS
    Chen, Hui
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2009, 19 (05) : 707 - 725
  • [5] Efficient Frequent Itemset Mining from Dense Data Streams
    Cuzzocrea, Alfredo
    Jiang, Fan
    Lee, Wookey
    Leung, Carson K.
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 593 - 601
  • [6] On Dense Pattern Mining in Graph Streams
    Aggarwal, Charu C.
    Li, Yao
    Yu, Philip S.
    Jin, Ruoming
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01): : 975 - 984
  • [7] Efficient mining of constrained frequent patterns from streams
    Leung, Carson Kai-Sang
    Khan, Quamrul I.
    [J]. 10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 61 - 68
  • [8] Approximate mining of frequent patterns on streams
    Silvestri, Claudio
    Orlando, Salvatore
    [J]. INTELLIGENT DATA ANALYSIS, 2007, 11 (01) : 49 - 73
  • [9] Efficiently Mining Maximal Frequent Patterns from Traversals on Weighted Directed Graph Using Statistical Theory
    Geng, Runian
    Dong, Xiangjun
    Liu, Guoling
    Xu, Wenbo
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 586 - +
  • [10] Mining Weighted Frequent Patterns from Uncertain Data Streams
    Ovi, Jesan Ahammed
    Ahmed, Chowdhury Farhan
    Leung, Carson K.
    Pazdor, Adam G. M.
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) 2019, 2019, 935 : 917 - 936