Efficient mining algorithm of frequent itemsets for uncertain data streams

Cited by: 0
Authors
Wang Qianqian [1, 2]
Liu Fang-ai [1, 2]
Affiliations
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[2] Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
frequent itemsets; uncertain data; data streams; data mining
DOI
10.1109/ISCID.2016.215
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the rapid development of computer technology, web services have come into wide use. In these applications, uncertain data arrives in the form of streams. To address this situation, this paper presents a new generalized data structure, the PSUF-tree, for storing uncertain data streams. All itemsets in the most recent window are held in a global PStree in a condensed format, and a header table is established that contains a dynamic array of expected values, with identical itemsets stored under the same header table entry. Based on the PSUF-tree, the paper presents a new algorithm for mining frequent itemsets, the PSUF-streaming algorithm. Frequent itemsets are mined by traversing the dynamic array, and maintaining the PSUF-tree only requires handling the header table that corresponds to the oldest batch of itemsets in the window. Experimental results show that the PSUF-streaming algorithm has good efficiency and scalability and reduces memory usage to some extent.
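The abstract does not give the PSUF-tree layout, so the following is only a minimal sketch of the general idea it describes: expected values kept per batch in a sliding window, with maintenance touching only the oldest batch. It is not the authors' PSUF-tree or PSUF-streaming algorithm. The class name, the `max_len` cap, and the assumption that item probabilities within a transaction are independent (so the expected support of an itemset in a transaction is the product of its item probabilities) are all illustrative assumptions.

```python
from collections import deque, defaultdict
from itertools import combinations
from math import prod


class SlidingWindowExpectedSupport:
    """Hypothetical helper: expected-support totals over a window of batches."""

    def __init__(self, window_size, max_len=2):
        self.window_size = window_size    # number of batches kept in the window
        self.max_len = max_len            # largest itemset size enumerated (assumption)
        self.batches = deque()            # per-batch expected values, oldest first
        self.totals = defaultdict(float)  # itemset -> expected support in the window

    def add_batch(self, transactions):
        """transactions: list of dicts mapping item -> existential probability."""
        counts = defaultdict(float)
        for t in transactions:
            items = sorted(t)
            for k in range(1, self.max_len + 1):
                for itemset in combinations(items, k):
                    # Assumes independent items: multiply their probabilities.
                    counts[itemset] += prod(t[i] for i in itemset)
        if len(self.batches) == self.window_size:
            self._evict_oldest()
        self.batches.append(counts)
        for itemset, value in counts.items():
            self.totals[itemset] += value

    def _evict_oldest(self):
        # Maintenance only touches the contribution of the oldest batch.
        oldest = self.batches.popleft()
        for itemset, value in oldest.items():
            self.totals[itemset] -= value
            if self.totals[itemset] <= 1e-12:
                del self.totals[itemset]

    def frequent(self, min_esup):
        """Return itemsets whose expected support reaches min_esup."""
        return {s: v for s, v in self.totals.items() if v >= min_esup}
```

A caller would feed one batch at a time with `add_batch` and query `frequent(min_esup)` whenever results are needed. Enumerating all itemsets up to `max_len` is exponential in general, which is the cost that a condensed tree structure such as the one described in the abstract is meant to reduce.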
Pages: 443-446
Number of pages: 4