Fast Algorithms for Frequent Itemset Mining from Uncertain Data

被引:35
|
作者
Leung, Carson Kai-Sang [1 ]
MacKinnon, Richard Kyle [1 ]
Tanbeer, Syed K. [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
关键词
Association analysis; data mining algorithms; expected support; frequent patterns; tree structures; uncertain data;
D O I
10.1109/ICDM.2014.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The majority of existing data mining algorithms mine frequent itemsets from precise data. A well-known algorithm is FP-growth, which builds a compact FP-tree structure to capture important contents of precise data and mines frequent itemsets from the FP-tree. However, there are situations in which data are uncertain. To capture important contents (e.g., existential probabilities) of uncertain data for mining frequent itemsets, the UF-growth algorithm uses a UF-tree structure. However, the UF-tree can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of looser upper bounds on expected supports. To solve this problem, we propose fast algorithms that use compact tree structures for capturing uncertain data with tightened upper bounds to expected support (tube) for frequent itemset mining from uncertain data. Experimental results show the tightness of tube provided by our algorithms and the compactness of our tree structures.
引用
收藏
页码:893 / 898
页数:6
相关论文
共 50 条
  • [31] Mining frequent itemsets from uncertain data
    Chui, Chun-Kit
    Kao, Ben
    Hung, Edward
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 47 - +
  • [32] A data mining proxy approach for efficient frequent itemset mining
    Jeffrey Xu Yu
    Zhiheng Li
    Guimei Liu
    The VLDB Journal, 2008, 17 : 947 - 970
  • [33] A data mining proxy approach for efficient frequent itemset mining
    Yu, Jeffrey Xu
    Li, Zhiheng
    Liu, Guimei
    VLDB JOURNAL, 2008, 17 (04): : 947 - 970
  • [34] An Efficient Outlier Detection Approach Over Uncertain Data Stream Based on Frequent Itemset Mining
    Hao, Shangbo
    Cai, Saihua
    Sun, Ruizhi
    Li, Sicong
    INFORMATION TECHNOLOGY AND CONTROL, 2019, 48 (01): : 34 - 46
  • [35] The research of depth-first algorithms for mining frequent itemset
    Lei, W
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 680 - 683
  • [36] A Performance based Empirical Study of the Frequent Itemset Mining Algorithms
    Sivakumar, Ramah
    Sathiaseelan, J. G. R.
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1627 - 1631
  • [37] A Theoretical Comparison of Two Maximal Frequent Itemset Mining Algorithms
    Li, Haifeng
    PROCEEDINGS OF THE 2016 5TH INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS AND COMPUTER SCIENCE, 2016, 80 : 363 - 366
  • [38] Probabilistic Frequent Itemset Mining Algorithm over Uncertain Databases with Sampling
    Li, Hai-Feng
    Zhang, Ning
    Zhang, Yue-Jin
    Wang, Yue
    FUZZY SYSTEMS AND DATA MINING II, 2016, 293 : 159 - 166
  • [39] An efficient algorithm for frequent itemset mining on data streams
    Xie Zhi-Jun
    Chen Hong
    Li, Cuiping
    ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 474 - 491
  • [40] AT-Mine: An Efficient Algorithm of Frequent Itemset Mining on Uncertain Dataset
    Wang, Le
    Feng, Lin
    Wu, Mingfei
    JOURNAL OF COMPUTERS, 2013, 8 (06) : 1417 - 1426