Fast Algorithms for Frequent Itemset Mining from Uncertain Data

被引:35
|
作者
Leung, Carson Kai-Sang [1 ]
MacKinnon, Richard Kyle [1 ]
Tanbeer, Syed K. [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
关键词
Association analysis; data mining algorithms; expected support; frequent patterns; tree structures; uncertain data;
D O I
10.1109/ICDM.2014.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The majority of existing data mining algorithms mine frequent itemsets from precise data. A well-known algorithm is FP-growth, which builds a compact FP-tree structure to capture important contents of precise data and mines frequent itemsets from the FP-tree. However, there are situations in which data are uncertain. To capture important contents (e.g., existential probabilities) of uncertain data for mining frequent itemsets, the UF-growth algorithm uses a UF-tree structure. However, the UF-tree can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of looser upper bounds on expected supports. To solve this problem, we propose fast algorithms that use compact tree structures for capturing uncertain data with tightened upper bounds to expected support (tube) for frequent itemset mining from uncertain data. Experimental results show the tightness of tube provided by our algorithms and the compactness of our tree structures.
引用
下载
收藏
页码:893 / 898
页数:6
相关论文
共 50 条
  • [21] A Review of Frequent Pattern Mining Algorithms for Uncertain Data
    Bhogadhi, Vani
    Chandak, M. B.
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2, 2018, 16 : 974 - 983
  • [22] A Review of the Maximal Frequent Itemset Mining Algorithms over Dynamically Changed Data
    Li, Haifeng
    Proceedings of the 2016 International Symposium on Advances in Electrical, Electronics and Computer Engineering (ISAEECE), 2016, 69 : 346 - 350
  • [23] Peak-jumping frequent itemset mining algorithms
    Dexters, Nele
    Purdom, Paul W.
    Van Gucht, Dirk
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 487 - +
  • [24] An uncertainty-based approach: Frequent itemset mining from uncertain data with different item importance
    Lee, Gangin
    Yun, Unil
    Ryang, Heungmo
    KNOWLEDGE-BASED SYSTEMS, 2015, 90 : 239 - 256
  • [25] Efficient Frequent Itemset Mining from Dense Data Streams
    Cuzzocrea, Alfredo
    Jiang, Fan
    Lee, Wookey
    Leung, Carson K.
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 593 - 601
  • [26] Frequent Weighted Itemset Mining from Gene Expression Data
    Baralis, Elena
    Cagliero, Luca
    Cerquitelli, Tania
    Chiusano, Silvia
    Garza, Paolo
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2013,
  • [27] Fast Mining Algorithm of Frequent Itemset Based on Spark
    Ding J.-M.
    Li H.-B.
    Deng B.
    Jia L.-Y.
    You J.-G.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2446 - 2464
  • [28] Efficient weighted probabilistic frequent itemset mining in uncertain databases
    Li, Zhiyang
    Chen, Fengjuan
    Wu, Junfeng
    Liu, Zhaobin
    Liu, Weijiang
    EXPERT SYSTEMS, 2021, 38 (05)
  • [29] Probabilistic Frequent Pattern Growth for Itemset Mining in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    Verhein, Florian
    Zuefle, Andreas
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 38 - 55
  • [30] Parallel Frequent Itemset Mining on Streaming Data
    He, Yanshan
    Yue, Min
    2014 10TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2014, : 725 - 730