Fast Algorithms for Frequent Itemset Mining from Uncertain Data

Cited by: 35
Authors
Leung, Carson Kai-Sang [1]
MacKinnon, Richard Kyle [1]
Tanbeer, Syed K. [1]
Affiliations
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
Keywords
Association analysis; data mining algorithms; expected support; frequent patterns; tree structures; uncertain data;
DOI
10.1109/ICDM.2014.146
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The majority of existing data mining algorithms mine frequent itemsets from precise data. A well-known algorithm is FP-growth, which builds a compact FP-tree structure to capture important contents of precise data and mines frequent itemsets from the FP-tree. However, there are situations in which data are uncertain. To capture important contents (e.g., existential probabilities) of uncertain data for mining frequent itemsets, the UF-growth algorithm uses a UF-tree structure. However, the UF-tree can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of looser upper bounds on expected supports. To solve this problem, we propose fast algorithms that use compact tree structures for capturing uncertain data with tightened upper bounds to expected support (tube) for frequent itemset mining from uncertain data. Experimental results show the tightness of tube provided by our algorithms and the compactness of our tree structures.
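As a rough illustration of the quantities the abstract refers to, the sketch below computes the expected support of an itemset over a toy uncertain database, together with one simple upper bound on it. It assumes the common model in which the items of a transaction are independent, so the probability that an itemset occurs in a transaction is the product of its items' existential probabilities. The database, the function names, and the min-probability bound are hypothetical choices made for this example; the bound is not the tube bound proposed in the paper.

```python
from math import prod

# Hypothetical toy database: each transaction maps an item to its
# existential probability (the chance the item is present in it).
transactions = [
    {"a": 0.9, "b": 0.5, "c": 0.2},
    {"a": 0.6, "c": 0.5},
    {"b": 0.8, "c": 0.5},
]


def expected_support(itemset, db):
    """Sum, over transactions containing every item of the itemset, of the
    product of the items' existential probabilities (independence assumed)."""
    total = 0.0
    for t in db:
        if all(item in t for item in itemset):
            total += prod(t[item] for item in itemset)
    return total


def min_prob_upper_bound(itemset, db):
    """Illustrative upper bound (NOT the paper's tube): a product of
    probabilities never exceeds its smallest factor, so summing the minimum
    probability per transaction bounds the expected support from above."""
    total = 0.0
    for t in db:
        if all(item in t for item in itemset):
            total += min(t[item] for item in itemset)
    return total


if __name__ == "__main__":
    x = ("a", "c")
    print(expected_support(x, transactions))
    print(min_prob_upper_bound(x, transactions))
```

An itemset is considered frequent when its expected support reaches a user-chosen minimum support threshold. An upper bound allows a mining algorithm to discard candidates whose bound already falls below that threshold, and the closer the bound is to the true expected support, the fewer false candidates survive for later checking.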
Pages: 893 - 898
Number of pages: 6
Related papers
50 in total
  • [1] Constrained Frequent Itemset Mining from Uncertain Data Streams
    Leung, Carson Kai-Sang
    Hao, Boyu
    Jiang, Fan
    2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 120 - 127
  • [2] Probabilistic frequent itemset mining over uncertain data streams
    Li, Haifeng
    Zhang, Ning
    Zhu, Jianming
    Wang, Yue
    Cao, Huaihu
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 112 : 274 - 287
  • [3] An algorithm for mining constrained maximal frequent itemset in uncertain data
    Du, Haizhou
    Journal of Information and Computational Science, 2012, 9 (15) : 4509 - 4515
  • [4] A Review on Frequent Itemset Mining Algorithms in Social Network Data
    Dharsandiya, Ankit N.
    Patel, Mihir R.
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 1046 - 1048
  • [5] Probabilistic Frequent Itemset Mining in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    Verhein, Florian
    Zuefle, Andreas
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 119 - 127
  • [6] Fast algorithms for frequent itemset mining using FP-trees
    Grahne, G
    Zhu, JF
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) : 1347 - 1362
  • [7] Efficient Probabilistic Frequent Itemset Mining in Big Sparse Uncertain Data
    Xu, Jing
    Li, Ning
    Mao, Xiao-Jiao
    Yang, Yu-Bin
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 235 - 247
  • [8] Data heterogeneity's impact on the performance of frequent itemset mining algorithms
    Trasierras, Antonio Manuel
    Luna, Jose Maria
    Fournier-Viger, Philippe
    Ventura, Sebastian
    INFORMATION SCIENCES, 2024, 678
  • [9] Frequent Itemset Mining in Big Data With Effective Single Scan Algorithms
    Djenouri, Youcef
    Djenouri, Djamel
    Lin, Jerry Chun-Wei
    Belhadi, Asma
    IEEE ACCESS, 2018, 6 : 68013 - 68026
  • [10] Algorithms for frequent itemset mining: a literature review
    Chee, Chin-Hoong
    Jaafar, Jafreezal
    Aziz, Izzatdin Abdul
    Hasan, Mohd Hilmi
    Yeoh, William
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (04) : 2603 - 2621