Fast Algorithms for Frequent Itemset Mining from Uncertain Data

被引:35
|
作者
Leung, Carson Kai-Sang [1 ]
MacKinnon, Richard Kyle [1 ]
Tanbeer, Syed K. [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
关键词
Association analysis; data mining algorithms; expected support; frequent patterns; tree structures; uncertain data;
D O I
10.1109/ICDM.2014.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The majority of existing data mining algorithms mine frequent itemsets from precise data. A well-known algorithm is FP-growth, which builds a compact FP-tree structure to capture important contents of precise data and mines frequent itemsets from the FP-tree. However, there are situations in which data are uncertain. To capture important contents (e.g., existential probabilities) of uncertain data for mining frequent itemsets, the UF-growth algorithm uses a UF-tree structure. However, the UF-tree can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of looser upper bounds on expected supports. To solve this problem, we propose fast algorithms that use compact tree structures for capturing uncertain data with tightened upper bounds to expected support (tube) for frequent itemset mining from uncertain data. Experimental results show the tightness of tube provided by our algorithms and the compactness of our tree structures.
引用
收藏
页码:893 / 898
页数:6
相关论文
共 50 条
  • [41] Weighted Frequent Itemset Mining Using OWA on Uncertain Transactional Database
    Wazir, Samar
    Beg, M. M. Sufyan
    Ahmad, Tanvir
    DATA COMMUNICATION AND NETWORKS, GUCON 2019, 2020, 1049 : 183 - 193
  • [42] Probabilistic maximal frequent itemset mining methods over uncertain databases
    Li, Haifeng
    Hai, Mo
    Zhang, Ning
    Zhu, Jianming
    Wang, Yue
    Cao, Huaihu
    INTELLIGENT DATA ANALYSIS, 2019, 23 (06) : 1219 - 1241
  • [43] Anytime Frequent Itemset Mining of Transactional Data Streams
    Goyal, Poonam
    Challa, Jagat Sesh
    Shrivastava, Shivin
    Goyal, Navneet
    BIG DATA RESEARCH, 2020, 21
  • [44] Novel algorithm for frequent itemset mining in data warehouses
    Xu L.-J.
    Xie K.-L.
    Journal of Zhejiang University-SCIENCE A, 2006, 7 (2): : 216 - 224
  • [45] Parallel Incremental Frequent Itemset Mining for Large Data
    Song, Yu-Geng
    Cui, Hui-Min
    Feng, Xiao-Bing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (02) : 368 - 385
  • [46] A Survey on Closed Frequent Itemset Mining on Data Streams
    Bai, Pavitra . S.
    Kumar, Ravi . G. . K.
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 542 - 547
  • [47] Frequent Itemset Mining in High Dimensional Data: A Review
    Zaki, Fatimah Audah Md
    Zulkurnain, Nurul Fariza
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, 2019, 481 : 325 - 334
  • [48] Approximate Frequent Itemset Mining for Streaming Data on FPGA
    Li, Yubin
    Sun, Yuliang
    Dai, Guohao
    Xu, Qiang
    Wang, Yu
    Yang, Huazhong
    2016 26TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2016,
  • [49] Efficient Incremental Itemset Tree for Approximate Frequent Itemset Mining On Data Stream
    Bai, Pavitra S.
    Kumar, Ravi G. K.
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2016, : 239 - 242
  • [50] A novel algorithm for frequent itemset mining in data warehouses
    徐利军
    谢康林
    Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2006, (02) : 216 - 224