A new efficient approach for mining uncertain frequent patterns using minimum data structure without false positives

Cited by: 58
Authors
Lee, Gangin [1]
Yun, Unil [1]
Affiliation
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
Funding
National Research Foundation, Singapore
Keywords
Correctness; Data mining; Existential probability; Frequent pattern mining; Uncertain pattern; Sequential patterns; Algorithm
DOI
10.1016/j.future.2016.09.007
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The concept of uncertain pattern mining was recently proposed to fulfill the demand for processing databases with uncertain data, and various relevant methods have been devised. However, previous approaches have the following limitations. State-of-the-art methods based on tree structures can cause fatal problems in terms of runtime and memory usage, depending on the characteristics of the uncertain databases and the threshold settings, because their tree data structures can become excessively large and complicated during the mining process. Various approximation approaches have been suggested to overcome such problems; however, these methods increase mining performance at the cost of the accuracy of the mining results. To solve these problems, we propose an exact, efficient algorithm for mining uncertain frequent patterns based on novel data structures and mining techniques, which also guarantees the correctness of the mining results without any false positives. The newly proposed list-based data structures and pruning techniques allow a complete set of uncertain frequent patterns to be mined more efficiently without pattern losses. We also demonstrate that the proposed algorithm outperforms previous state-of-the-art approaches in both theoretical and empirical aspects. In particular, we provide analytical results of performance evaluation for various types of datasets to show the efficiency of our method in terms of runtime, memory usage, and scalability. © 2016 Elsevier B.V. All rights reserved.
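To make the problem setting in the abstract concrete, the following is a minimal sketch of exact uncertain frequent pattern mining under the commonly used expected-support model, in which each item in a transaction carries an existential probability and a pattern's expected support is the sum, over all transactions, of the product of its items' probabilities. This is not the list-based algorithm the paper proposes: it is a plain level-wise (Apriori-style) search swapped in for illustration, and the names `expected_support`, `mine_uncertain_frequent_patterns`, `min_exp_sup`, and the sample database `db` are all hypothetical. Because every reported pattern has its expected support computed exactly, the result contains no false positives, which is the correctness property the abstract emphasizes.

```python
from itertools import combinations
from math import prod

def expected_support(pattern, database):
    """Sum over transactions of the product of the pattern's item probabilities.

    Each transaction is a dict mapping item -> existential probability.
    Transactions that lack any item of the pattern contribute zero.
    """
    return sum(
        prod(tx[item] for item in pattern)
        for tx in database
        if all(item in tx for item in pattern)
    )

def mine_uncertain_frequent_patterns(database, min_exp_sup):
    """Illustrative level-wise search (an assumption, not the paper's method).

    Returns a dict mapping each frequent pattern (a sorted tuple of items)
    to its exact expected support.
    """
    items = sorted({item for tx in database for item in tx})
    frequent = {}
    current = [(item,) for item in items]
    while current:
        # Keep only candidates whose exact expected support meets the threshold.
        survivors = []
        for pattern in current:
            support = expected_support(pattern, database)
            if support >= min_exp_sup:
                frequent[pattern] = support
                survivors.append(pattern)
        # Extend survivors by one item. Probabilities are at most 1, so the
        # expected support can only shrink as a pattern grows; a candidate is
        # kept only if all of its sub-patterns one item shorter survived.
        survivor_set = set(survivors)
        next_level = set()
        for pattern in survivors:
            for item in items:
                if item > pattern[-1]:
                    candidate = pattern + (item,)
                    subsets = combinations(candidate, len(pattern))
                    if all(sub in survivor_set for sub in subsets):
                        next_level.add(candidate)
        current = sorted(next_level)
    return frequent

# Hypothetical toy database: item -> existential probability per transaction.
db = [
    {"a": 0.9, "b": 0.8},
    {"a": 0.5, "c": 0.4},
    {"a": 0.6, "b": 0.5, "c": 0.9},
]

result = mine_uncertain_frequent_patterns(db, min_exp_sup=1.0)
for pattern, support in sorted(result.items()):
    print(pattern, round(support, 2))
```

In this toy database the pattern `("a", "b")` has expected support 0.9 × 0.8 + 0.6 × 0.5 = 1.02 and is kept at a threshold of 1.0, while `("a", "c")` reaches only 0.5 × 0.4 + 0.6 × 0.9 = 0.74 and is pruned. A brute-force search like this rescans the database for every candidate, which is the kind of cost that specialized data structures aim to avoid.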
Pages: 89-110
Number of pages: 22