ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data

被引:3
|
作者
Davashi, Razieh [1 ,2 ]
机构
[1] Islamic Azad Univ, Fac Comp Engn, Najafabad Branch, Najafabad, Iran
[2] Islamic Azad Univ, Big Data Res Ctr, Najafabad Branch, Najafabad, Iran
关键词
Data mining; Frequent pattern mining; Uncertain frequent pattern; Uncertain data; Interactive mining; ITEMSETS; TREE; THRESHOLD; SUPPORT;
D O I
10.1016/j.eswa.2022.119156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-K Uncertain Frequent Pattern (UFP) mining is an interesting topic in data mining. The existing TUFP algorithm supports static mining of Top-K UFPs; however, in the real world, users need to repeatedly change the K threshold to extract the information according to the requirements of their application. In interactive environments, the TUFP algorithm needs to re-scan the database and create the UP-Lists and CUP-Lists from scratch which is very time-consuming. In this paper, a fast method called ITUFP is proposed for interactive mining of Top-K UFPs. The proposed method uses a new data structure called IMCUP-List to store information of patterns efficiently. It creates the UP-Lists with a single database scan, extracts the patterns by generating IMCUP-Lists, and stores all the lists. When K changes, the proposed algorithm only updates the IMCUP-Lists without having to create the lists from scratch. Accordingly, ITUFP conforms to the "build once, mine many" principle, where the UP-Lists and IMCUP-Lists are created only once and used in mining with different K values. This is the first study on interactive mining of Top-K UFPs. Extensive experimental results with sparse and dense uncertain data prove that the proposed method is very efficient for interactive mining of Top-K UFPs.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Mining top-k frequent patterns from uncertain databases
    Tuong Le
    Bay Vo
    Van-Nam Huynh
    Ngoc Thanh Nguyen
    Sung Wook Baik
    [J]. Applied Intelligence, 2020, 50 : 1487 - 1497
  • [2] Mining top-k frequent patterns from uncertain databases
    Tuong Le
    Bay Vo
    Van-Nam Huynh
    Ngoc Thanh Nguyen
    Baik, Sung Wook
    [J]. APPLIED INTELLIGENCE, 2020, 50 (05) : 1487 - 1497
  • [3] Interactive mining of top-K frequent closed itemsets from data streams
    Li, Hua-Fu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (07) : 10779 - 10788
  • [4] Mining Top-K Frequent Closed Patterns from Gene Expression Data
    Ji, Shufan
    Wang, Xuejiao
    Zong, Yi
    Gao, Xiaopeng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 732 - 739
  • [5] Mining Top-k Minimal Redundancy Frequent Patterns over Uncertain Databases
    Wang, Haishuai
    Zhang, Peng
    Wu, Jia
    Pan, Shirui
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 111 - 119
  • [6] Mining top-K frequent itemsets from data streams
    Wong, Raymond Chi-Wing
    Fu, Ada Wai-Chee
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2006, 13 (02) : 193 - 217
  • [7] Mining top-K frequent itemsets from data streams
    Raymond Chi-Wing Wong
    Ada Wai-Chee Fu
    [J]. Data Mining and Knowledge Discovery, 2006, 13 : 193 - 217
  • [8] Mining top-k frequent patterns over data streams sliding window
    Chen, Hui
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (01) : 111 - 131
  • [9] Mining top-k frequent patterns over data streams sliding window
    Hui Chen
    [J]. Journal of Intelligent Information Systems, 2014, 42 : 111 - 131
  • [10] ExMiner: An efficient algorithm for mining top-k frequent patterns
    Quang, Tran Minh
    Oyanagi, Shigeru
    Yamazaki, Katsuhiro
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 436 - 447